Xupeng Miao

photo_me.jpg

News

Oct 17, 2025 I have received an Amazon Research Award! :confetti_ball:
Aug 23, 2025 AdaServe was accepted by EuroSys 2026. :tada:
Aug 20, 2025 We have received an NSF POSE Award! :confetti_ball:
Jul 14, 2025 FlexLLM was accepted by NSDI 2026. :tada:
Mar 26, 2025 Mirage was accepted by OSDI 2025. :tada:
Feb 8, 2025 SpotServe received an IEEE Micro Top Picks Honorable Mention! :trophy:
Dec 29, 2024 We have received an NVIDIA Academic Award! :confetti_ball:
Nov 27, 2024 I am honored to serve as the Artificat Evaluation Co-Chair of KDD 2025! :mega:
Nov 27, 2024 We will lanuch GenAI Catalyst Tutorial in ASPLOS 2025 & EuroSys 2025. :mega:
Oct 2, 2024 Helix and GraphPipe were accepted by ASPLOS 2025. :tada:

Selected Publications

  1. NSDI

    FlexLLM: Token-Level Co-Serving of LLM Inference and Fine-Tuning with SLO Guarantees

    Proceedings of NSDI Conference 2026

  2. ASPLOS

    SpotServe: Serving Generative Large Language Models on Preemptible Instances (Distinguished Artifact Award), (IEEE Micro Top Picks Honorable Mention)

    Xupeng Miao, Chunan Shi, Jiangfei Duan,  Xiaoli Xi and 3 more authors

    Proceedings of ASPLOS Conference 2024

  3. ASPLOS

    SpecInfer: Accelerating Generative Large Language Model Serving with Speculative Inference and Token Tree Verification

    Xupeng Miao, Gabriele Oliaro, Zhihao Zhang,  Xinhao Cheng and 10 more authors

    Proceedings of ASPLOS Conference 2024

  4. NSDI

    Parcae: Proactive, Liveput-Optimized DNN Training on Preemptible Instances

    Jiangfei Duan1, Ziang Song1Xupeng Miao1,  Xiaoli Xi and 4 more authors

    Proceedings of NSDI Conference 2024

  5. VLDB

    SDPipe: A Semi-Decentralized Framework for Heterogeneity-aware Pipeline-parallel Training

    Xupeng Miao, Yining Shi, Zhi Yang,  Bin Cui and 1 more author

    Proc. VLDB Endow. 2023

  6. VLDB

    Galvatron: Efficient Transformer Training over Multiple GPUs Using Automatic Parallelism

    Xupeng Miao, Yujie Wang, Youhe Jiang,  Chunan Shi and 3 more authors

    Proc. VLDB Endow. 2023

  7. VLDB

    HET: Scaling out Huge Embedding Model Training via Cache-enabled Distributed Framework (Best Scalable Data Science Paper Award)

    Xupeng Miao, Hailin Zhang, Yining Shi,  Xiaonan Nie and 3 more authors

    Proc. VLDB Endow. 2022

  8. SIGMOD

    HET-GMP: A Graph-based System Approach to Scaling Large Embedding Model Training

    Xupeng Miao, Yining Shi, Hailin Zhang,  Xin Zhang and 3 more authors

    In Proceedings of SIGMOD Conference 2022

  9. SIGMOD

    Heterogeneity-Aware Distributed Machine Learning Training via Partial Reduce

    Xupeng Miao, Xiaonan Nie, Yingxia Shao,  Zhi Yang and 3 more authors

    In Proceedings of SIGMOD Conference 2021

Teaching