News
| Date | News |
|---|---|
| Oct 17, 2025 | I have received an Amazon Research Award! |
| Aug 23, 2025 | AdaServe was accepted by EuroSys 2026. |
| Aug 20, 2025 | We have received an NSF POSE Award! |
| Jul 14, 2025 | FlexLLM was accepted by NSDI 2026. |
| Mar 26, 2025 | Mirage was accepted by OSDI 2025. |
| Feb 8, 2025 | SpotServe received an IEEE Micro Top Picks Honorable Mention! |
| Dec 29, 2024 | We have received an NVIDIA Academic Award! |
| Nov 27, 2024 | I am honored to serve as the Artifact Evaluation Co-Chair of KDD 2025! |
| Nov 27, 2024 | We will launch the GenAI Catalyst Tutorial at ASPLOS 2025 & EuroSys 2025. |
| Oct 2, 2024 | Helix and GraphPipe were accepted by ASPLOS 2025. |
Selected Publications
-
NSDI
FlexLLM: Token-Level Co-Serving of LLM Inference and Fine-Tuning with SLO Guarantees
Proceedings of NSDI Conference 2026
-
ASPLOS
SpotServe: Serving Generative Large Language Models on Preemptible Instances (Distinguished Artifact Award, IEEE Micro Top Picks Honorable Mention)
Xupeng Miao, Chunan Shi, Jiangfei Duan, Xiaoli Xi and 3 more authors
Proceedings of ASPLOS Conference 2024
-
ASPLOS
SpecInfer: Accelerating Generative Large Language Model Serving with Speculative Inference and Token Tree Verification
Xupeng Miao, Gabriele Oliaro, Zhihao Zhang, Xinhao Cheng and 10 more authors
Proceedings of ASPLOS Conference 2024
-
NSDI
Parcae: Proactive, Liveput-Optimized DNN Training on Preemptible Instances
Jiangfei Duan, Ziang Song, Xupeng Miao, Xiaoli Xi and 4 more authors
Proceedings of NSDI Conference 2024
-
VLDB
SDPipe: A Semi-Decentralized Framework for Heterogeneity-aware Pipeline-parallel Training
Xupeng Miao, Yining Shi, Zhi Yang, Bin Cui and 1 more author
Proc. VLDB Endow. 2023
-
VLDB
Galvatron: Efficient Transformer Training over Multiple GPUs Using Automatic Parallelism
Xupeng Miao, Yujie Wang, Youhe Jiang, Chunan Shi and 3 more authors
Proc. VLDB Endow. 2023
-
VLDB
HET: Scaling out Huge Embedding Model Training via Cache-enabled Distributed Framework (Best Scalable Data Science Paper Award)
Xupeng Miao, Hailin Zhang, Yining Shi, Xiaonan Nie and 3 more authors
Proc. VLDB Endow. 2022
-
SIGMOD
HET-GMP: A Graph-based System Approach to Scaling Large Embedding Model Training
Xupeng Miao, Yining Shi, Hailin Zhang, Xin Zhang and 3 more authors
Proceedings of SIGMOD Conference 2022
-
SIGMOD
Heterogeneity-Aware Distributed Machine Learning Training via Partial Reduce
Xupeng Miao, Xiaonan Nie, Yingxia Shao, Zhi Yang and 3 more authors
Proceedings of SIGMOD Conference 2021
Teaching
- Purdue University, CS 59200-MLS Machine Learning Systems: Fall 2024, Fall 2025