Xupeng Miao
Email: xupeng@purdue.edu

Xupeng Miao is a Kevin C. and Suzanne L. Kahn New Frontiers Assistant Professor in the Department of Computer Science at Purdue University. Before that, he was a postdoctoral fellow working with Prof. Zhihao Jia and Prof. Tianqi Chen in the Catalyst Group and Parallel Data Lab in the Computer Science Department at Carnegie Mellon University. He received his Ph.D. in computer science from Peking University in June 2022, supervised by Prof. Bin Cui, and his Bachelor’s degree from Northeastern University. He is the creator of Hetu, a highly efficient distributed deep learning system, and continues to lead its development team. He is broadly interested in machine learning systems, data management, and distributed computing.
News
Mar 26, 2025 | Mirage was accepted by OSDI 2025. |
---|---|
Feb 8, 2025 | SpotServe received an IEEE Micro Top Picks Honorable Mention as one of the “most significant research papers in computer architecture based on novelty and potential for long-term impact”! |
Dec 29, 2024 | We have received an NVIDIA Academic Award! |
Nov 27, 2024 | I am honored to serve as the Artifact Evaluation Co-Chair of KDD 2025! |
Nov 27, 2024 | We will launch a tutorial on Efficient Systems and Compilers for Generative AI at ASPLOS 2025 & EuroSys 2025. |
Oct 2, 2024 | Helix and GraphPipe were accepted by ASPLOS 2025. |
Aug 15, 2024 | Our paper on memory-efficient PEFT won the Outstanding Paper Award at ACL 2024! |
Aug 13, 2024 | I am honored to serve as the Artifact Evaluation Co-Chair of MLSys 2025! |
Aug 6, 2024 | One paper on distributed LLM training was accepted by SOSP 2024. |
Jun 18, 2024 | I was awarded the WAIC 2024 Yunfan Award · Bright Stars! |
Selected Publications
- ASPLOS. SpotServe: Serving Generative Large Language Models on Preemptible Instances (Distinguished Artifact Award, IEEE Micro Top Picks Honorable Mention). Proceedings of ASPLOS Conference 2024.
- ASPLOS. SpecInfer: Accelerating Generative Large Language Model Serving with Speculative Inference and Token Tree Verification. Proceedings of ASPLOS Conference 2024.
- VLDB. SDPipe: A Semi-Decentralized Framework for Heterogeneity-aware Pipeline-parallel Training. Proc. VLDB Endow. 2023.
- VLDB. Galvatron: Efficient Transformer Training over Multiple GPUs Using Automatic Parallelism. Proc. VLDB Endow. 2023.
- VLDB. HET: Scaling out Huge Embedding Model Training via Cache-enabled Distributed Framework (Best Scalable Data Science Paper Award). Proc. VLDB Endow. 2022.
- SIGMOD. HET-GMP: A Graph-based System Approach to Scaling Large Embedding Model Training. In Proceedings of SIGMOD Conference 2022.
Teaching
- CS 59200-MLS Machine Learning Systems: Fall 2024, Fall 2025