I am Minhui Xie, a fifth-year Ph.D. student at Tsinghua University, advised by Professors Youyou Lu and Jiwu Shu.
I am a systems researcher. My research focuses on building efficient systems for machine learning at scale using emerging hardware (e.g., persistent memory and modern GPUs).
I am excited about the intersection of machine learning and systems.
- Frugal: Efficient and Economic Embedding Model Training with Commodity GPUs.
Minhui Xie,
Shaoxun Zeng,
Hao Guo,
Shiwei Gao,
Youyou Lu,
The 30th Conference on Architectural Support for Programming Languages and Operating Systems
(ASPLOS'25),
2025
Paper
- Medusa: Accelerating Serverless LLM Inference with Materialization.
Shaoxun Zeng,
Minhui Xie,
Shiwei Gao,
Youmin Chen,
Youyou Lu,
The 30th Conference on Architectural Support for Programming Languages and Operating Systems
(ASPLOS'25),
2025
Paper
- MaxEmbed: Maximizing SSD Bandwidth Utilization for Huge Embedding Models Serving.
Ruwen Fan,
Minhui Xie,
Haodi Jiang,
Youyou Lu,
The 29th Conference on Architectural Support for Programming Languages and Operating Systems
(ASPLOS'24),
2024
Paper
- Challenges and Technical Development of Large Model Training Storage Systems.
Yangyang Feng,
Qing Wang,
Minhui Xie,
Jiwu Shu,
Journal of Computer Research and Development
(计算机研究与发展),
2024
Paper
- PetPS: Supporting Huge Embedding Models with Persistent Memory.
Minhui Xie,
Youyou Lu,
Qing Wang,
Yangyang Feng,
Jiaqiang Liu,
Kai Ren,
Jiwu Shu,
The 49th International Conference on Very Large Data Bases
(VLDB'23),
2023
Paper
Slides
- Citron: Distributed Range Lock Management with One-sided RDMA.
Jian Gao,
Youyou Lu,
Minhui Xie,
Qing Wang,
Jiwu Shu,
The 21st USENIX Conference on File and Storage Technologies
(FAST'23),
2023
Paper
- Patronus: High-Performance and Protective Remote Memory.
Bin Yan,
Youyou Lu,
Qing Wang,
Minhui Xie,
Jiwu Shu,
The 21st USENIX Conference on File and Storage Technologies
(FAST'23),
2023
Paper
Slides
- Mobius: Fine Tuning Large-scale Models on Commodity GPU Servers.
Yangyang Feng,
Minhui Xie,
Zijie Tian,
Shuo Wang,
Youyou Lu,
Jiwu Shu,
The 28th Conference on Architectural Support for Programming Languages and Operating Systems
(ASPLOS'23),
2023
Paper
Slides
- A Recommendation Model Inference System with GPU Direct Storage Access.
Minhui Xie,
Youyou Lu,
Yangyang Feng,
Jiwu Shu,
Journal of Computer Research and Development
(计算机研究与发展),
2024
Paper
- Pacman: An Efficient Compaction Approach for Log-Structured Key-Value Store on Persistent Memory.
Jing Wang,
Youyou Lu,
Qing Wang,
Minhui Xie,
Keji Huang,
Jiwu Shu,
USENIX Annual Technical Conference
(USENIX ATC'22),
2022
Paper
Slides
- Fleche: An Efficient GPU Embedding Cache for Personalized Recommendations.
Minhui Xie,
Youyou Lu,
Jiazhen Lin,
Qing Wang,
Jian Gao,
Kai Ren,
Jiwu Shu,
The 17th European Conference on Computer Systems
(EuroSys'22),
2022
Paper
Slides
- Nap: Persistent Memory Indexes for NUMA Architectures.
Qing Wang,
Youyou Lu,
Junru Li,
Minhui Xie,
Jiwu Shu,
ACM Transactions on Storage
(TOS),
2022
Paper
- Kraken: Memory Efficient Continual Learning for Large-Scale Real-Time Recommendations.
Minhui Xie,
Kai Ren,
Youyou Lu,
Guangxu Yang,
Qingxing Xu,
Bihai Wu,
Jiazhen Lin,
Hongbo Ao,
Wanhong Xu,
Jiwu Shu,
Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
(SC'20),
2020
Paper
Slides