- Frugal: Efficient and Economic Embedding Model Training with Commodity GPUs.
Minhui Xie,
Shaoxun Zeng,
Hao Guo,
Shiwei Gao,
Youyou Lu,
The 30th Conference on Architectural Support for Programming Languages and Operating Systems
(ASPLOS'25),
2025
PDF
- Medusa: Accelerating Serverless LLM Inference with Materialization.
Shaoxun Zeng,
Minhui Xie,
Shiwei Gao,
Youmin Chen,
Youyou Lu,
The 30th Conference on Architectural Support for Programming Languages and Operating Systems
(ASPLOS'25),
2025
PDF
- Efficiently Enlarging RDMA-Attached Memory with SSD.
Zhe Yang,
Qing Wang,
Xiaojian Liao,
Youyou Lu,
Keji Huang,
Jiwu Shu,
ACM Transactions on Storage
(TOS),
2024
PDF
- Fast Core Scheduling with Userspace Process Abstraction.
Jiazhen Lin,
Youmin Chen,
Shiwei Gao,
Youyou Lu,
The 30th ACM Symposium on Operating Systems Principles
(SOSP'24),
2024
PDF
- A Tale of Two Paths: Toward a Hybrid Data Plane for Efficient Far-Memory Applications.
Lei Chen,
Shi Liu,
Chenxi Wang,
Haoran Ma,
Yifan Qiao,
Zhe Wang,
Chenggang Wu,
Youyou Lu,
Xiaobing Feng,
Huimin Cui,
Shan Lu,
Harry Xu,
18th USENIX Symposium on Operating Systems Design and Implementation
(OSDI'24),
2024
PDF
- Ares-Flash: Efficient Parallel Integer Arithmetic Operations Using NAND Flash Memory.
Jian Chen,
Congming Gao,
Youyou Lu,
Yuhao Zhang,
Jiwu Shu,
57th Annual IEEE/ACM International Symposium on Microarchitecture
(MICRO-57),
2024
PDF
- MaxEmbed: Maximizing SSD Bandwidth Utilization for Huge Embedding Models Serving.
Ruwen Fan,
Minhui Xie,
Haodi Jiang,
Youyou Lu,
The 29th Conference on Architectural Support for Programming Languages and Operating Systems
(ASPLOS'24),
2024
PDF
- Volley: Accelerating Write-Read Orders in Disaggregated Storage.
Shaoxun Zeng,
Xiaojian Liao,
Hao Guo,
Youyou Lu,
The 19th European Conference on Computer Systems
(EuroSys'24),
2024
PDF
- TeRM: Extending RDMA-Attached Memory with SSD.
Zhe Yang,
Qing Wang,
Xiaojian Liao,
Youyou Lu,
Keji Huang,
Jiwu Shu,
The 22nd USENIX Conference on File and Storage Technologies
(FAST'24),
2024
PDF
- Perseid: A Secondary Indexing Mechanism for LSM-based Storage Systems.
Jing Wang,
Youyou Lu,
Qing Wang,
Yuhao Zhang,
Jiwu Shu,
ACM Transactions on Storage
(TOS),
2024
PDF
- 面向高速硬件的高性能文件系统.
Youyou Lu,
Hao Guo,
Shaoxun Zeng,
Yitian Yang,
中国计算机学会通讯
(中国计算机学会通讯),
2024
PDF
- Revisiting Secondary Indexing in LSM-based Storage Systems with Persistent Memory.
Jing Wang,
Youyou Lu,
Qing Wang,
Yuhao Zhang,
Jiwu Shu,
USENIX Annual Technical Conference
(USENIX ATC'23),
2023
PDF
- SingularFS: A Billion-Scale Distributed File System Using a Single Metadata Server.
Hao Guo,
Youyou Lu,
Wenhao Lv,
Xiaojian Liao,
Shaoxun Zeng,
Jiwu Shu,
USENIX Annual Technical Conference
(USENIX ATC'23),
2023
PDF
- PetPS: Supporting Huge Embedding Models with Persistent Memory.
Minhui Xie,
Youyou Lu,
Qing Wang,
Yangyang Feng,
Jiaqiang Liu,
Kai Ren,
Jiwu Shu,
The 49th International Conference on Very Large Data Bases
(VLDB'23),
2023
PDF
- λ-IO: A Unified IO Stack for Computational Storage.
Zhe Yang,
Youyou Lu,
Xiaojian Liao,
Youmin Chen,
Junru Li,
Siyu He,
Jiwu Shu,
The 21st USENIX Conference on File and Storage Technologies
(FAST'23),
2023
PDF
- Citron: Distributed Range Lock Management with One-sided RDMA.
Jian Gao,
Youyou Lu,
Minhui Xie,
Qing Wang,
Jiwu Shu,
The 21st USENIX Conference on File and Storage Technologies
(FAST'23),
2023
PDF
- Patronus: High-Performance and Protective Remote Memory.
Bin Yan,
Youyou Lu,
Qing Wang,
Minhui Xie,
Jiwu Shu,
The 21st USENIX Conference on File and Storage Technologies
(FAST'23),
2023
PDF
- Mobius: Fine Tuning Large-scale Models on Commodity GPU Servers.
Yangyang Feng,
Minhui Xie,
Zijie Tian,
Shuo Wang,
Youyou Lu,
Jiwu Shu,
The 28th Conference on Architectural Support for Programming Languages and Operating Systems
(ASPLOS'23),
2023
PDF
- Replicating Persistent Memory Key-Value Stores with Efficient RDMA Abstraction.
Qing Wang,
Youyou Lu,
Jing Wang,
Jiwu Shu,
The 17th USENIX Symposium on Operating Systems Design and Implementation
(OSDI'23),
2023
PDF
- Building Write-Optimized Tree Indexes on Disaggregated Memory.
Qing Wang,
Youyou Lu,
Jiwu Shu,
SIGMOD Record, March 2023 (Vol. 52, No. 1)
(SIGMOD Record),
2023
PDF
- Efficient Crash Consistency for NVMe over PCIe and RDMA.
Xiaojian Liao,
Youyou Lu,
Zhe Yang,
Jiwu Shu,
ACM Transactions on Storage
(TOS),
2023
PDF
- NICFS: a file system based on persistent memory and SmartNIC.
Yitian Yang,
Youyou Lu,
Frontiers of Information Technology & Electronic Engineering
(FITEE'23),
2023
PDF
- SwitchTx: Scalable In-Network Coordination for Distributed Transaction Processing.
Junru Li,
Youyou Lu,
Yiming Zhang,
Qing Wang,
Zhuo Cheng,
Keji Huang,
Jiwu Shu,
Proceedings of The 48th International Conference on Very Large Data Bases
(VLDB'22),
2022
PDF
- Pacman: An Efficient Compaction Approach for Log-Structured Key-Value Store on Persistent Memory.
Jing Wang,
Youyou Lu,
Qing Wang,
Minhui Xie,
Keji Huang,
Jiwu Shu,
USENIX Annual Technical Conference
(USENIX ATC'22),
2022
PDF
- AlNiCo: SmartNIC-accelerated Contention-aware Request Scheduling for Transaction Processing.
Junru Li,
Youyou Lu,
Qing Wang,
Jiazhen Lin,
Zhe Yang,
Jiwu Shu,
USENIX Annual Technical Conference
(USENIX ATC'22),
2022
PDF
- Fleche: An Efficient GPU Embedding Cache for Personalized Recommendations.
Minhui Xie,
Youyou Lu,
Jiazhen Lin,
Qing Wang,
Jian Gao,
Kai Ren,
Jiwu Shu,
The 17th European Conference on Computer Systems
(EuroSys'22),
2022
PDF
- InfiniFS: An Efficient Metadata Service for Large-Scale Distributed Filesystems.
Wenhao Lv,
Youyou Lu,
Yiming Zhang,
Peile Duan,
Jiwu Shu,
USENIX Conference on File and Storage Technologies
(FAST'22),
2022
PDF
- Sherman: A Write-Optimized Distributed B+Tree Index on Disaggregated Memory.
Qing Wang,
Youyou Lu,
Jiwu Shu,
ACM SIGMOD International Conference on Management of Data
(SIGMOD'22),
2022
PDF
- Nap: Persistent Memory Indexes for NUMA Architectures.
Qing Wang,
Youyou Lu,
Junru Li,
Minhui Xie,
Jiwu Shu,
ACM Transactions on Storage
(TOS),
2022
PDF
- Efficient Atomic Durability on eADR-enabled Persistent Memory.
Taiyu Zhou,
Yajuan Du,
Fan Yang,
Xiaojian Liao,
Youyou Lu,
The 31st International Conference on Parallel Architectures and Compilation Techniques
(PACT'22),
2022
PDF
- Crash Consistent Non-Volatile Memory Express.
Xiaojian Liao,
Youyou Lu,
Zhe Yang,
Jiwu Shu,
The 28th ACM Symposium on Operating Systems Principles
(SOSP'21),
2021
PDF
- ParaBit: Processing Parallel Bitwise Operations in NAND Flash Memory based SSDs.
Congming Gao,
Xin Xin,
Youyou Lu,
Youtao Zhang,
Jun Yang,
Jiwu Shu,
54st Annual IEEE/ACM International Symposium on Microarchitecture
(MICRO'21),
2021
PDF
- Max: A Multicore-Accelerated File System for Flash Storage.
Xiaojian Liao,
Youyou Lu,
Erci Xu,
Jiwu Shu,
USENIX Annual Technical Conference
(USENIX ATC'21),
2021
PDF
- Nap: A Black-Box Approach to NUMA-Aware Persistent Memory Indexes.
Qing Wang,
Youyou Lu,
Junru Li,
Jiwu Shu,
The 15th USENIX Symposium on Operating Systems Design and Implementation
(OSDI'21),
2021
PDF
- Aria: Tolerating Skewed Workloads in Secure In-memory Key-value Stores.
Fan Yang,
Youmin Chen,
Youyou Lu,
Qing Wang,
Jiwu Shu,
37th IEEE International Conference on Data Engineering
(ICDE'21),
2021
PDF
- Scalable Persistent Memory File System with Kernel-Userspace Collaboration.
Youmin Chen,
Youyou Lu,
Bohong Zhu,
Andrea Arpaci-Dusseau,
Remzi Arpaci-Dusseau,
Jiwu Shu,
USENIX Conference on File and Storage Technologies
(FAST'21),
2021
PDF
- Concordia: Distributed Shared Memory with In-Network Cache Coherence.
Qing Wang,
Youyou Lu,
Erci Xu,
Junru Li,
Youmin Chen,
Jiwu Shu,
USENIX Conference on File and Storage Technologies
(FAST'21),
2021
PDF
- Octopus+: an RDMA-enabled Distributed Persistent Memory File System.
Bohong Zhu,
Youmin Chen,
Qing Wang,
Youyou Lu,
Jiwu Shu,
ACM Transactions on Storage
(TOS),
2021
PDF
- Kraken: Memory Efficient Continual Learning for Large-Scale Real-Time Recommendations.
Minhui Xie,
Kai Ren,
Youyou Lu,
Guangxu Yang,
Qingxing Xu,
Bihai Wu,
Jiazhen Lin,
Hongbo Ao,
Wanhong Xu,
Jiwu Shu,
Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
(SC'20),
2020
PDF
- Write Dependency Disentanglement with HORAE.
Xiaojian Liao,
Youyou Lu,
Erci Xu,
Jiwu Shu,
The 14th USENIX Symposium on Operating Systems Design and Implementation
(OSDI'20),
2020
PDF
- μTree: a Persistent B+-Tree with Low Tail Latency.
Youmin Chen,
Youyou Lu,
Kedong Fang,
Qing Wang,
Jiwu Shu,
46th International Conference on Very Large Data Bases
(VLDB'20),
2020
PDF
- Improving the Concurrency Performance of Persistent Memory Transactions on Multicores.
Qing Wang,
Youyou Lu,
Zhongjie Wu,
Fan Yang,
Jiwu Shu,
Design Automation Conference
(DAC'20),
2020
PDF
- CoinPurse: A Device-Assisted File System with Dual Interfaces.
Zhe Yang,
Youyou Lu,
Erci Xu,
Jiwu Shu,
Design Automation Conference
(DAC'20),
2020
PDF
- FlatStore: an Efficient Log-Structured Key-Value Storage Engine for Persistent Memory.
Youmin Chen,
Youyou Lu,
Fan Yang,
Qing Wang,
Yang Wang,
Jiwu Shu,
Proceedings of the Twenty-Fifth International Conference on Architectural Support for Programming Languages and Operating Systems
(ASPLOS'20),
2020
PDF
- TH-DPMS: Design and Implementation of an RDMA-enabled Distributed Persistent Memory Storage System.
Jiwu Shu,
Youmin Chen,
Qing Wang,
Bohong Zhu,
Junru Li,
Youyou Lu,
ACM Transactions on Storage
(TOS),
2020
PDF
- Towards Unaligned Writes Optimization in Cloud Storage with High-performance SSDs.
Jiwu Shu,
Fei Li,
Siyang Li,
Youyou Lu,
IEEE Transactions on Parallel and Distributed Systems
(TPDS),
2020
PDF
- ShieldNVM: An Efficient and Fast Recoverable System for Secure Non-Volatile Memory.
Fan Yang,
Youmin Chen,
Haiyu Mao,
Youyou Lu,
Jiwu Shu,
ACM Transactions on Storage
(TOS),
2020
PDF
- OCVM: Optimizing the Isolation of Virtual Machines with Open-Channel SSDs.
Zhe Liu,
Xiaojian Liao,
Fei Li,
Zhe Yang,
Youyou Lu,
Jiwu Shu,
International Conference on Algorithms and Architectures for Parallel Processing
(ICA3PP'20),
2020
PDF
- Understanding and analysis of B+ trees on NVM towards consistency and efficiency.
Jiangkun Hu,
Youmin Chen,
Youyou Lu,
Xubin He,
Jiwu Shu,
CCF Transactions on High Performance Computing
(CCF-THPC),
2020
PDF
- SineKV: Decoupled Secondary Indexing for LSM-based Key-Value Stores.
Fei Li,
Youyou Lu,
Zhe Yang,
Jiwu Shu,
40th IEEE International Conference on Distributed Computing Systems
(ICDCS'20),
2020
PDF
- NovKV: Efficient Garbage Collection for Key-Value Separated LSM-Stores.
Chen Shen,
Youyou Lu,
Fei Li,
Weidong Liu,
Jiwu Shu,
36th International Conference on Massive Storage Systems and Technology
(MSST'20),
2020
PDF
- DIESEL: A Dataset-Based Distributed Storage and Caching System for Large-Scale Deep Learning Training.
Lipeng Wang,
Songgao Ye,
Baichen Yang,
Youyou Lu,
Hequan Zhang,
Shengen Yan,
Qiong Luo,
49th International Conference on Parallel Processing
(ICPP'20),
2020
PDF
- OCStore: Accelerating Distributed Object Storage with Open-Channel SSDs.
Youyou Lu,
Jiacheng Zhang,
Zhe Yang,
Liyang Pan,
Jiwu Shu,
The 39th IEEE International Conference on Distributed Computing Systems
(ICDCS'19),
2019
PDF
- Cognitive SSD: A Deep Learning Engine for Energy-Efficient Data Retrieval.
Shengwen Liang,
Ying Wang,
Youyou Lu,
Zhe Yang,
Huawei Li,
Xiaowei Li,
USENIX Annual Technical Conference
(USENIX ATC'19),
2019
PDF
- No Compromises: Secure NVM with Crash Consistency, Write-Efficiency and High-Performance.
Fan Yang,
Youyou Lu,
Youmin Chen,
Haiyu Mao,
Jiwu Shu,
Design Automation Conference
(DAC'19),
2019
PDF
- ASCache: An Approximate SSD Cache for Error-Tolerant Applications.
Fei Li,
Youyou Lu,
Zhongjie Wu,
Jiwu Shu,
Design Automation Conference
(DAC'19),
2019
PDF
- Scalable RDMA RPC on Reliable Connection with Efficient Resource Sharing.
Youmin Chen,
Youyou Lu,
Jiwu Shu,
Proceedings of the Fourteenth EuroSys Conference
(EuroSys'19),
2019
PDF
- Mitigating Synchronous I/O Overhead in File Systems on Open-Channel SSDs.
Youyou Lu,
Jiwu Shu,
Jiacheng Zhang,
ACM Transactions on Storage
(TOS),
2019
PDF
- Reducing rename overhead in full-path-indexed file system.
Longhua Wang,
Youyou Lu,
Siyang Li,
Fan Yang,
Jiwu Shu,
Advanced Parallel Processing Technologies, 13th International Symposium
(APPT'19),
2019
PDF
- Exporting Transactional Interface to Applications in Log-Structured File Systems.
Jiacheng Zhang,
Youyou Lu,
Keni Qiu,
Zejun Shi,
Hongsuk Choi,
Jiwu Shu,
IEEE International Conference on Networking, Architecture and Storage
(NAS'18),
2018
PDF
- Empirical Study of Transactional Management for Persistent Memory.
Hongping Shu,
Hongyu Chen,
Hao Liu,
Youyou Lu,
Qingda Hu,
Jiwu Shu,
IEEE 7th Non-Volatile Memory Systems and Applications Symposium
(NVMSA'18),
2018
PDF
- A Flattened Metadata Service for Distributed File Systems.
Siyang Li,
Fenlin Liu,
Jiwu Shu,
Youyou Lu,
Tao Li,
Yang Hu,
IEEE Transactions on Parallel and Distributed Systems
(TPDS),
2018
PDF
- Efficient and Consistent NVMM Cache for SSD-based File System.
Youmin Chen,
Youyou Lu,
Pei Chen,
Jiwu Shu,
IEEE Transactions on Computers
(TC),
2018
PDF
- HiNFS: A Persistent Memory File System with Both Buffering and Direct-Access.
Youmin Chen,
Jiwu Shu,
Jiaxin Ou,
Youyou Lu,
ACM Transactions on Storage
(TOS),
2018
PDF
- Locofs: A loosely-coupled metadata service for distributed file systems.
Siyang Li,
Youyou Lu,
Jiwu Shu,
Yang Hu,
Tao Li,
Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
(SC'17),
2017
PDF
- Efficient storage management for aged file systems on persistent memory.
Kaisheng Zeng,
Youyou Lu,
Hu Wan,
Jiwu Shu,
Proceedings of the Conference on Design, Automation & Test in Europe
(DATE'17),
2017
PDF
- Octopus: an RDMA-enabled Distributed Persistent Memory File System.
Youyou Lu,
Jiwu Shu,
Youmin Chen,
Tao Li,
USENIX Annual Technical Conference
(USENIX ATC'17),
2017
PDF
- FlashKV: Accelerating KV performance with open-channel SSDs.
Jiacheng Zhang,
Youyou Lu,
Jiwu Shu,
Xiongjun Qin,
ACM Transactions on Embedded Computing Systems
(TECS),
2017
PDF
- Empirical study of redo and undo logging in persistent memory.
Hu Wan,
Youyou Lu,
Yuanchao Xu,
Jiwu Shu,
5th Non-Volatile Memory Systems and Applications Symposium
(NVMSA'16),
2016
PDF
- Run-time performance estimation and fairness-oriented scheduling policy for concurrent GPGPU applications.
Qingda Hu,
Jiwu Shu,
Jie Fan,
Youyou Lu,
45th International Conference on Parallel Processing
(ICPP'16),
2016
PDF
- A high performance file system for non-volatile main memory.
Jiaxin Ou,
Jiwu Shu,
Youyou Lu,
Proceedings of the Eleventh European Conference on Computer Systems
(EuroSys'16),
2016
PDF
- ParaFS: A log-structured file system to exploit the internal parallelism of flash devices.
Jiacheng Zhang,
Jiwu Shu,
Youyou Lu,
USENIX Annual Technical Conference
(USENIX ATC'16),
2016
PDF
- Blurred persistence: Efficient transactions in persistent memory.
Youyou Lu,
Jiwu Shu,
Long Sun,
ACM Transactions on Storage
(TOS),
2016
PDF
- Supporting system consistency with differential transactions in flash-based SSDs.
Youyou Lu,
Jiwu Shu,
Jia Guo,
Peng Zhu,
IEEE Transactions on Computers
(TC),
2016
PDF
- Blurred persistence in transactional persistent memory.
Youyou Lu,
Jiwu Shu,
Long Sun,
31st Symposium on Mass Storage Systems and Technologies
(MSST'15),
2015
PDF
- DP 2: reducing transaction overhead with differential and dual persistency in persistent memory.
Long Sun,
Youyou Lu,
Jiwu Shu,
Proceedings of the 12th ACM International Conference on Computing Frontiers
(CF'15),
2015
PDF
- High-performance and lightweight transaction support in flash-based SSDs.
Youyou Lu,
Jiwu Shu,
Jia Guo,
Shuai Li,
Onur Mutlu,
IEEE Transactions on Computers
(TC),
2015
PDF
- Loose-ordering consistency for persistent memory.
Youyou Lu,
Jiwu Shu,
Long Sun,
Onur Mutlu,
IEEE 32nd International Conference on Computer Design
(ICCD'14),
2014
PDF
- ReconFS: A reconstructable file system on flash storage.
Youyou Lu,
Jiwu Shu,
Wei Wang,
Proceedings of the 12th USENIX Conference on File and Storage Technologies
(FAST'14),
2014
PDF
- Design and implementation of an asymmetric block-based parallel file system.
Letian Yi,
Jiwu Shu,
Ying Zhao,
Yinjin Qing,
Youyou Lu,
Weiming Zheng,
IEEE Transactions on Computers
(TC),
2014
PDF
- LightTx: A lightweight transactional design in flash-based SSDs to support flexible transactions.
Youyou Lu,
Jiwu Shu,
Jia Guo,
Shuai Li,
Onur Mutlu,
IEEE 31st International Conference on Computer Design
(ICCD'13),
2013
PDF
- Extending the lifetime of flash-based storage through reducing write amplification from file systems.
Youyou Lu,
Jiwu Shu,
Weimin Zheng,
Proceedings of the 12th USENIX Conference on File and Storage Technologies
(FAST'13),
2013
PDF