代表性论文
会议论文
- Frugal: Efficient and Economic Embedding Model Training with Commodity GPUs.
The 30th Conference on Architectural Support for Programming Languages and Operating Systems
(ASPLOS'25),
2025
Paper
- Medusa: Accelerating Serverless LLM Inference with Materialization.
The 30th Conference on Architectural Support for Programming Languages and Operating Systems
(ASPLOS'25),
2025
Paper
- Achieving Wire-Latency Storage Systems by Exploiting Hardware ACKs.
The 22nd USENIX Symposium on Networked Systems Design and Implementation
(NSDI'25),
2025
Paper
- Fast State Restoration in LLM Serving with HCache.
The 20th European Conference on Computer Systems
(EuroSys'25),
2025
Paper
- Deft: A Scalable Tree Index for Disaggregated Memory.
The 20th European Conference on Computer Systems
(EuroSys'25),
2025
Paper
- Fast Core Scheduling with Userspace Process Abstraction.
The 30th ACM Symposium on Operating Systems Principles
(SOSP'24),
2024
Paper
- Ares-Flash: Efficient Parallel Integer Arithmetic Operations Using NAND Flash Memory.
57th Annual IEEE/ACM International Symposium on Microarchitecture
(MICRO-57),
2024
Paper
- MaxEmbed: Maximizing SSD Bandwidth Utilization for Huge Embedding Models Serving.
The 29th Conference on Architectural Support for Programming Languages and Operating Systems
(ASPLOS'24),
2024
Paper
- Volley: Accelerating Write-Read Orders in Disaggregated Storage.
The 19th European Conference on Computer Systems
(EuroSys'24),
2024
Paper
- Exploring the Asynchrony of Slow Memory Filesystem with EasyIO.
The 19th European Conference on Computer Systems
(EuroSys'24),
2024
Paper
- TeRM: Extending RDMA-Attached Memory with SSD.
The 22nd USENIX Conference on File and Storage Technologies
(FAST'24),
2024
Paper
Code
- Revisiting Secondary Indexing in LSM-based Storage Systems with Persistent Memory.
USENIX Annual Technical Conference
(USENIX ATC'23),
2023
Paper
Code
- SingularFS: A Billion-Scale Distributed File System Using a Single Metadata Server.
USENIX Annual Technical Conference
(USENIX ATC'23),
2023
Paper
- PetPS: Supporting Huge Embedding Models with Persistent Memory.
The 49th International Conference on Very Large Data Bases
(VLDB'23),
2023
Paper
Slides
Code
- λ-IO: A Unified IO Stack for Computational Storage.
The 21st USENIX Conference on File and Storage Technologies
(FAST'23),
2023
Paper
Code
- Citron: Distributed Range Lock Management with One-sided RDMA.
The 21st USENIX Conference on File and Storage Technologies
(FAST'23),
2023
Paper
- Patronus: High-Performance and Protective Remote Memory.
The 21st USENIX Conference on File and Storage Technologies
(FAST'23),
2023
Paper
Slides
Code
- Mobius: Fine Tuning Large-scale Models on Commodity GPU Servers.
The 28th Conference on Architectural Support for Programming Languages and Operating Systems
(ASPLOS'23),
2023
Paper
Slides
- Replicating Persistent Memory Key-Value Stores with Efficient RDMA Abstraction.
The 17th USENIX Symposium on Operating Systems Design and Implementation
(OSDI'23),
2023
Paper
- RIO: Order-Preserving and CPU-Efficient Remote Storage Access.
The 18th European Conference on Computer Systems
(EuroSys'23),
2023
Paper
Slides
- SwitchTx: Scalable In-Network Coordination for Distributed Transaction Processing.
Proceedings of The 48th International Conference on Very Large Data Bases
(VLDB'22),
2022
Paper
Slides
- Pacman: An Efficient Compaction Approach for Log-Structured Key-Value Store on Persistent Memory.
USENIX Annual Technical Conference
(USENIX ATC'22),
2022
Paper
Slides
Code
- AlNiCo: SmartNIC-accelerated Contention-aware Request Scheduling for Transaction Processing.
USENIX Annual Technical Conference
(USENIX ATC'22),
2022
Paper
- Fleche: An Efficient GPU Embedding Cache for Personalized Recommendations.
The 17th European Conference on Computer Systems
(EuroSys'22),
2022
Paper
Slides
- InfiniFS: An Efficient Metadata Service for Large-Scale Distributed Filesystems.
USENIX Conference on File and Storage Technologies
(FAST'22),
2022
Paper
- Plor: General Transactions with Predictable, Low Tail Latency.
ACM SIGMOD International Conference on Management of Data
(SIGMOD'22),
2022
Paper
- Sherman: A Write-Optimized Distributed B+Tree Index on Disaggregated Memory.
ACM SIGMOD International Conference on Management of Data
(SIGMOD'22),
2022
Paper
Slides
Code
- Crash Consistent Non-Volatile Memory Express.
The 28th ACM Symposium on Operating Systems Principles
(SOSP'21),
2021
Paper
Slides
Code
- ParaBit: Processing Parallel Bitwise Operations in NAND Flash Memory based SSDs.
54st Annual IEEE/ACM International Symposium on Microarchitecture
(MICRO'21),
2021
Paper
- Max: A Multicore-Accelerated File System for Flash Storage.
USENIX Annual Technical Conference
(USENIX ATC'21),
2021
Paper
Slides
Code
- Nap: A Black-Box Approach to NUMA-Aware Persistent Memory Indexes.
The 15th USENIX Symposium on Operating Systems Design and Implementation
(OSDI'21),
2021
Paper
Code
- Aria: Tolerating Skewed Workloads in Secure In-memory Key-value Stores.
37th IEEE International Conference on Data Engineering
(ICDE'21),
2021
Paper
- Scalable Persistent Memory File System with Kernel-Userspace Collaboration.
USENIX Conference on File and Storage Technologies
(FAST'21),
2021
Paper
- Concordia: Distributed Shared Memory with In-Network Cache Coherence.
USENIX Conference on File and Storage Technologies
(FAST'21),
2021
Paper
- Kraken: Memory Efficient Continual Learning for Large-Scale Real-Time Recommendations.
Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
(SC'20),
2020
Paper
Slides
Code
- Write Dependency Disentanglement with HORAE.
The 14th USENIX Symposium on Operating Systems Design and Implementation
(OSDI'20),
2020
Paper
Slides
- μTree: a Persistent B+-Tree with Low Tail Latency.
46th International Conference on Very Large Data Bases
(VLDB'20),
2020
Paper
Slides
- Improving the Concurrency Performance of Persistent Memory Transactions on Multicores.
Design Automation Conference
(DAC'20),
2020
Paper
- CoinPurse: A Device-Assisted File System with Dual Interfaces.
Design Automation Conference
(DAC'20),
2020
Paper
- FlatStore: an Efficient Log-Structured Key-Value Storage Engine for Persistent Memory.
Proceedings of the Twenty-Fifth International Conference on Architectural Support for Programming Languages and Operating Systems
(ASPLOS'20),
2020
Paper
Slides
- No Compromises: Secure NVM with Crash Consistency, Write-Efficiency and High-Performance.
Design Automation Conference
(DAC'19),
2019
Paper
- ASCache: An Approximate SSD Cache for Error-Tolerant Applications.
Design Automation Conference
(DAC'19),
2019
Paper
- Scalable RDMA RPC on Reliable Connection with Efficient Resource Sharing.
Proceedings of the Fourteenth EuroSys Conference
(EuroSys'19),
2019
Paper
Slides
- LerGAN: A Zero-Free, Low Data Movement and PIM-Based GAN Architecture.
51st Annual IEEE/ACM International Symposium on Microarchitecture
(MICRO'18),
2018
Paper
- Locofs: A loosely-coupled metadata service for distributed file systems.
Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
(SC'17),
2017
Paper
- Log-structured non-volatile main memory.
Qingda Hu ,
Jinglei Ren ,
Anirudh Badam ,
Jiwu Shu,
Thomas Moscibroda
USENIX Annual Technical Conference
(USENIX ATC'17),
2017
Paper
- Octopus: an RDMA-enabled Distributed Persistent Memory File System.
USENIX Annual Technical Conference
(USENIX ATC'17),
2017
Paper
- A high performance file system for non-volatile main memory.
Proceedings of the Eleventh European Conference on Computer Systems
(EuroSys'16),
2016
Paper
- ParaFS: A log-structured file system to exploit the internal parallelism of flash devices.
USENIX Annual Technical Conference
(USENIX ATC'16),
2016
Paper
- Blurred persistence in transactional persistent memory.
31st Symposium on Mass Storage Systems and Technologies
(MSST'15),
2015
Paper
- Loose-ordering consistency for persistent memory.
IEEE 32nd International Conference on Computer Design
(ICCD'14),
2014
Paper
- ReconFS: A reconstructable file system on flash storage.
Proceedings of the 12th USENIX Conference on File and Storage Technologies
(FAST'14),
2014
Paper
- Aegis: Partitioning data block for efficient recovery of stuck-at-faults in phase change memory.
Jie Fan ,
Song Jiang ,
Jiwu Shu,
Youhui Zhang ,
Weimin Zhen
Proceedings of the 46th Annual IEEE/ACM International Symposium on Microarchitecture
(MICRO'13),
2013
Paper
- Extending the lifetime of flash-based storage through reducing write amplification from file systems.
Proceedings of the 12th USENIX Conference on File and Storage Technologies
(FAST'13),
2013
Paper
期刊论文
- Efficiently Enlarging RDMA-Attached Memory with SSD .
ACM Transactions on Storage
(TOS),
2024
Paper
- Perseid: A Secondary Indexing Mechanism for LSM-based Storage Systems .
ACM Transactions on Storage
(TOS),
2024
Paper
- Building Write-Optimized Tree Indexes on Disaggregated Memory .
SIGMOD Record, March 2023 (Vol. 52, No. 1)
(SIGMOD Record),
2023
Paper
- TH-iSSD: Design and Implementation of a Generic and Reconfigurable Near-Data Processing Framework .
Transactions on Embedded Computing Systems
(ACM TECS),
2023
Paper
- Efficient Crash Consistency for NVMe over PCIe and RDMA .
ACM Transactions on Storage
(TOS),
2023
Paper
- Nap: Persistent Memory Indexes for NUMA Architectures .
ACM Transactions on Storage
(TOS),
2022
Paper
- Reprogramming 3D TLC Flash Memory based Solid State Drives .
ACM Transactions on Storage
(TOS),
2022
Paper
- Octopus+: an RDMA-enabled Distributed Persistent Memory File System .
ACM Transactions on Storage
(TOS),
2021
Paper
- LrGAN: A Compact and Energy Efficient PIM-based Architecture for GAN Training .
IEEE Transactions on Computers
(TC),
2021
Paper
- TH-DPMS: Design and Implementation of an RDMA-enabled Distributed Persistent Memory Storage System .
ACM Transactions on Storage
(TOS),
2020
Paper
- Towards Unaligned Writes Optimization in Cloud Storage with High-performance SSDs .
IEEE Transactions on Parallel and Distributed Systems
(TPDS),
2020
Paper
- ShieldNVM: An Efficient and Fast Recoverable System for Secure Non-Volatile Memory .
ACM Transactions on Storage
(TOS),
2020
Paper
- Cross-Rack-Aware Single Failure Recovery for Clustered File Systems .
IEEE Transactions on Dependable and Secure Computing
(TDSC),
2020
Paper
- Mitigating Synchronous I/O Overhead in File Systems on Open-Channel SSDs .
ACM Transactions on Storage
(TOS),
2019
Paper
- Correlation-Aware Stripe Organization for Efficient Writes in Erasure-Coded Storage: Algorithms and Evaluation .
IEEE Transactions on Parallel and Distributed Systems
(TPDS),
2019
Paper
- A Flattened Metadata Service for Distributed File Systems .
IEEE Transactions on Parallel and Distributed Systems
(TPDS),
2018
Paper
- Efficient and Consistent NVMM Cache for SSD-based File System .
IEEE Transactions on Computers
(TC),
2018
Paper
- HiNFS: A Persistent Memory File System with Both Buffering and Direct-Access .
ACM Transactions on Storage
(TOS),
2018
Paper
- Encoding-Aware Data Placement for Efficient Degraded Reads in XOR-Coded Storage Systems: Algorithms and Evaluation .
IEEE Transactions on Parallel and Distributed Systems
(TPDS),
2018
Paper
- FlashKV: Accelerating KV performance with open-channel SSDs .
ACM Transactions on Embedded Computing Systems
(TECS),
2017
Paper
- Seek-efficient i/o optimization in single failure recovery for xor-coded storage systems .
IEEE Transactions on Parallel and Distributed Systems
(TPDS),
2017
Paper
- Short code: An efficient RAID-6 MDS code for optimizing degraded reads and partial stripe writes .
IEEE Transactions on Computers
(TC),
2017
Paper
- Parity-switched data placement: Optimizing partial stripe writes in xor-coded storage systems .
IEEE Transactions on Parallel and Distributed Systems
(TPDS),
2016
Paper
- Hv code: An all-around mds code for raid-6 storage systems .
IEEE Transactions on Parallel and Distributed Systems
(TPDS),
2016
Paper
- Reconsidering single disk failure recovery for erasure coded storage systems: Optimizing load balancing in stack-level .
IEEE Transactions on Parallel and Distributed Systems
(TPDS),
2016
Paper
- Blurred persistence: Efficient transactions in persistent memory .
ACM Transactions on Storage
(TOS),
2016
Paper
- Supporting system consistency with differential transactions in flash-based SSDs .
IEEE Transactions on Computers
(TC),
2016
Paper
- Caco: An efficient cauchy coding approach for cloud storage systems .
IEEE Transactions on Computers
(TC),
2016
Paper
- High-performance and lightweight transaction support in flash-based SSDs .
IEEE Transactions on Computers
(TC),
2015
Paper
- Redistribute Data to Regain Load Balance during RAID-4 Scaling .
IEEE Transactions on Parallel and Distributed Systems
(TPDS),
2015
Paper
- Design and implementation of an asymmetric block-based parallel file system .
IEEE Transactions on Computers
(TC),
2014
Paper
- Generalized X-code: An efficient RAID-6 code for arbitrary size of disk array .
ACM Transactions on Storage
(TOS),
2012
Paper
- Preventing Silent Data Corruptions from Propagating During Data Reconstruction .
IEEE Transactions on Computers
(TC),
2010
Paper
- SOPA: Selecting the Optimal Policy Adaptively for a cache system .
ACM Transactions on Storage
(TOS),
2010
Paper
- DACO: A High Performance Disk Architecture Designed Specially for Large Scale Erasure Coded Storage Systems .
IEEE Transactions on Computers
(TC),
2010
Paper
- ALV: A New Data Redistribution Approach to RAID-5 Scaling .
IEEE Transactions on Computers
(TC),
2010
Paper
- GRID Codes: Strip-based Erasure Codes with High Fault Tolerance for Storage Systems .
ACM Transactions on Storage
(TOS),
2009
Paper
- SLAS: An Efficient Approach to Scaling Round-robin Striped Volumes .
ACM Transactions on Storage
(TOS),
2007
Paper
- Design and Implementation of an Out-of-Band Virtualization System for Large SANs .
IEEE Transactions on Computers
(TC),
2007
Paper
- Design and Implementation of a SAN System Based on the Fiber Channel Protocol .
IEEE Transactions on Computers
(TC),
2005
Paper
- A Parallel Transient Stability Simulation for Power System .
IEEE Transactions on Power Systems
(TOPS),
2005
Paper
部分中文综述论文
- 大模型训练中的存储挑战与技术发展.
计算机研究与发展,
,
2024
Paper
- 面向高速硬件的高性能文件系统.
中国计算机学会通讯,
20(1): 56-61,
2024
Paper
- 基于GPU直访存储架构的推荐模型预估系统.
计算机研究与发展,
,
2024
Paper
- 分离式数据中心的存储系统研究进展.
中国科学: 信息科学,
ISSN 1674-7267,
2023
Paper
- 在网存储系统研究综述.
计算机研究与发展,
10.7544/issn1000-1239.202220865,
2023
Paper
- 新型存算分离架构技术展望.
中国计算机学会通讯,
18(11): 53-60,
2022
Paper
- 存内计算研究进展.
中国科学:信息科学,
51(2):173-205,
2021
Paper
- 非易失主存的系统软件研究进展.
中国科学:信息科学,
51(6): 869-899,
2021
Paper
- 安全持久性内存存储研究综述.
计算机研究与发展,
57(5): 912-927,
2020
Paper
- 基于RDMA的分布式存储系统研究综述.
计算机研究与发展,
55(2): 227-239,
2019
Paper
- 持久性内存: 从系统软件的角度.
中国计算机学会通讯,
15(1):15-20,
2019
Paper
- 存储虚拟化研究综述.
中国计算机学会通讯,
13(6):14-24,
2017
Paper
- 基于非易失性存储器的存储系统技术研究进展.
科技导报,
34(14): 86-94,
2016
Paper
- 可搜索加密机制研究与进展.
软件学报,
25(4): 880-895,
2014
Paper
- 闪存存储系统综述.
计算机研究与发展,
50(1): 49-59,
2013
Paper
- 安全云存储系统与关键技术综述.
计算机研究与发展,
50(1): 136-145,
2013
Paper
- 重复数据删除技术研究综述.
软件学报,
21(5): 916-929,
2010
Paper
Full Publications