代表性论文

全部论文列表


会议论文

  • Cost-efficient Archive Cloud Storage with Tape: Design and Deployment.
    Qing Wang, Fan Yang, Qiang Liu , Geng Xiao , Yongpeng Chen , Hao Lan, Leiming Chen , Bangzhu Chen , Chenrui Liu , Pingchang Bai , Bin Huang , Zigan Luo , Mingyu Xie , Yu Wang , Youyou Lu, Huatao Wu , Jiwu Shu
    The 24th USENIX Conference on File and Storage Technologies (FAST'26), 2026
    Paper
  • OdinANN: Direct Insert for Consistently Stable Performance in Billion-Scale Graph-Based Vector Search.
    The 24th USENIX Conference on File and Storage Technologies (FAST'26), 2026
    Paper
  • Efficient Multi-LLM Serving with Workload Weaving.
    USENIX Annual Technical Conference (USENIX ATC'25), 2025
    Paper
  • GPreempt: GPU Preemptive Scheduling Made General and Efficient.
    USENIX Annual Technical Conference (USENIX ATC'25), 2025
    Paper
  • Stripeless Data Placement for Erasure-Coded In-Memory Storage.
    The 19th USENIX Symposium on Operating Systems Design and Implementation (OSDI'25), 2025
    Paper
  • Achieving Low-Latency Graph-Based Vector Search via Aligning Best-First Search Algorithm with SSD.
    The 19th USENIX Symposium on Operating Systems Design and Implementation (OSDI'25), 2025
    Paper Code
  • ShiftLock: Mitigate One-sided RDMA Lock Contention via Handover.
    The 23rd USENIX Conference on File and Storage Technologies (FAST'25), 2025
    Paper
  • Frugal: Efficient and Economic Embedding Model Training with Commodity GPUs.
    The 30th Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'25), 2025
    Paper
  • Medusa: Accelerating Serverless LLM Inference with Materialization.
    The 30th Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'25), 2025
    Paper
  • Achieving Wire-Latency Storage Systems by Exploiting Hardware ACKs.
    The 22nd USENIX Symposium on Networked Systems Design and Implementation (NSDI'25), 2025
    Paper
  • Fast State Restoration in LLM Serving with HCache.
    The 20th European Conference on Computer Systems (EuroSys'25), 2025
    Paper
  • Deft: A Scalable Tree Index for Disaggregated Memory.
    The 20th European Conference on Computer Systems (EuroSys'25), 2025
    Paper
  • Designing an Efficient Tree Index on Disaggregated Memory.
    Communications of the ACM (CACM), 2025
    Paper
  • Fast Core Scheduling with Userspace Process Abstraction.
    The 30th ACM Symposium on Operating Systems Principles (SOSP'24), 2024
    Paper
  • A Tale of Two Paths: Toward a Hybrid Data Plane for Efficient Far-Memory Applications.
    Lei Chen , Shi Liu , Chenxi Wang , Haoran Ma , Yifan Qiao , Zhe Wang , Chenggang Wu , Youyou Lu, Xiaobing Feng , Huimin Cui , Shan Lu , Harry Xu
    18th USENIX Symposium on Operating Systems Design and Implementation (OSDI'24), 2024
    Paper
  • Ares-Flash: Efficient Parallel Integer Arithmetic Operations Using NAND Flash Memory.
    57th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-57), 2024
    Paper
  • MaxEmbed: Maximizing SSD Bandwidth Utilization for Huge Embedding Models Serving.
    The 29th Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'24), 2024
    Paper
  • Volley: Accelerating Write-Read Orders in Disaggregated Storage.
    The 19th European Conference on Computer Systems (EuroSys'24), 2024
    Paper
  • Exploring the Asynchrony of Slow Memory Filesystem with EasyIO.
    Bohong Zhu , Youmin Chen, Jiwu Shu
    The 19th European Conference on Computer Systems (EuroSys'24), 2024
    Paper
  • TeRM: Extending RDMA-Attached Memory with SSD.
    The 22nd USENIX Conference on File and Storage Technologies (FAST'24), 2024
    Paper Code
  • Revisiting Secondary Indexing in LSM-based Storage Systems with Persistent Memory.
    USENIX Annual Technical Conference (USENIX ATC'23), 2023
    Paper Code
  • SingularFS: A Billion-Scale Distributed File System Using a Single Metadata Server.
    USENIX Annual Technical Conference (USENIX ATC'23), 2023
    Paper
  • PetPS: Supporting Huge Embedding Models with Persistent Memory.
    Minhui Xie, Youyou Lu, Qing Wang, Yangyang Feng, Jiaqiang Liu , Kai Ren , Jiwu Shu
    The 49th International Conference on Very Large Data Bases (VLDB'23), 2023
    Paper Slides Code
  • λ-IO: A Unified IO Stack for Computational Storage.
    The 21st USENIX Conference on File and Storage Technologies (FAST'23), 2023
    Paper Code
  • Citron: Distributed Range Lock Management with One-sided RDMA.
    The 21st USENIX Conference on File and Storage Technologies (FAST'23), 2023
    Paper
  • Patronus: High-Performance and Protective Remote Memory.
    The 21st USENIX Conference on File and Storage Technologies (FAST'23), 2023
    Paper Slides Code
  • Mobius: Fine Tuning Large-scale Models on Commodity GPU Servers.
    Yangyang Feng, Minhui Xie, Zijie Tian , Shuo Wang , Youyou Lu, Jiwu Shu
    The 28th Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'23), 2023
    Paper Slides
  • Replicating Persistent Memory Key-Value Stores with Efficient RDMA Abstraction.
    The 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI'23), 2023
    Paper
  • RIO: Order-Preserving and CPU-Efficient Remote Storage Access.
    The 18th European Conference on Computer Systems (EuroSys'23), 2023
    Paper Slides
  • SwitchTx: Scalable In-Network Coordination for Distributed Transaction Processing.
    Junru Li, Youyou Lu, Yiming Zhang , Qing Wang, Zhuo Cheng, Keji Huang , Jiwu Shu
    Proceedings of The 48th International Conference on Very Large Data Bases (VLDB'22), 2022
    Paper Slides
  • Pacman: An Efficient Compaction Approach for Log-Structured Key-Value Store on Persistent Memory.
    USENIX Annual Technical Conference (USENIX ATC'22), 2022
    Paper Slides Code
  • AlNiCo: SmartNIC-accelerated Contention-aware Request Scheduling for Transaction Processing.
    USENIX Annual Technical Conference (USENIX ATC'22), 2022
    Paper
  • Fleche: An Efficient GPU Embedding Cache for Personalized Recommendations.
    The 17th European Conference on Computer Systems (EuroSys'22), 2022
    Paper Slides
  • InfiniFS: An Efficient Metadata Service for Large-Scale Distributed Filesystems.
    Wenhao Lv, Youyou Lu, Yiming Zhang , Peile Duan , Jiwu Shu
    USENIX Conference on File and Storage Technologies (FAST'22), 2022
    Paper
  • Plor: General Transactions with Predictable, Low Tail Latency.
    ACM SIGMOD International Conference on Management of Data (SIGMOD'22), 2022
    Paper
  • Sherman: A Write-Optimized Distributed B+Tree Index on Disaggregated Memory.
    ACM SIGMOD International Conference on Management of Data (SIGMOD'22), 2022
    Paper Slides Code
  • Crash Consistent Non-Volatile Memory Express.
    The 28th ACM Symposium on Operating Systems Principles (SOSP'21), 2021
    Paper Slides Code
  • ParaBit: Processing Parallel Bitwise Operations in NAND Flash Memory based SSDs.
    Congming Gao, Xin Xin , Youyou Lu, Youtao Zhang , Jun Yang , Jiwu Shu
    54st Annual IEEE/ACM International Symposium on Microarchitecture (MICRO'21), 2021
    Paper
  • Max: A Multicore-Accelerated File System for Flash Storage.
    USENIX Annual Technical Conference (USENIX ATC'21), 2021
    Paper Slides Code
  • Nap: A Black-Box Approach to NUMA-Aware Persistent Memory Indexes.
    The 15th USENIX Symposium on Operating Systems Design and Implementation (OSDI'21), 2021
    Paper Code
  • Aria: Tolerating Skewed Workloads in Secure In-memory Key-value Stores.
    37th IEEE International Conference on Data Engineering (ICDE'21), 2021
    Paper
  • Scalable Persistent Memory File System with Kernel-Userspace Collaboration.
    USENIX Conference on File and Storage Technologies (FAST'21), 2021
    Paper
  • Concordia: Distributed Shared Memory with In-Network Cache Coherence.
    USENIX Conference on File and Storage Technologies (FAST'21), 2021
    Paper
  • Kraken: Memory Efficient Continual Learning for Large-Scale Real-Time Recommendations.
    Minhui Xie, Kai Ren , Youyou Lu, Guangxu Yang , Qingxing Xu , Bihai Wu , Jiazhen Lin, Hongbo Ao , Wanhong Xu , Jiwu Shu
    Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC'20), 2020
    Paper Slides Code
  • Write Dependency Disentanglement with HORAE.
    The 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI'20), 2020
    Paper Slides
  • μTree: a Persistent B+-Tree with Low Tail Latency.
    46th International Conference on Very Large Data Bases (VLDB'20), 2020
    Paper Slides
  • Improving the Concurrency Performance of Persistent Memory Transactions on Multicores.
    Design Automation Conference (DAC'20), 2020
    Paper
  • CoinPurse: A Device-Assisted File System with Dual Interfaces.
    Design Automation Conference (DAC'20), 2020
    Paper
  • FlatStore: an Efficient Log-Structured Key-Value Storage Engine for Persistent Memory.
    Proceedings of the Twenty-Fifth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'20), 2020
    Paper Slides
  • No Compromises: Secure NVM with Crash Consistency, Write-Efficiency and High-Performance.
    Design Automation Conference (DAC'19), 2019
    Paper
  • ASCache: An Approximate SSD Cache for Error-Tolerant Applications.
    Fei Li , Youyou Lu, Zhongjie Wu , Jiwu Shu
    Design Automation Conference (DAC'19), 2019
    Paper
  • Scalable RDMA RPC on Reliable Connection with Efficient Resource Sharing.
    Proceedings of the Fourteenth EuroSys Conference (EuroSys'19), 2019
    Paper Slides
  • LerGAN: A Zero-Free, Low Data Movement and PIM-Based GAN Architecture.
    Haiyu Mao, Mingcong Song , Tao Li , Yuting Dai , Jiwu Shu
    51st Annual IEEE/ACM International Symposium on Microarchitecture (MICRO'18), 2018
    Paper
  • Locofs: A loosely-coupled metadata service for distributed file systems.
    Siyang Li , Youyou Lu, Jiwu Shu, Yang Hu , Tao Li
    Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC'17), 2017
    Paper
  • Log-structured non-volatile main memory.
    Qingda Hu , Jinglei Ren , Anirudh Badam , Jiwu Shu, Thomas Moscibroda
    USENIX Annual Technical Conference (USENIX ATC'17), 2017
    Paper
  • Octopus: an RDMA-enabled Distributed Persistent Memory File System.
    USENIX Annual Technical Conference (USENIX ATC'17), 2017
    Paper
  • A high performance file system for non-volatile main memory.
    Jiaxin Ou , Jiwu Shu, Youyou Lu
    Proceedings of the Eleventh European Conference on Computer Systems (EuroSys'16), 2016
    Paper
  • ParaFS: A log-structured file system to exploit the internal parallelism of flash devices.
    Jiacheng Zhang , Jiwu Shu, Youyou Lu
    USENIX Annual Technical Conference (USENIX ATC'16), 2016
    Paper
  • Blurred persistence in transactional persistent memory.
    Youyou Lu, Jiwu Shu, Long Sun
    31st Symposium on Mass Storage Systems and Technologies (MSST'15), 2015
    Paper
  • Loose-ordering consistency for persistent memory.
    Youyou Lu, Jiwu Shu, Long Sun , Onur Mutlu
    IEEE 32nd International Conference on Computer Design (ICCD'14), 2014
    Paper
  • ReconFS: A reconstructable file system on flash storage.
    Youyou Lu, Jiwu Shu, Wei Wang
    Proceedings of the 12th USENIX Conference on File and Storage Technologies (FAST'14), 2014
    Paper
  • Aegis: Partitioning data block for efficient recovery of stuck-at-faults in phase change memory.
    Jie Fan , Song Jiang , Jiwu Shu, Youhui Zhang , Weimin Zhen
    Proceedings of the 46th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO'13), 2013
    Paper
  • Extending the lifetime of flash-based storage through reducing write amplification from file systems.
    Youyou Lu, Jiwu Shu, Weimin Zheng
    Proceedings of the 12th USENIX Conference on File and Storage Technologies (FAST'13), 2013
    Paper



期刊论文

  • Efficiently Enlarging RDMA-Attached Memory with SSD .
    ACM Transactions on Storage (TOS), 2025
    Paper
  • Perseid: A Secondary Indexing Mechanism for LSM-based Storage Systems .
    ACM Transactions on Storage (TOS), 2024
    Paper
  • Building Write-Optimized Tree Indexes on Disaggregated Memory .
    SIGMOD Record, March 2023 (Vol. 52, No. 1) (SIGMOD Record), 2023
    Paper
  • TH-iSSD: Design and Implementation of a Generic and Reconfigurable Near-Data Processing Framework .
    Jiwu Shu, Kedong Fang , Youmin Chen, Shuo Wang
    Transactions on Embedded Computing Systems (ACM TECS), 2023
    Paper
  • Efficient Crash Consistency for NVMe over PCIe and RDMA .
    ACM Transactions on Storage (TOS), 2023
    Paper
  • Nap: Persistent Memory Indexes for NUMA Architectures .
    ACM Transactions on Storage (TOS), 2022
    Paper
  • Reprogramming 3D TLC Flash Memory based Solid State Drives .
    Congming Gao, Min Ye , Chun Jason Xue , Youtao Zhang , Liang Shi , Jiwu Shu, Jun Yang
    ACM Transactions on Storage (TOS), 2022
    Paper
  • Octopus+: an RDMA-enabled Distributed Persistent Memory File System .
    ACM Transactions on Storage (TOS), 2021
    Paper
  • LrGAN: A Compact and Energy Efficient PIM-based Architecture for GAN Training .
    Haiyu Mao, Jiwu Shu, Mingcong Song , Tao Li
    IEEE Transactions on Computers (TC), 2021
    Paper
  • TH-DPMS: Design and Implementation of an RDMA-enabled Distributed Persistent Memory Storage System .
    ACM Transactions on Storage (TOS), 2020
    Paper
  • Towards Unaligned Writes Optimization in Cloud Storage with High-performance SSDs .
    Jiwu Shu, Fei Li , Siyang Li , Youyou Lu
    IEEE Transactions on Parallel and Distributed Systems (TPDS), 2020
    Paper
  • ShieldNVM: An Efficient and Fast Recoverable System for Secure Non-Volatile Memory .
    ACM Transactions on Storage (TOS), 2020
    Paper
  • Cross-Rack-Aware Single Failure Recovery for Clustered File Systems .
    Zhirong Shen, Patrick PC Lee , Jiwu Shu, Wenzhong Guo
    IEEE Transactions on Dependable and Secure Computing (TDSC), 2020
    Paper
  • Mitigating Synchronous I/O Overhead in File Systems on Open-Channel SSDs .
    Youyou Lu, Jiwu Shu, Jiacheng Zhang
    ACM Transactions on Storage (TOS), 2019
    Paper
  • Correlation-Aware Stripe Organization for Efficient Writes in Erasure-Coded Storage: Algorithms and Evaluation .
    Zhirong Shen, Patrick PC Lee , Jiwu Shu, Wenzhong Guo
    IEEE Transactions on Parallel and Distributed Systems (TPDS), 2019
    Paper
  • A Flattened Metadata Service for Distributed File Systems .
    Siyang Li , Fenlin Liu , Jiwu Shu, Youyou Lu, Tao Li , Yang Hu
    IEEE Transactions on Parallel and Distributed Systems (TPDS), 2018
    Paper
  • Efficient and Consistent NVMM Cache for SSD-based File System .
    IEEE Transactions on Computers (TC), 2018
    Paper
  • HiNFS: A Persistent Memory File System with Both Buffering and Direct-Access .
    ACM Transactions on Storage (TOS), 2018
    Paper
  • Encoding-Aware Data Placement for Efficient Degraded Reads in XOR-Coded Storage Systems: Algorithms and Evaluation .
    Zhirong Shen, Patrick PC Lee , Jiwu Shu, Wenzhong Guo
    IEEE Transactions on Parallel and Distributed Systems (TPDS), 2018
    Paper
  • FlashKV: Accelerating KV performance with open-channel SSDs .
    Jiacheng Zhang , Youyou Lu, Jiwu Shu, Xiongjun Qin
    ACM Transactions on Embedded Computing Systems (TECS), 2017
    Paper
  • Seek-efficient i/o optimization in single failure recovery for xor-coded storage systems .
    Zhirong Shen, Jiwu Shu, Patrick PC Lee , Yingxun Fu
    IEEE Transactions on Parallel and Distributed Systems (TPDS), 2017
    Paper
  • Short code: An efficient RAID-6 MDS code for optimizing degraded reads and partial stripe writes .
    Yingxun Fu , Jiwu Shu, Xianghong Luo , Zhirong Shen, Qingda Hu
    IEEE Transactions on Computers (TC), 2017
    Paper
  • Parity-switched data placement: Optimizing partial stripe writes in xor-coded storage systems .
    Zhirong Shen, Jiwu Shu, Yingxun Fu
    IEEE Transactions on Parallel and Distributed Systems (TPDS), 2016
    Paper
  • Hv code: An all-around mds code for raid-6 storage systems .
    Zhirong Shen, Jiwu Shu, Yingxun Fu
    IEEE Transactions on Parallel and Distributed Systems (TPDS), 2016
    Paper
  • Reconsidering single disk failure recovery for erasure coded storage systems: Optimizing load balancing in stack-level .
    IEEE Transactions on Parallel and Distributed Systems (TPDS), 2016
    Paper
  • Blurred persistence: Efficient transactions in persistent memory .
    Youyou Lu, Jiwu Shu, Long Sun
    ACM Transactions on Storage (TOS), 2016
    Paper
  • Supporting system consistency with differential transactions in flash-based SSDs .
    Youyou Lu, Jiwu Shu, Jia Guo , Peng Zhu
    IEEE Transactions on Computers (TC), 2016
    Paper
  • Caco: An efficient cauchy coding approach for cloud storage systems .
    Guangyan Zhang, Guiyong Wu , Shupeng Wang , Jiwu Shu, Weimin Zheng , Keqin Li
    IEEE Transactions on Computers (TC), 2016
    Paper
  • High-performance and lightweight transaction support in flash-based SSDs .
    Youyou Lu, Jiwu Shu, Jia Guo , Shuai Li , Onur Mutlu
    IEEE Transactions on Computers (TC), 2015
    Paper
  • Redistribute Data to Regain Load Balance during RAID-4 Scaling .
    Guangyan Zhang, Jigang Wang , Keqin Li , Jiwu Shu
    IEEE Transactions on Parallel and Distributed Systems (TPDS), 2015
    Paper
  • Design and implementation of an asymmetric block-based parallel file system .
    Letian Yi , Jiwu Shu, Ying Zhao , Yinjin Qing , Youyou Lu, Weiming Zheng
    IEEE Transactions on Computers (TC), 2014
    Paper
  • Generalized X-code: An efficient RAID-6 code for arbitrary size of disk array .
    Xianghong Luo , Jiwu Shu
    ACM Transactions on Storage (TOS), 2012
    Paper
  • Preventing Silent Data Corruptions from Propagating During Data Reconstruction .
    Mingqiang Li , Jiwu Shu
    IEEE Transactions on Computers (TC), 2010
    Paper
  • SOPA: Selecting the Optimal Policy Adaptively for a cache system .
    ACM Transactions on Storage (TOS), 2010
    Paper
  • DACO: A High Performance Disk Architecture Designed Specially for Large Scale Erasure Coded Storage Systems .
    Mingqiang Li , Jiwu Shu
    IEEE Transactions on Computers (TC), 2010
    Paper
  • ALV: A New Data Redistribution Approach to RAID-5 Scaling .
    Guangyan Zhang, Weimin Zheng , Jiwu Shu
    IEEE Transactions on Computers (TC), 2010
    Paper
  • GRID Codes: Strip-based Erasure Codes with High Fault Tolerance for Storage Systems .
    Mingqiang Li , Jiwu Shu, Weimin Zheng
    ACM Transactions on Storage (TOS), 2009
    Paper
  • SLAS: An Efficient Approach to Scaling Round-robin Striped Volumes .
    Guangyan Zhang, Jiwu Shu, Wei Xue , Weimin Zheng
    ACM Transactions on Storage (TOS), 2007
    Paper
  • Design and Implementation of an Out-of-Band Virtualization System for Large SANs .
    Guangyan Zhang, Jiwu Shu, Wei Xue , Weimin Zheng
    IEEE Transactions on Computers (TC), 2007
    Paper
  • Design and Implementation of a SAN System Based on the Fiber Channel Protocol .
    Jiwu Shu, Bigang Li , Weimin Zheng
    IEEE Transactions on Computers (TC), 2005
    Paper
  • A Parallel Transient Stability Simulation for Power System .
    Jiwu Shu, Wei Xue , Weimin Zheng
    IEEE Transactions on Power Systems (TOPS), 2005
    Paper



部分中文综述论文

Full Publications