- Perseus: A Fail-Slow Detection Framework for Cloud Storage Systems (FAST'23)
- On Mixing Eventual and Strong Consistency: Acute Cloud Types (TPDS'22)
- A Survey and Classification of Software-Defined Storage Systems (ACM Computing Surveys'20)
- Understanding and Discovering Software Configuration Dependencies in Cloud and Datacenter Systems (ESEC/FSE'20)
- Hybrid Data Reliability for Emerging Key-Value Storage Devices (FAST'20)
- File Systems Unfit as Distributed Storage Backends: Lessons from 10 Years of Ceph Evolution (SOSP'19)
- DistCache: Provable Load Balancing for Large-Scale Storage Systems with Distributed Caching (FAST'19)
- Fail-Slow at Scale: Evidence of Hardware Performance Faults in Large Production Systems (FAST'18)
- Protocol-Aware Recovery for Consensus-Based Storage (FAST'18)
- ScaleCheck: A Single-Machine Approach for Discovering Scalability Bugs in Large Distributed Systems(FAST'18)
- Towards Web-based Delta Synchronization for Cloud Storage Services (FAST'18)
- PFault: A General Framework for Analyzing the Reliability of High-Performance Parallel File Systems (ICS'18)
- Scaling Embedded In-Situ Indexing with DeltaFS (SC'18)
- HopsFS: Scaling Hierarchical File System Metadata Using NewSQL Databases (FAST'17)
- Crystal: Software-Defined Storage for Multi-tenant Object Stores (FAST'17)
- Opening the Chrysalis: On the Real Repair Performance of MSR Codes (FAST'16)
- On the Synchronization Bottleneck of OpenStack Swift-like Cloud Storage Systems (INFOCOM'16)
- SSD Failures in Datacenters: What? When? and Why? (SYSTOR'16)
- ShardFS vs. IndexFS: Replication vs. Caching Strategies for Distributed Metadata Management in Cloud Storage Systems (SoCC'15)
- Eventually Consistent: Not What You Were Expecting?: Methods of quantifying consistency (or lack thereof) in eventually consistent storage systems (ACM Queue'14)
- Benchmarking Eventual Consistency: Lessons Learned from Long-Term Experimental Studies (IC2E'14)
- IndexFS: Scaling File System Metadata Performance with Stateless Caching and Bulk Insertion (SC'14)
- Openstack Swift: Using, Administering, and Developing for Swift Object Storage (Joe Arnold'14)
- Scale and Concurrency of GIGA+: File System Directories with Millions of Files (FAST'11)
- Eventual Consistency: How soon is eventual? (MW4SOC'11)
- Hadoop Distributed File System (MSST'10)
- Bigtable: A Distributed Storage System for Structured Data (OSDI'06)
- Ceph: A Scalable, High-Performance Distributed File System (OSDI'06)
- The Google File System (SOSP'03)
- Efficient Metadata Management in Large Distributed Storage Systems (MSST'03)
- DFS-Perf: A Scalable and Unified Benchmarking Framework for Distributed File Systems (Tech repo from UC Berkeley)
- Practical Design Considerations for Wide Locally Recoverable Codes (LRCs) (FAST'23)
- ParaRC: Embracing Sub-Packetization for Repair Parallelization in MSR-Coded Storage (FAST'23)
- Tiger: Disk-Adaptive Redundancy Without Placement Restrictions (OSDI'22)
- KVRAID: high performance, write efficient, update friendly erasure coding scheme for KV-SSDs (SYSTOR'21)
- Boosting Full-Node Repair in Erasure-Coded Storage (USENEX ATC'21)
- Repair Rate Lower Bounds for Distributed Storage (IEEE Transactions on Information Theory'21)
- StripeFinder: Erasure Coding of Small Objects Over Key-Value Storage Devices (An Uphill Battle) (HotStorage'20)
- An Erasure-Coded Storage System for Edge Computing (ACCESS'20)
- Parity Models: Erasure-Coded Resilience for Prediction Serving Systems (SOSP'19)
- Fast Erasure Coding for Data Storage: A Comprehensive Study of the Acceleration Techniques (FAST'19)
- OpenEC: Toward Unified and Configurable Erasure Coding Management in Distributed Storage Systems (FAST'19)
- Liquid Cloud Storage (TOS'19)
- On Fault Tolerance, Locality, and Optimality in Locally Repairable Codes (ATC'18)
- Clay Codes: Moulding MDS Codes to Yield an MSR Code (FAST'18)
- EC-Bench: Benchmarking Onload and Offload Erasure Coders on Modern Hardware Architectures (BenchCouncil'18)
- Opening the Chrysalis: On the Real Repair Performance of MSR Codes (FAST'16)
- OpenCL-based erasure coding on heterogeneous architectures (ASAP'16)
- Having your cake and eating it too: jointly optimal erasure codes for I/O, storage and network-bandwidth (FAST'15)
- A Tale of Two Erasure Codes in HDFS (FAST'15)
- STAIR Codes: A General Family of Erasure Codes for Tolerating Device and Sector Failures in Practical Storage Systems (FAST'14)
- Rethinking Erasure Codes for Cloud File Systems: Minimizing I/O for Recovery and Degraded Reads (FAST'12)
- A Performance Evaluation and Examination of Open-Source Erasure Coding Libraries For Storage (FAST'06)
- Exploring Fault-Tolerant Erasure Codes for Scalable All-Flash Array Clusters (TPDS: Volume: 30, Issue: 6, 01 June 2019)
- An XOR-Based Erasure-Resilient Coding Scheme (ICSI Technical Report, No. TR-95-048, 1995)
- Effective erasure codes for reliable computer communication protocols(ACM SIGCOMN computer communication review, 1997)
- XORing Elephants: Novel Erasure Codes for Big Data(Proceedings of the VLDB Endowment, 2013)
- Erasure coding vs. replication: Communications and Information Theory(Peer-to-Peer Systems, 2002)