Evaluating Reliability of SSD-Based I/O Caches in Enterprise Storage Systems
Saba Ahmadian, Farhad Taheri, Hossein Asadi

TL;DR
This study thoroughly examines the reliability of SSD-based I/O caches in enterprise storage systems under power failures and high temperatures, revealing workload-dependent failure behaviors and the impact of request parameters.
Contribution
It introduces a physical fault injection platform and provides new insights into how workload parameters influence SSD cache reliability under failure conditions.
Findings
Failure rate increases significantly with smaller request sizes.
Read accesses can also fail during power outages, especially during data promotion.
High temperature faults do not cause data failures in the SSD cache.
Abstract
In this paper, we present a comprehensive analysis investigating the reliability of SSD-based I/O caching architectures used in enterprise storage systems under power failure and high-operating temperature. We explore variety of SSDs from top vendors and investigate the cache reliability in mirrored configuration. To this end, we first develop a physical fault injection and failure detection platform and then investigate the impact of workload dependent parameters on the reliability of I/O cache in the presence of two common failure types in data centers, power outage and high temperature faults. We implement an I/O cache scheme using an open-source I/O cache module in Linux operating system. The experimental results obtained by conducting more than twenty thousand of physical fault injections on the implemented I/O cache with different write policies reveal that the failure rate of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Caching and Content Delivery · Parallel Computing and Optimization Techniques
