Investigating Power Outage Effects on Reliability of Solid-State Drives
Saba Ahmadian, Farhad Taheri, Mehrshad Lotfi, Maryam Karimi, and, Hossein Asad

TL;DR
This study analyzes how workload parameters and realistic power failures affect SSD reliability, revealing that data loss can occur up to 700ms after request completion, influenced by access patterns and request types.
Contribution
The paper introduces a realistic fault injection platform simulating power failures and studies workload effects on SSD reliability, which was lacking in prior research.
Findings
Data loss can occur up to 700ms after request completion.
Failure rate depends on request type, size, and access pattern.
Workload Working Set Size has negligible impact.
Abstract
Solid-State Drives (SSDs) are recently employed in enterprise servers and high-end storage systems in order to enhance performance of storage subsystem. Although employing high speed SSDs in the storage subsystems can significantly improve system performance, it comes with significant reliability threat for write operations upon power failures. In this paper, we present a comprehensive analysis investigating the impact of workload dependent parameters on the reliability of SSDs under power failure for variety of SSDs (from top manufacturers). To this end, we first develop a platform to perform two important features required for study: a) a realistic fault injection into the SSD in the computing systems and b) data loss detection mechanism on the SSD upon power failure. In the proposed physical fault injection platform, SSDs experience a real discharge phase of Power Supply Unit (PSU)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
