SMORE: A Cold Data Object Store for SMR Drives (Extended Version)
Peter Macko, Xiongzi Ge, John Haskins Jr., James Kelley, David Slik,, Keith A. Smith, Maxim G. Smith

TL;DR
SMORE is an object storage system optimized for shingled magnetic recording drives, enabling efficient storage and retrieval of large, infrequently changed data objects with high throughput and quick crash recovery.
Contribution
It introduces a log-structured, zone-aware storage architecture with an index on flash, tailored for SMR drives, ensuring reliable, efficient large object storage with crash resilience.
Findings
Achieves full disk bandwidth for large object ingestion and reading.
Maintains a write amplification below 2.0 at 80% occupancy.
Recovers from crashes in less than a minute with hourly index snapshots.
Abstract
Shingled magnetic recording (SMR) increases the capacity of magnetic hard drives, but it requires that each zone of a disk be written sequentially and erased in bulk. This makes SMR a good fit for workloads dominated by large data objects with limited churn. To explore this possibility, we have developed SMORE, an object storage system designed to reliably and efficiently store large, seldom-changing data objects on an array of host-managed or host-aware SMR disks. SMORE uses a log-structured approach to accommodate the constraint that all writes to an SMR drive must be sequential within large shingled zones. It stripes data across zones on separate disks, using erasure coding to protect against drive failure. A separate garbage collection thread reclaims space by migrating live data out of the emptiest zones so that they can be trimmed and reused. An index stored on flash and backed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Distributed and Parallel Computing Systems · Distributed systems and fault tolerance
