PACEMAKER: Avoiding HeART attacks in storage clusters with disk-adaptive redundancy
Saurabh Kadekodi, Francisco Maturana, Suhas Jayaram Subramanya,, Juncheng Yang, K. V. Rashmi, Gregory R. Ganger

TL;DR
PACEMAKER is a novel system that enables disk-adaptive redundancy in storage clusters by proactively managing data layouts and transitions, significantly reducing transition overhead while maintaining space efficiency and data protection.
Contribution
It introduces PACEMAKER, a low-overhead orchestrator that mitigates transition overload in disk-adaptive redundancy schemes through proactive data organization and transition timing.
Findings
Transition IO overhead is reduced to under 5% of cluster bandwidth.
PACEMAKER achieves 14-20% space savings in large production clusters.
It maintains data protection without transition overload.
Abstract
Data redundancy provides resilience in large-scale storage clusters, but imposes significant cost overhead. Substantial space-savings can be realized by tuning redundancy schemes to observed disk failure rates. However, prior design proposals for such tuning are unusable in real-world clusters, because the IO load of transitions between schemes overwhelms the storage infrastructure (termed transition overload). This paper analyzes traces for millions of disks from production systems at Google, NetApp, and Backblaze to expose and understand transition overload as a roadblock to disk-adaptive redundancy: transition IO under existing approaches can consume 100% cluster IO continuously for several weeks. Building on the insights drawn, we present PACEMAKER, a low-overhead disk-adaptive redundancy orchestrator. PACEMAKER mitigates transition overload by (1) proactively organizing data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Cloud Computing and Resource Management · Cloud Data Security Solutions
