Reliability of Erasure Coded Storage Systems: A Geometric Approach
Antonio Campello, Vinay A. Vaishampayan

TL;DR
This paper introduces a geometric method to directly compute data loss probabilities in erasure coded storage systems with general repair and failure durations, extending beyond exponential assumptions.
Contribution
It develops a geometric approach to calculate data loss probabilities for arbitrary duration distributions, including exact formulas and bounds, improving reliability analysis methods.
Findings
Closed-form limiting expression for data loss probability.
Geometric approach using polytope volume calculations.
Comparison of theoretical results with simulations.
Abstract
We consider the probability of data loss, or equivalently, the reliability function for an erasure coded distributed data storage system under worst case conditions. Data loss in an erasure coded system depends on probability distributions for the disk repair duration and the disk failure duration. In previous works, the data loss probability of such systems has been studied under the assumption of exponentially distributed disk failure and disk repair durations, using well-known analytic methods from the theory of Markov processes. These methods lead to an estimate of the integral of the reliability function. Here, we address the problem of directly calculating the data loss probability for general repair and failure duration distributions. A closed limiting form is developed for the probability of data loss and it is shown that the probability of the event that a repair duration…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
