Does The Cloud Need Stabilizing?
Murat Demirbas, Aleksey Charapko, Ailidani Ailijiang

TL;DR
This paper investigates the high availability of cloud computing services, analyzing the role of self-stabilization in managing errors caused by concurrency in distributed programs at large scales.
Contribution
It explores the potential of self-stabilization techniques to enhance cloud service reliability amidst complex concurrency issues.
Findings
Concurrency causes many errors in distributed programs
Self-stabilization could improve cloud service robustness
High availability relies on multiple factors, including stabilization
Abstract
The last decade has witnessed rapid proliferation of cloud computing. While even the smallest distributed programs (with 3-5 actions) produce many unanticipated error cases due to concurrency involved, it seems short of a miracle these web-services are able to operate at those vast scales. In this paper, we explore the factors that contribute most to the high-availability of cloud computing services and examine where self-stabilization could fit in that picture.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Distributed systems and fault tolerance · Distributed and Parallel Computing Systems
