Limiting Lamport Exposure to Distant Failures in Globally-Managed Distributed Systems
Cristina B\u{a}sescu, Georgia Fragkouli, Enis Ceyhun Alp, Michael F., Nowlan, Jose M. Faleiro, Gaylor Bosson, Kelong Cong, Pierluca Bors\`o-Tan,, Vero Estrada-Gali\~nanes, and Bryan Ford

TL;DR
Limix is a metadata configuration service that enhances the resilience and availability of globally-managed distributed systems by confining metadata to local regions, thus insulating services from distant failures.
Contribution
Limix introduces a novel approach to global configuration management by ensuring metadata locality, improving resilience and availability in distributed systems.
Findings
Limix maintains high availability even during distant failures.
Experiments show Limix outperforms existing global management solutions.
Limix effectively isolates metadata from remote failures in real-world networks.
Abstract
Globalized computing infrastructures offer the convenience and elasticity of globally managed objects and services, but lack the resilience to distant failures that localized infrastructures such as private clouds provide. Providing both global management and resilience to distant failures, however, poses a fundamental problem for configuration services: How to discover a possibly migratory, strongly-consistent service/object in a globalized infrastructure without dependencies on globalized state? Limix is the first metadata configuration service that addresses this problem. With Limix, global strongly-consistent data-plane services and objects are insulated from remote gray failures by ensuring that the definitive, strongly-consistent metadata for any object is always confined to the same region as the object itself. Limix guarantees availability bounds: any user can continue accessing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · IoT and Edge/Fog Computing · Service-Oriented Architecture and Web Services
