Failure Detectors in Homonymous Distributed Systems (with an Application to Consensus)
Sergio Ar\'evalo, Antonio Fern\'andez Anta, Damien Imbs and, Ernesto Jim\'enez, Michel Raynal

TL;DR
This paper introduces new failure detectors for homonymous distributed systems, enabling consensus solutions even without initial membership knowledge, and demonstrates their implementation under partial synchrony.
Contribution
It defines the classes H\Omega ext{ and }H\Sigma ext{, proves their implementability, and provides algorithms for consensus in homonymous systems with partial synchrony and no initial membership info.
Findings
H\Omega ext{ can be implemented with partial synchrony.
Consensus is solvable in anonymous systems with partial synchrony.
New failure detectors extend consensus capabilities in homonymous systems.
Abstract
This paper addresses the consensus problem in homonymous distributed systems where processes are prone to crash failures and have no initial knowledge of the system membership ("homonymous" means that several processes may have the same identifier). New classes of failure detectors suited to these systems are first defined. Among them, the classes H\Omega\ and H\Sigma\ are introduced that are the homonymous counterparts of the classes \Omega\ and \Sigma, respectively. (Recall that the pair <\Omega,\Sigma> defines the weakest failure detector to solve consensus.) Then, the paper shows how H\Omega\ and H\Sigma\ can be implemented in homonymous systems without membership knowledge (under different synchrony requirements). Finally, two algorithms are presented that use these failure detectors to solve consensus in homonymous asynchronous systems where there is no initial knowledge of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed systems and fault tolerance · Optimization and Search Problems · Mobile Agent-Based Network Management
