Self-stabilizing Multivalued Consensus in Asynchronous Crash-prone Systems
Oskar Lundstr\"om, Michel Raynal, Elad Michael Schiller

TL;DR
This paper introduces the first self-stabilizing, wait-free multivalued consensus algorithm for asynchronous systems that can recover from arbitrary transient faults and process failures, improving reliability and resource efficiency.
Contribution
It presents the first self-stabilizing, wait-free multivalued consensus algorithm for asynchronous message-passing systems, supporting concurrent binary consensus invocations and enhanced fault tolerance.
Findings
Supports recovery from arbitrary transient faults
Achieves wait-freedom in multivalued consensus
Uses fewer resources than previous solutions
Abstract
The problem of multivalued consensus is fundamental in the area of fault-tolerant distributed computing since it abstracts a very broad set of agreement problems in which processes have to uniformly decide on a specific value v in V, where |V| >1. Existing solutions (that tolerate process failures) reduce the multivalued consensus problem to the one of binary consensus, e.g., Mostefaoui-Raynal-Tronel and Zhang-Chen. Our study aims at the design of an even more reliable solution. We do so through the lenses of self-stabilization -- a very strong notion of fault-tolerance. In addition to node and communication failures, self-stabilizing algorithms can recover after the occurrence of arbitrary transient-faults; these faults represent any violation of the assumptions according to which the system was designed to operate (as long as the algorithm code stays intact). This work proposes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed systems and fault tolerance · Distributed and Parallel Computing Systems · Cloud Computing and Resource Management
