Self-stabilizing Reconfiguration
Shlomi Dolev, Chryssis Georgiou, Ioannis Marcoullis, Elad M. Schiller

TL;DR
This paper introduces a self-stabilizing reconfiguration scheme that automatically recovers from transient faults and violations of initial assumptions, ensuring system safety and enabling dynamic services in message passing systems.
Contribution
It presents the first self-stabilizing reconfiguration method that recovers from transient faults and relaxes initial assumptions, using reliable failure detectors temporarily.
Findings
Recovers safety automatically after transient faults.
Supports implementation of reconfigurable virtual synchrony.
Enables self-stabilizing reconfigurable state-machine replication.
Abstract
Current reconfiguration techniques are based on starting the system in a consistent configuration, in which all participating entities are in their initial state. Starting from that state, the system must preserve consistency as long as a predefined churn rate of processors joins and leaves is not violated, and unbounded storage is available. Many working systems cannot control this churn rate and do not have access to unbounded storage. System designers that neglect the outcome of violating the above assumptions may doom the system to exhibit illegal behaviors. We present the first automatically recovering reconfiguration scheme that recovers from transient faults, such as temporal violations of the above assumptions. Our self-stabilizing solutions regain safety automatically by assuming temporal access to reliable failure detectors. Once safety is re-established, the failure detector…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed systems and fault tolerance · Parallel Computing and Optimization Techniques · Interconnection Networks and Systems
