Stabilization and Fault-Tolerance in Presence of Unchangeable Environment Actions
Mohammad Roohitavaf, Sandeep Kulkarni

TL;DR
This paper addresses adding fault-tolerance to concurrent protocols in environments with unchangeable actions, which are both assistive and disruptive, proposing algorithms that are sound, complete, and computationally efficient.
Contribution
The paper introduces algorithms for stabilizing, failsafe, and masking fault-tolerance in environments with unchangeable actions, extending previous approaches to more complex scenarios.
Findings
Algorithms are sound and complete.
Adding fault-tolerance remains in P complexity.
Previous methods cannot be directly extended to this context.
Abstract
We focus on the problem of adding fault-tolerance to an existing concurrent protocol in the presence of {\em unchangeable environment actions}. Such unchangeable actions occur in practice due to several reasons. One instance includes the case where only a subset of the components/processes can be revised and other components/processes must be as is. Another instance includes cyber-physical systems where revising physical components may be undesirable or impossible. These actions differ from faults in that they are simultaneously {\em assistive} and {\em disruptive}, whereas faults are only disruptive. For example, if these actions are a part of a physical component, their execution is essential for the normal operation of the system. However, they can potentially disrupt actions taken by other components for dealing with faults. Also, one can typically assume that fault actions will…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed systems and fault tolerance · Real-Time Systems Scheduling · Petri Nets in System Modeling
