Stabilization and Fault-Tolerance in Presence of Unchangeable   Environment Actions

Mohammad Roohitavaf; Sandeep Kulkarni

arXiv:1508.00864·cs.DC·August 5, 2015·1 cites

Stabilization and Fault-Tolerance in Presence of Unchangeable Environment Actions

Mohammad Roohitavaf, Sandeep Kulkarni

PDF

Open Access

TL;DR

This paper addresses adding fault-tolerance to concurrent protocols in environments with unchangeable actions, which are both assistive and disruptive, proposing algorithms that are sound, complete, and computationally efficient.

Contribution

The paper introduces algorithms for stabilizing, failsafe, and masking fault-tolerance in environments with unchangeable actions, extending previous approaches to more complex scenarios.

Findings

01

Algorithms are sound and complete.

02

Adding fault-tolerance remains in P complexity.

03

Previous methods cannot be directly extended to this context.

Abstract

We focus on the problem of adding fault-tolerance to an existing concurrent protocol in the presence of {\em unchangeable environment actions}. Such unchangeable actions occur in practice due to several reasons. One instance includes the case where only a subset of the components/processes can be revised and other components/processes must be as is. Another instance includes cyber-physical systems where revising physical components may be undesirable or impossible. These actions differ from faults in that they are simultaneously {\em assistive} and {\em disruptive}, whereas faults are only disruptive. For example, if these actions are a part of a physical component, their execution is essential for the normal operation of the system. However, they can potentially disrupt actions taken by other components for dealing with faults. Also, one can typically assume that fault actions will…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDistributed systems and fault tolerance · Real-Time Systems Scheduling · Petri Nets in System Modeling