A Fault-tolerance Linguistic Structure for Distributed Applications
Vincenzo De Florio

TL;DR
This paper introduces a new linguistic structuring technique for fault-tolerance in distributed applications with soft real-time constraints, aiming to simplify and improve fault management at the application level.
Contribution
It defines a set of structural attributes for assessing fault-tolerance structures and proposes a novel recovery language for better error handling in distributed systems.
Findings
The new structuring technique exhibits satisfactory structural attribute values.
The recovery language effectively addresses error recovery and reconfiguration.
The approach is suitable for soft real-time, distributed, parallel applications.
Abstract
The structures for the expression of fault-tolerance provisions into the application software are the central topic of this dissertation. Structuring techniques provide means to control complexity, the latter being a relevant factor for the introduction of design faults. This fact and the ever increasing complexity of today's distributed software justify the need for simple, coherent, and effective structures for the expression of fault-tolerance in the application software. A first contribution of this dissertation is the definition of a base of structural attributes with which application-level fault-tolerance structures can be qualitatively assessed and compared with each other and with respect to the above mentioned need. This result is then used to provide an elaborated survey of the state-of-the-art of software fault-tolerance structures. The key contribution of this work is a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed systems and fault tolerance · Advanced Software Engineering Methodologies · Distributed and Parallel Computing Systems
