Application-layer Fault-Tolerance Protocols
Vincenzo De Florio

TL;DR
This paper emphasizes the importance of application-layer fault-tolerance in creating dependable computer systems, highlighting its role alongside hardware and OS fault-tolerance to address design, interaction, and security faults.
Contribution
It clarifies the significance of application-level fault-tolerance and argues it is essential for comprehensive system dependability, complementing other fault-tolerance approaches.
Findings
Application-layer fault-tolerance addresses design and interaction faults.
It is crucial for preventing security vulnerabilities and malicious attacks.
Combining fault-tolerance at multiple layers enhances system resilience.
Abstract
The central topic of this book is application-level fault-tolerance, that is the methods, architectures, and tools that allow to express a fault-tolerant system in the application software of our computers. Application-level fault-tolerance is a sub-class of software fault-tolerance that focuses on the problems of expressing the problems and solutions of fault-tolerance in the top layer of the hierarchy of virtual machines that constitutes our computers. This book shows that application-level fault-tolerance is a key ingredient to craft truly dependable computer systems--other approaches, such as hardware fault-tolerance, operating system fault-tolerance, or fault-tolerant middleware, are also important ingredients to achieve resiliency, but they are not enough. Failing to address the application layer means leaving a backdoor open to problems such as design faults, interaction faults,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
