A case study on parametric verification of failure detectors

Thanh-Hai Tran; Igor Konnov; and Josef Widder

arXiv:2112.08826·cs.LO·June 22, 2023

A case study on parametric verification of failure detectors

Thanh-Hai Tran, Igor Konnov, and Josef Widder

PDF

Open Access

TL;DR

This paper presents a formal verification case study of the Chandra and Toueg failure detector under partial synchrony, introducing cutoff results and verifying safety and liveness across multiple frameworks.

Contribution

It formalizes symmetric point-to-point algorithms with cutoff results in various models and demonstrates verification of safety and liveness properties using different tools.

Findings

01

Verification of safety for fixed time bounds using TLA+ and counter automata.

02

Safety for arbitrary time bounds verified with IVy.

03

Liveness verified through inductive invariants in IVy.

Abstract

Partial synchrony is a model of computation in many distributed algorithms and modern blockchains. These algorithms are typically parameterized in the number of participants, and their correctness requires the existence of bounds on message delays and on the relative speed of processes after reaching Global Stabilization Time. These characteristics make partially synchronous algorithms parameterized in the number of processes, and parametric in time bounds, which render automated verification of partially synchronous algorithms challenging. In this paper, we present a case study on formal verification of both safety and liveness of the Chandra and Toueg failure detector that is based on partial synchrony. To this end, we first introduce and formalize the class of symmetric point-to-point algorithms that contains the failure detector. Second, we show that these symmetric point-to-point…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDistributed systems and fault tolerance · Modular Robots and Swarm Intelligence · Petri Nets in System Modeling