Quality of Service of an Asynchronous Crash-Recovery Leader Election Algorithm

Vinícius A. Reis; Gustavo M. D. Vieira

arXiv:1704.06302·cs.DC·April 24, 2017·2 cites

Open Access

TL;DR

This paper introduces and analyzes NFD-L, a leader election algorithm for asynchronous crash-recovery systems, focusing on its quality of service, efficiency, and reliability in failure detection.

Contribution

It presents a novel leader election algorithm tailored for crash-recovery models and evaluates its QoS and efficiency in asynchronous distributed systems.

Findings

1. Efficient use of stable memory and message exchanges.
2. Provides predictable QoS in crash-recovery failure models.
3. Analyzes the behavior and reliability of NFD-L.

Abstract

In asynchronous distributed systems it is very hard to assess whether one of the processes taking part in a computation is operating correctly or has failed. To overcome this problem, distributed algorithms are built using unreliable failure detectors, which capture in an abstract way the timing assumptions necessary to assess the operating status of a process. One particular type of failure detector is a leader election, which indicates a single process that has not failed. The unreliability of these failure detectors means that they can make mistakes; however, if they are to be used in practice, there must be limits to the eventual behavior of these detectors. These limits are defined as the quality of service (QoS) provided by the detector. Many works have tackled the problem of creating failure detectors with predictable QoS, but only for crash-stop processes and synchronous systems. This paper…
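
The general idea of an unreliable failure detector used for leader election can be illustrated with a toy sketch. This is not the paper's NFD-L algorithm; it is a generic heartbeat-and-timeout detector that trusts the lowest-numbered process it has heard from recently. The class name `HeartbeatLeaderElector`, the `timeout` parameter, and the lowest-id rule are all assumptions made for illustration, and the sketch ignores stable storage and recovery entirely.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class HeartbeatLeaderElector:
    """Toy timeout-based detector (hypothetical; NOT the paper's NFD-L)."""

    timeout: float  # assumed: seconds of silence before a process is suspected
    last_heartbeat: Dict[int, float] = field(default_factory=dict)

    def on_heartbeat(self, pid: int, now: float) -> None:
        # Record the arrival time of the latest heartbeat from process pid.
        self.last_heartbeat[pid] = now

    def trusted(self, now: float) -> List[int]:
        # A process is trusted while its last heartbeat is at most `timeout` old.
        return sorted(
            pid for pid, t in self.last_heartbeat.items() if now - t <= self.timeout
        )

    def leader(self, now: float) -> Optional[int]:
        # Illustrative rule: the leader is the trusted process with the lowest id.
        alive = self.trusted(now)
        return alive[0] if alive else None


elector = HeartbeatLeaderElector(timeout=2.0)
elector.on_heartbeat(1, now=0.0)
elector.on_heartbeat(2, now=0.0)
elector.on_heartbeat(2, now=3.0)
print(elector.leader(now=1.0))  # 1
print(elector.leader(now=4.0))  # 2 (process 1 is suspected; a mistake if it is only slow)
elector.on_heartbeat(1, now=4.5)
print(elector.leader(now=5.0))  # 1 (the earlier suspicion is corrected)
```

In this sketch the choice of `timeout` is where a QoS trade-off would show up: a shorter value suspects failed processes sooner but makes wrong suspicions of slow processes more likely.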

Taxonomy

Topics: Distributed systems and fault tolerance · Optimization and Search Problems · Age of Information Optimization