AllConcur: Leaderless Concurrent Atomic Broadcast (Extended Version)
Marius Poke, Torsten Hoefler, Colin W. Glass

TL;DR
AllConcur introduces a leaderless, concurrent atomic broadcast protocol that significantly improves throughput and reduces latency in distributed systems, enabling scalable fault-tolerant agreement without central bottlenecks.
Contribution
It presents the first leaderless concurrent atomic broadcast algorithm, achieving high throughput and low latency, surpassing traditional leader-based protocols like Libpaxos.
Findings
Supports up to 135 million requests per second.
Achieves 17x higher throughput than Libpaxos.
Handles scalable fault-tolerant agreement without a central leader.
Abstract
Many distributed systems require coordination between the components involved. With the steady growth of such systems, the probability of failures increases, which necessitates scalable fault-tolerant agreement protocols. The most common practical agreement protocol, for such scenarios, is leader-based atomic broadcast. In this work, we propose AllConcur, a distributed system that provides agreement through a leaderless concurrent atomic broadcast algorithm, thus, not suffering from the bottleneck of a central coordinator. In AllConcur, all components exchange messages concurrently through a logical overlay network that employs early termination to minimize the agreement latency. Our implementation of AllConcur supports standard sockets-based TCP as well as high-performance InfiniBand Verbs communications. AllConcur can handle up to 135 million requests per second and achieves 17x…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed systems and fault tolerance · Cloud Computing and Resource Management · Software System Performance and Reliability
