Making Democracy Work: Fixing and Simplifying Egalitarian Paxos (Extended Version)
Fedor Ryabinin, Alexey Gotsman, and Pierre Sutra

TL;DR
This paper introduces EPaxos*, a simplified and correct variant of Egalitarian Paxos, enhancing fault tolerance and efficiency in leaderless state-machine replication protocols.
Contribution
It presents a simpler failure-recovery algorithm and generalizes Egalitarian Paxos to cover a broader range of failure thresholds with proven optimal process counts.
Findings
EPaxos* is simpler and correct compared to Egalitarian Paxos.
The protocol maintains non-zero throughput with up to f crashes.
It generalizes to cover the entire spectrum of failure thresholds with optimal process counts.
Abstract
Classical state-machine replication protocols, such as Paxos, rely on a distinguished leader process to order commands. Unfortunately, this approach makes the leader a single point of failure and increases the latency for clients that are not co-located with it. As a response to these drawbacks, Egalitarian Paxos introduced an alternative, leaderless approach, that allows replicas to order commands collaboratively. Not relying on a single leader allows the protocol to maintain non-zero throughput with up to crashes of any processes out of a total of . The protocol furthermore allows any process to execute a command fast, in message delays, provided no more than other processes fail, and all concurrently submitted commands commute with ; the latter condition is often satisfied in practical systems. Egalitarian Paxos has served as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
