Using Paxos to Build a Scalable, Consistent, and Highly Available Datastore
Jun Rao (LinkedIn), Eugene J. Shekita (IBM Research), Sandeep Tata, (IBM Research)

TL;DR
This paper presents Spinnaker, a scalable and highly available datastore that uses Paxos-based replication to ensure strong consistency and fault tolerance in a large cluster environment.
Contribution
It introduces a Paxos-based replication protocol for Spinnaker, enabling high availability and consistency in a distributed datastore on commodity hardware.
Findings
Paxos ensures data availability with a majority of replicas alive.
Spinnaker's Paxos replication is competitive with weaker consistency methods.
Read performance is comparable or better than eventually consistent datastores.
Abstract
Spinnaker is an experimental datastore that is designed to run on a large cluster of commodity servers in a single datacenter. It features key-based range partitioning, 3-way replication, and a transactional get-put API with the option to choose either strong or timeline consistency on reads. This paper describes Spinnaker's Paxos-based replication protocol. The use of Paxos ensures that a data partition in Spinnaker will be available for reads and writes as long a majority of its replicas are alive. Unlike traditional master-slave replication, this is true regardless of the failure sequence that occurs. We show that Paxos replication can be competitive with alternatives that provide weaker consistency guarantees. Compared to an eventually consistent datastore, we show that Spinnaker can be as fast or even faster on reads and only 5% to 10% slower on writes.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed systems and fault tolerance · Advanced Data Storage Technologies · Cloud Computing and Resource Management
