Design and Analysis of a Logless Dynamic Reconfiguration Protocol
William Schultz, Siyuan Zhou, Ian Dardik, Stavros Tripakis

TL;DR
This paper introduces MongoRaftReconfig, a novel logless dynamic reconfiguration protocol for MongoDB's Raft-based replication system, offering safety guarantees, formal verification, and performance improvements.
Contribution
It presents the first safety proof and formal specification of a reconfiguration protocol for Raft-based systems, with practical engineering insights and performance evaluation.
Findings
Safety properties verified through model checking
Protocol offers performance benefits during reconfiguration
First formal safety proof for Raft reconfiguration protocol
Abstract
Distributed replication systems based on the replicated state machine model have become ubiquitous as the foundation of modern database systems. To ensure availability in the presence of faults, these systems must be able to dynamically replace failed nodes with healthy ones via dynamic reconfiguration. MongoDB is a document oriented database with a distributed replication mechanism derived from the Raft protocol. In this paper, we present MongoRaftReconfig, a novel dynamic reconfiguration protocol for the MongoDB replication system. MongoRaftReconfig utilizes a logless approach to managing configuration state and decouples the processing of configuration changes from the main database operation log. The protocol's design was influenced by engineering constraints faced when attempting to redesign an unsafe, legacy reconfiguration mechanism that existed previously in MongoDB. We provide…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
