Inferring Formal Properties of Production Key-Value Stores

Edgar Pek; Pranav Garg; Muntasir Raihan Rahman; Karl Palmskog,; Indranil Gupta; P. Madhusudan

arXiv:1712.10056·cs.DC·January 1, 2018

Inferring Formal Properties of Production Key-Value Stores

Edgar Pek, Pranav Garg, Muntasir Raihan Rahman, Karl Palmskog,, Indranil Gupta, P. Madhusudan

PDF

Open Access 1 Repo

TL;DR

This paper develops formal models for key-value store protocols like Riak and Cassandra, verifying their correctness and eventual consistency through novel certified program models in C.

Contribution

It introduces a new approach of modeling distributed protocols as certified C programs, enabling rigorous verification of properties like eventual consistency.

Findings

01

Models for read-repair and hinted-handoff protocols verified.

02

Protocols guarantee eventual consistency under specified failure assumptions.

03

Conformance tested against real production systems.

Abstract

Production distributed systems are challenging to formally verify, in particular when they are based on distributed protocols that are not rigorously described or fully understood. In this paper, we derive models and properties for two core distributed protocols used in eventually consistent production key-value stores such as Riak and Cassandra. We propose a novel modeling called certified program models, where complete distributed systems are captured as programs written in traditional systems languages such as concurrent C. Specifically, we model the read-repair and hinted-handoff recovery protocols as concurrent C programs, test them for conformance with real systems, and then verify that they guarantee eventual consistency, modeling precisely the specification as well as the failure assumptions under which the results hold.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

palmskog/evcons-model
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDistributed systems and fault tolerance · Software System Performance and Reliability · Service-Oriented Architecture and Web Services