Size-aware Sharding For Improving Tail Latencies in In-memory Key-value   Stores

Diego Didona; Willy Zwaenepoel

arXiv:1802.00696·cs.DB·February 5, 2018·26 cites

Size-aware Sharding For Improving Tail Latencies in In-memory Key-value Stores

Diego Didona, Willy Zwaenepoel

PDF

Open Access

TL;DR

This paper proposes size-aware sharding in in-memory key-value stores to significantly reduce tail latencies by preventing head-of-line blocking, with implementation in Minos showing substantial performance improvements.

Contribution

It introduces size-aware sharding for in-memory KV stores, optimizing tail latency and throughput by segregating small and large item requests and implementing hardware dispatch.

Findings

01

Minos achieves up to 100x lower 99th percentile latency.

02

Minos attains up to 7.4x higher throughput compared to state-of-the-art.

03

Size-aware sharding effectively reduces tail latencies in in-memory key-value stores.

Abstract

This paper introduces the concept of size-aware sharding to improve tail latencies for in-memory key-value stores, and describes its implementation in the Minos key-value store. Tail latencies are crucial in distributed applications with high fan-out ratios, because overall response time is determined by the slowest response. Size-aware sharding distributes requests for keys to cores according to the size of the item associated with the key. In particular, requests for small and large items are sent to disjoint subsets of cores. Size-aware sharding improves tail latencies by avoiding head-of-line blocking, in which a request for a small item gets queued behind a request for a large item. Alternative size-unaware approaches to sharding, such as keyhash-based sharding, request dispatching and stealing do not avoid head-of-line blocking, and therefore exhibit worse tail latencies. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCloud Computing and Resource Management · Distributed systems and fault tolerance · Software System Performance and Reliability