Using RDMA for Efficient Index Replication in LSM Key-Value Stores
Michalis Vardoulakis, Giorgos Saloustros, Pilar Gonz\'alez-F\'erez,, and Angelos Bilas

TL;DR
This paper introduces Talos, a novel RDMA-based approach for efficient index replication in LSM key-value stores, significantly reducing I/O and CPU overhead while maintaining high performance and scalability.
Contribution
Talos leverages RDMA and KV separation to enable efficient primary-backup replication with minimal I/O amplification and CPU overhead, avoiding costly backup compactions.
Findings
Reduces backup I/O amplification by up to 3x
Decreases CPU overhead in backups by up to 1.6x
Allows larger growth factors for space efficiency
Abstract
Log-Structured Merge tree (LSM tree) Key-Value (KV) stores have become a foundational layer in the storage stacks of datacenter and cloud services. Current approaches for achieving reliability and availability avoid replication at the KV store level and instead perform these operations at higher layers, e.g., the DB layer that runs on top of the KV store. The main reason is that past designs for replicated KV stores favor reducing network traffic and increasing I/O size. Therefore, they perform costly compactions to reorganize data in both the primary and backup nodes, which hurts overall system performance. In this paper, we design and implement Talos, an efficient rack-scale LSM-based KV store that aims to significantly reduce the I/O amplification and CPU overhead in backup nodes and make replication in the KV store practical. We rely on two observations: (a) the increased use of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · Advanced Data Storage Technologies · Distributed systems and fault tolerance
