TurboKV: Scaling Up The Performance of Distributed Key-Value Stores With In-Switch Coordination
Hebatalla Eldakiky, David Hung-Chang Du, Eman Ramadan (Department of, Computer Science, Engineering, University of Minnesota - Twin Cities, USA)

TL;DR
TurboKV leverages programmable switches for in-network partition management and load monitoring to significantly enhance throughput and reduce latency in distributed key-value stores.
Contribution
This paper introduces TurboKV, a novel architecture that uses programmable switches for partition management and load balancing in distributed key-value stores.
Findings
Improves throughput of distributed key-value stores.
Reduces latency compared to existing architectures.
Demonstrates effective load balancing with in-network monitoring.
Abstract
The power and flexibility of software-defined networks lead to a programmable network infrastructure in which in-network computation can help accelerating the performance of applications. This can be achieved by offloading some computational tasks to the network. However, what kind of computational tasks should be delegated to the network to accelerate applications performance? In this paper, we propose a way to exploit the usage of programmable switches to scale up the performance of distributed key-value stores. Moreover, as a proof-of-concept, we propose TurboKV, an efficient distributed key-value store architecture that utilizes programmable switches as: 1) partition management nodes to store the key-value store partitions and replicas information; and 2) monitoring stations to measure the load of storage nodes, this monitoring information is used to balance the load among storage…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware-Defined Networks and 5G · Software System Performance and Reliability · Cloud Computing and Resource Management
