Accelerating Big-Data Sorting Through Programmable Switches
Yamit Barshatz-Schneor, Roy Friedman

TL;DR
This paper introduces a novel partial sorting algorithm designed for programmable switches to accelerate merge sort in data-center environments, achieving significant runtime improvements by leveraging in-network processing.
Contribution
The paper presents a new partial sorting algorithm compatible with programmable switch constraints, enabling in-network data pre-sorting to reduce server-side sorting workload.
Findings
20%-75% reduction in sorting runtime
Effective utilization of switch parallelism for partial sorting
Improved server-side sorting efficiency
Abstract
Sorting is a fundamental and well studied problem that has been studied extensively. Sorting plays an important role in the area of databases, as many queries can be served much faster if the relations are first sorted. One of the most popular sorting algorithm in databases is merge sort. In modern data-centers, data is stored in storage servers, while processing takes place in compute servers. Hence, in order to compute queries on the data, it must travel through the network from the storage servers to the compute servers. This creates a potential for utilizing programmable switches to perform partial sorting in order to accelerate the sorting process at the server side. This is possible because, as mentioned above, data packets pass through the switch in any case on their way to the server. Alas, programmable switches offer a very restricted and non-intuitive programming model,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware-Defined Networks and 5G · Caching and Content Delivery · Cloud Computing and Resource Management
