Fast and Efficient Merge of Sorted Input Lists in Hardware Using List Offset Merge Sorters
Robert B. Kent, Marios S. Pattichis

TL;DR
This paper introduces List Offset Merge Sorters (LOMS), a hardware device that merges multiple sorted lists faster and more resource-efficiently than previous methods, especially in FPGA implementations.
Contribution
The paper presents LOMS, a novel hardware merge sort device that improves speed and resource efficiency over existing merge sorters like Bitonic and Odd-Even, and supports arbitrary list sizes.
Findings
LOMS 2-way merge sorter merges 64 values in 2.24 ns, 2.63 times faster than Batcher's sorter.
LOMS 3-way merge sorter merges 21 values in 3.4 ns, 1.36 times faster than previous state-of-the-art.
LOMS devices are resource-efficient, enabling larger merges within FPGA constraints.
Abstract
A new set of hardware merge sort devices are introduced here, which merge multiple sorted input lists into a single sorted output list in a fast and efficient manner. In each merge sorter, the values from the sorted input lists are arranged in an input 2-D setup array, but with the order of each sorted input list offset from the order of each of the other sorted input lists. In these new devices, called List Offset Merge Sorters (LOMS), a minimal set of column sort stages alternating with row sort stages process the input setup array into a final output array, now in the defined sorted order. LOMS 2-way sorters, which merge 2 sorted input lists, require only 2 merge stages and are significantly faster than Kenneth Batcher's previous state-of-the-art 2-way merge devices, Bitonic Merge Sorters and Odd-Even Merge Sorters. LOMS 2-way sorters utilize the recently-introduced Single-Stage…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
