Standard Vs Uniform Binary Search and Their Variants in Learned Static Indexing: The Case of the Searching on Sorted Data Benchmarking Software Platform
Domenico Amato, Giosu\`e Lo Bosco, Raffaele Giancarlo

TL;DR
This paper compares standard and uniform binary search variants in learned static indexes, revealing that alternative search routines can outperform traditional methods depending on architecture, thus guiding better final-stage search choices.
Contribution
It investigates the effectiveness of different binary search variants in learned indexes using the SOSD benchmarking platform, providing architecture-dependent guidelines for search routine selection.
Findings
Alternative search routines can outperform lower_bound in learned indexes.
Performance depends on computer architecture.
Guidelines for choosing search routines in learned indexing are proposed.
Abstract
Learned Indexes are a novel approach to search in a sorted table. A model is used to predict an interval in which to search into and a Binary Search routine is used to finalize the search. They are quite effective. For the final stage, usually, the lower_bound routine of the Standard C++ library is used, although this is more of a natural choice rather than a requirement. However, recent studies, that do not use Machine Learning predictions, indicate that other implementations of Binary Search or variants, namely k-ary Search, are better suited to take advantage of the features offered by modern computer architectures. With the use of the Searching on Sorted Sets SOSD Learned Indexing benchmarking software, we investigate how to choose a Search routine for the final stage of searching in a Learned Index. Our results provide indications that better choices than the lower_bound routine…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Advanced Database Systems and Queries · Data Mining Algorithms and Applications
