Querying in Constant Expected Time with Learned Indexes
Luis Croquevielle, Guang Yang, Liang Liang, Ali Hadian, Thomas, Heinis

TL;DR
This paper proves that learned indexes can achieve constant expected query time with linear space under weaker assumptions, advancing the theoretical understanding and practical estimation of their performance.
Contribution
It establishes the first constant expected time bound for learned indexes with linear space under weaker probabilistic assumptions, and introduces a new measure of data complexity.
Findings
Achieves $O(1)$ expected query time with linear space.
Introduces a new statistical complexity measure for datasets.
Generalizes previous results with weaker assumptions.
Abstract
Learned indexes leverage machine learning models to accelerate query answering in databases, showing impressive practical performance. However, theoretical understanding of these methods remains incomplete. Existing research suggests that learned indexes have superior asymptotic complexity compared to their non-learned counterparts, but these findings have been established under restrictive probabilistic assumptions. Specifically, for a sorted array with elements, it has been shown that learned indexes can find a key in expected time using at most linear space, compared with for non-learned methods. In this work, we prove expected time can be achieved with at most linear space, thereby establishing the tightest upper bound so far for the time complexity of an asymptotically optimal learned index. Notably, we use weaker probabilistic assumptions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
