Querying in Constant Expected Time with Learned Indexes

Luis Croquevielle; Guang Yang; Liang Liang; Ali Hadian; Thomas; Heinis

arXiv:2405.03851·cs.DB·October 23, 2024

Querying in Constant Expected Time with Learned Indexes

Luis Croquevielle, Guang Yang, Liang Liang, Ali Hadian, Thomas, Heinis

PDF

TL;DR

This paper proves that learned indexes can achieve constant expected query time with linear space under weaker assumptions, advancing the theoretical understanding and practical estimation of their performance.

Contribution

It establishes the first constant expected time bound for learned indexes with linear space under weaker probabilistic assumptions, and introduces a new measure of data complexity.

Findings

01

Achieves $O(1)$ expected query time with linear space.

02

Introduces a new statistical complexity measure for datasets.

03

Generalizes previous results with weaker assumptions.

Abstract

Learned indexes leverage machine learning models to accelerate query answering in databases, showing impressive practical performance. However, theoretical understanding of these methods remains incomplete. Existing research suggests that learned indexes have superior asymptotic complexity compared to their non-learned counterparts, but these findings have been established under restrictive probabilistic assumptions. Specifically, for a sorted array with $n$ elements, it has been shown that learned indexes can find a key in $O (lo g (lo g n))$ expected time using at most linear space, compared with $O (lo g n)$ for non-learned methods. In this work, we prove $O (1)$ expected time can be achieved with at most linear space, thereby establishing the tightest upper bound so far for the time complexity of an asymptotically optimal learned index. Notably, we use weaker probabilistic assumptions…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.