Analytic Performance Model of a Main-Memory Index Structure
Jonas Schneider

TL;DR
This paper develops a numerical cost model for the Elf, a main-memory index structure optimized for multi-dimensional low-selectivity queries, enabling accurate prediction of search performance and guiding parameter tuning.
Contribution
It introduces a novel cost model for the Elf index, predicting its search region size and response time, addressing the need for robust configuration in high-dimensional data.
Findings
Cost model predicts Elf region size with less than 5% error for up to 15 dimensions.
Elf region size correlates strongly with search response time, with 80% accuracy.
Skewed data distributions can be modeled as reductions in attribute cardinality.
Abstract
Efficient evaluation of multi-dimensional range queries in a main-memory database is an important, but difficult task. State-of-the-art techniques rely on optimised sequential scans or tree-based structures. For range queries with small result sets, sequential scans exhibit poor asymptotic performance. Also, as the dimensionality of the data set increases, the performance of tree-based structures degenerates due to the curse of dimensionality. Recent literature proposed the Elf, a main-memory structure that is optimised for the case of such multi-dimensional low-selectivity queries. The Elf outperforms other state-of-the-art methods in manually tuned scenarios. However, choosing an optimal parameter configuration for the Elf is vital, since for poor configurations, the search performance degrades rapidly. Consequently, further knowledge about the behaviour of the Elf in different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Advanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques
