Optimal On The Fly Index Selection in Polynomial Time
Herbert Jordan, Bernhard Scholz, Pavle Suboti\'c

TL;DR
This paper introduces a polynomial-time algorithm for optimal index selection tailored for in-memory systems and complex query languages like Datalog, significantly improving query acceleration efficiency.
Contribution
It presents the first polynomial-time solution for minimal index set selection considering complex query decompositions and algebraic properties, addressing a previously intractable problem.
Findings
The algorithm efficiently finds minimal index sets for large in-memory relations.
It leverages a partial order and Dilworth's theorem to reduce complexity.
Experimental results show scalability to billions of entries.
Abstract
The index selection problem (ISP) is an important problem for accelerating the execution of relational queries, and it has received a lot of attention as a combinatorial knapsack problem in the past. Various solutions to this very hard problem have been provided. In contrast to existing literature, we change the underlying assumptions of the problem definition: we adapt the problem for systems that store relations in memory, and use complex specification languages, e.g., Datalog. In our framework, we decompose complex queries into primitive searches that select tuples in a relation for which an equality predicate holds. A primitive search can be accelerated by an index exhibiting a worst-case run-time complexity of log-linear time in the size of the output result of the primitive search. However, the overheads associated with maintaining indexes are very costly in terms of memory and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Database Systems and Queries · Data Management and Algorithms · Semantic Web and Ontologies
