Multidimensional Range Queries on Modern Hardware
Stefan Sprenger, Patrick Sch\"afer, Ulf Leser

TL;DR
This paper evaluates whether traditional multidimensional index structures still outperform scans on modern hardware with large memory and parallel processing, finding that scans are generally more efficient for most query selectivities.
Contribution
The study provides a comprehensive performance comparison of classical MDIS and parallel scans on modern hardware, challenging longstanding assumptions.
Findings
Scanning outperforms MDIS for most query selectivities on modern hardware.
Parallelization significantly improves scan performance.
Modern hardware features diminish the advantages of traditional index structures.
Abstract
Range queries over multidimensional data are an important part of database workloads in many applications. Their execution may be accelerated by using multidimensional index structures (MDIS), such as kd-trees or R-trees. As for most index structures, the usefulness of this approach depends on the selectivity of the queries, and common wisdom told that a simple scan beats MDIS for queries accessing more than 15%-20% of a dataset. However, this wisdom is largely based on evaluations that are almost two decades old, performed on data being held on disks, applying IO-optimized data structures, and using single-core systems. The question is whether this rule of thumb still holds when multidimensional range queries (MDRQ) are performed on modern architectures with large main memories holding all data, multi-core CPUs and data-parallel instruction sets. In this paper, we study the question…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
