The Case for Learned Spatial Indexes
Varun Pandey, Alexander van Renen, Andreas Kipf, Ibrahim Sabek, Jialin, Ding, Alfons Kemper

TL;DR
This paper demonstrates that learned multi-dimensional index structures significantly improve the efficiency of spatial range queries, outperforming classical indexes especially for low selectivity queries.
Contribution
It adapts a state-of-the-art learned multi-dimensional index to classical spatial indexes and shows performance gains through extensive tuning and evaluation.
Findings
Machine learned search is 11.79% to 39.51% faster than binary search.
Linearizing partitions can reduce index lookup bottlenecks.
Learned indexes outperform traditional methods on low selectivity queries.
Abstract
Spatial data is ubiquitous. Massive amounts of data are generated every day from billions of GPS-enabled devices such as cell phones, cars, sensors, and various consumer-based applications such as Uber, Tinder, location-tagged posts in Facebook, Twitter, Instagram, etc. This exponential growth in spatial data has led the research community to focus on building systems and applications that can process spatial data efficiently. In the meantime, recent research has introduced learned index structures. In this work, we use techniques proposed from a state-of-the art learned multi-dimensional index structure (namely, Flood) and apply them to five classical multi-dimensional indexes to be able to answer spatial range queries. By tuning each partitioning technique for optimal performance, we show that (i) machine learned search within a partition is faster by 11.79\% to 39.51\% than binary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Geographic Information Systems Studies · Advanced Database Systems and Queries
