Parallel R-tree-based Spatial Query Processing on a Commercial Processing-in-Memory System
Tasmia Jannat, Michael Gowanlock, Satish Puri

TL;DR
This paper presents a novel method for executing R-tree spatial range queries directly inside commercial Processing-in-Memory systems, significantly improving speed and energy efficiency over traditional CPU-based approaches.
Contribution
It introduces the first mapping of R-tree range queries onto commercial PIM hardware, utilizing a broadcast-based approach for scalable, energy-efficient spatial query processing.
Findings
Achieved up to 3.66x speedup with 2,540 DPUs on large datasets.
Reduced energy consumption by approximately 3.4x compared to CPU search.
Demonstrated scalable performance with strong linear speedup as DPU count increases.
Abstract
The growing volume of data in scientific domains has made spatial query processing increasingly challenging due to high data transfer costs across the memory hierarchy and limited memory bandwidth. To address these bottlenecks and reduce the energy consumed on data movement, this work explores Processing-in-Memory (PIM) systems by executing range queries directly inside memory chips. Unlike prior PIM studies centered on linear scans or hash-based queries, this work is the first to map R-tree range queries onto commercial PIM hardware. The proposed broadcast-based method constructs the R-tree bottom-up on the CPU, broadcasts top levels to UPMEM DPUs (DRAM Processing Units) for global filtering, and distributes lower levels for parallel batched queries in a CPU-DPU system. We evaluate our approach on two real spatial datasets, Sports (999K rectangles) and Lakes (8.4M rectangles), and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
