Parallel $k$d-tree with Batch Updates
Ziyang Men, Zheqi Shen, Yan Gu, Yihan Sun

TL;DR
This paper introduces Pkd-tree, a highly parallel and cache-efficient in-memory $k$d-tree supporting fast construction, batch updates, and various queries, with strong theoretical guarantees and superior practical performance.
Contribution
The paper presents the Pkd-tree, a novel parallel $k$d-tree with optimized algorithms for construction and updates, backed by theoretical bounds and practical implementation.
Findings
Pkd-tree outperforms existing implementations in construction and update speed.
The algorithms achieve strong theoretical bounds on work, span, and cache complexity.
Experimental results show competitive query performance with significantly faster updates.
Abstract
The d-tree is one of the most widely used data structures to manage multi-dimensional data. Due to the ever-growing data volume, it is imperative to consider parallelism in d-trees. However, we observed challenges in existing parallel kd-tree implementations, for both constructions and updates. The goal of this paper is to develop efficient in-memory d-trees by supporting high parallelism and cache-efficiency. We propose the Pkd-tree (Parallel d-tree), a parallel d-tree that is efficient both in theory and in practice. The Pkd-tree supports parallel tree construction, batch update (insertion and deletion), and various queries including k-nearest neighbor search, range query, and range count. We proved that our algorithms have strong theoretical bounds in work (sequential time complexity), span (parallelism), and cache complexity. Our key techniques include 1) an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Advanced Database Systems and Queries · Advanced Data Storage Technologies
