BMTree: Designing, Learning, and Updating Piecewise Space-Filling Curves for Multi-Dimensional Data Indexing
Jiangneng Li, Yuang Liu, Zheng Wang, Gao Cong, Cheng Long, Walid G., Aref, Han Mao Kiah, Bin Cui

TL;DR
This paper introduces BMTree, a novel piecewise space-filling curve structure that uses multiple mapping schemes for different data subspaces, optimized via reinforcement learning for improved multi-dimensional data indexing.
Contribution
The paper proposes BMTree, a new data structure for multi-dimensional indexing that employs piecewise SFCs and a learning-based approach for construction and updates.
Findings
BMTree outperforms existing SFCs in indexing efficiency.
Reinforcement learning effectively guides BMTree construction.
Fast update mechanism adapts to data distribution changes.
Abstract
Space-filling curves (SFC, for short) have been widely applied to index multi-dimensional data, which first maps the data to one dimension, and then a one-dimensional indexing method, e.g., the B-tree indexes the mapped data. Existing SFCs adopt a single mapping scheme for the whole data space. However, a single mapping scheme often does not perform well on all the data space. In this paper, we propose a new type of SFC called piecewise SFCs that adopts different mapping schemes for different data subspaces. Specifically, we propose a data structure termed the Bit Merging tree (BMTree) that can generate data subspaces and their SFCs simultaneously, and achieve desirable properties of the SFC for the whole data space. Furthermore, we develop a reinforcement learning-based solution to build the BMTree, aiming to achieve excellent query performance. To update the BMTree efficiently when…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Advanced Clustering Algorithms Research · Rough Sets and Fuzzy Logic
