Efficient Management of High-Frequency Sensor Data Streams Using a Read-Optimized Learned Index
Hu Luo, Jiabao Wen, Desheng Chen, Zhengjian Li, Meng Xi, Jingyi He, Shuai Xiao, Jiachen Yang

TL;DR
This paper introduces DyGLIN, a new index system for managing high-frequency sensor data in IoT environments, which improves query performance and handles updates more efficiently.
Contribution
DyGLIN introduces a decoupled leaf architecture with hierarchical filtering and a Delta Buffer mechanism for efficient spatial indexing in dynamic IoT workloads.
Findings
DyGLIN reduces query latency by 26.4% compared to GLIN.
It achieves 30.0% higher insertion throughput with minimal memory overhead increase.
The system shows superior deletion performance in dynamic workloads.
Abstract
The rapid growth of sensor data in IoT and Digital Twins necessitates high-performance spatial indexing. Traditional indexes like Rtrees suffer from high storage overhead, while state-of-the-art learned indexes like GLIN encounter a “Refinement Bottleneck” due to coarse-grained Minimum Bounding Rectangle (MBR) filtering. Furthermore, existing solutions often trade update throughput for query accuracy, failing in dynamic IoT workloads with concurrent reads and writes. We propose DyGLIN (Dynamic Generate Learning-Based Index), a dynamic, read-optimized learned spatial index tailored for high-frequency sensor streams. DyGLIN introduces a decoupled leaf architecture separating query processing from data maintenance. To accelerate queries, we implement a hierarchical filtering pipeline using hierarchical MBRs (HMBR) and Cuckoo Filters to aggressively prune false positives. For maintenance, a…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Database Systems and Queries · Time Series Analysis and Forecasting · Graph Theory and Algorithms
