Frequency-Aware Graph Construction and Search for Dynamic Vector Databases
Yifan Zhu, Ruijie Zhao, Zhonggen Li, Baihua Zheng, Zhikun Zhang, Zhaoqiang Chen, Congcong Ge

TL;DR
This paper introduces DQF, a dual-index framework for dynamic vector databases that adaptively manages hot and cold data, significantly improving search speed while maintaining high recall in changing query environments.
Contribution
The paper proposes a novel dual-layer index and dynamic search strategy that effectively handles fluctuating query frequencies without full index rebuilds.
Findings
Achieves 2.0-5.7x speedup over state-of-the-art methods
Maintains 95% recall rate in dynamic query scenarios
Avoids full index reconstruction despite changing query distributions
Abstract
Approximate Nearest Neighbor Search (ANNS) is a crucial operation in databases and artificial intelligence. While graph-based ANNS methods like HNSW and NSG excel in performance, they assume uniform query distribution. However, in real-world scenarios, user preferences and temporal dynamics often result in certain data points being queried more frequently than others, and these query patterns can change over time. To better leverage such characteristics, we propose DQF, a novel Dual-Index Query Framework. This framework features a dual-layer index structure and a dynamic search strategy based on a decision tree. The dual-layer index includes a hot index for high-frequency nodes and a full index covering the entire dataset, allowing for the separate management of hot and cold queries. Furthermore, we propose a dynamic search strategy that employs a decision tree to determine whether a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Graph Theory and Algorithms · Advanced Database Systems and Queries
