Updatable Balanced Index for Stable Streaming Similarity Search over Large-Scale Fresh Vectors
Yuhui Lai, Shixun Huang, Sheng Wang

TL;DR
This paper introduces UBIS, an updatable balanced index designed for stable streaming similarity search over large-scale vectors, improving accuracy and update throughput in dynamic data environments.
Contribution
UBIS provides a novel indexing approach that efficiently handles real-time updates and maintains index quality amid high-frequency data streams.
Findings
Achieves up to 77% higher search accuracy.
Provides 45% higher update throughput.
Effectively manages streaming data updates.
Abstract
As artificial intelligence gains more and more popularity, vectors are one of the most widely used data structures for services such as information retrieval and recommendation. Approximate Nearest Neighbor Search (ANNS), which generally relies on indices optimized for fast search to organize large datasets, has played a core role in these popular services. As the frequency of data shift grows, it is crucial for indices to accommodate new data and support real-time updates. Existing researches adopting two different approaches hold the following drawbacks: 1) approaches using additional buffers to temporarily store new data are resource-intensive and inefficient due to the global rebuilding processes; 2) approaches upgrading the internal index structure suffer from performance degradation because of update congestion and imbalanced distribution in streaming workloads. In this paper, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Caching and Content Delivery · Peer-to-Peer Network Technologies
