How Should We Evaluate Data Deletion in Graph-Based ANN Indexes?
Tomohiro Yamashita, Daichi Amagata, Yusuke Matsui

TL;DR
This paper introduces a comprehensive evaluation framework for data deletion in graph-based Approximate Nearest Neighbor Search (ANNS) indexes, addressing a gap in assessing dynamic data support in practical applications.
Contribution
It formalizes and categorizes data deletion methods in graph-based ANNS, proposes evaluation metrics, and demonstrates the framework on Hierarchical Navigable Small World, including a new deletion control method.
Findings
Evaluation framework effectively measures deletion efficiency
Hierarchical Navigable Small World performance analyzed under deletion scenarios
Deletion Control dynamically optimizes deletion method selection
Abstract
Approximate Nearest Neighbor Search (ANNS) has recently gained significant attention due to its many applications, such as Retrieval-Augmented Generation. Such applications require ANNS algorithms that support dynamic data, so the ANNS problem on dynamic data has attracted considerable interest. However, a comprehensive evaluation methodology for data deletion in ANNS has yet to be established. This study proposes an experimental framework and comprehensive evaluation metrics to assess the efficiency of data deletion for ANNS indexes under practical use cases. Specifically, we categorize data deletion methods in graph-based ANNS into three approaches and formalize them mathematically. The performance is assessed in terms of accuracy, query speed, and other relevant metrics. Finally, we apply the proposed evaluation framework to Hierarchical Navigable Small World, one of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Information Retrieval and Search Behavior · Advanced Database Systems and Queries
