On Storage Neural Network Augmented Approximate Nearest Neighbor Search
Taiga Ikeda, Daisuke Miyashita, Jun Deguchi

TL;DR
This paper introduces a neural network-augmented approach for storage-based approximate nearest neighbor search, optimizing cluster selection to reduce data fetches and improve recall in large-scale datasets.
Contribution
It proposes a novel neural network method for predicting relevant clusters, enhancing storage-based ANN efficiency over existing techniques.
Findings
Achieves 90% recall on SIFT1M dataset.
Reduces data fetched by 80% compared to SPANN.
Reduces data fetched by 58% compared to exhaustive k-means.
Abstract
Large-scale approximate nearest neighbor search (ANN) has been gaining attention along with the latest machine learning researches employing ANNs. If the data is too large to fit in memory, it is necessary to search for the most similar vectors to a given query vector from the data stored in storage devices, not from that in memory. The storage device such as NAND flash memory has larger capacity than the memory device such as DRAM, but they also have larger latency to read data. Therefore, ANN methods for storage require completely different approaches from conventional in-memory ANN methods. Since the approximation that the time required for search is determined only by the amount of data fetched from storage holds under reasonable assumptions, our goal is to minimize it while maximizing recall. For partitioning-based ANNs, vectors are partitioned into clusters in the index building…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Image Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques
MethodsSoftmax · Attention Is All You Need · k-Means Clustering
