Learning to Index for Nearest Neighbor Search

Chih-Yi Chiu, Amorntip Prayoonwong, and Yin-Chih Liao

arXiv:1807.02962·cs.IR·May 1, 2019

TL;DR

This paper introduces a neural network-based ranking model for approximate nearest neighbor search. Instead of ranking clusters by query-centroid distance, it estimates each cluster's nearest neighbor probability, leading to better accuracy and efficiency on large-scale datasets.

Contribution

It proposes a novel probability-based ranking method that replaces traditional distance-based ranking in nearest neighbor search, enhancing accuracy.

Findings

01. Boosts search performance on billion-scale datasets.

02. Outperforms conventional distance-based methods.

03. Effective in large-scale approximate nearest neighbor search.

Abstract

In this study, we present a novel ranking model based on learning neighborhood relationships embedded in the index space. Given a query point, conventional approximate nearest neighbor search calculates the distances to the cluster centroids, before ranking the clusters from near to far based on the distances. The data indexed in the top-ranked clusters are retrieved and treated as the nearest neighbor candidates for the query. However, the loss of quantization between the data and cluster centroids will inevitably harm the search accuracy. To address this problem, the proposed model ranks clusters based on their nearest neighbor probabilities rather than the query-centroid distances. The nearest neighbor probabilities are estimated by employing neural networks to characterize the neighborhood relationships, i.e., the density function of nearest neighbors with respect to the query. The…
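
The contrast described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the toy data, and the probability values are all assumptions made for illustration, and the probabilities are hand-written placeholders rather than the output of a trained neural network.

```python
import math

def rank_clusters_by_distance(query, centroids):
    """Conventional ranking: cluster ids ordered by query-centroid distance, near to far."""
    return sorted(range(len(centroids)), key=lambda c: math.dist(query, centroids[c]))

def rank_clusters_by_probability(probabilities):
    """Probability-based ranking: cluster ids ordered from most to least likely."""
    return sorted(range(len(probabilities)), key=lambda c: -probabilities[c])

def retrieve_candidates(ranking, inverted_lists, n_probe):
    """Collect the ids of all points indexed in the n_probe top-ranked clusters."""
    candidates = []
    for cluster_id in ranking[:n_probe]:
        candidates.extend(inverted_lists[cluster_id])
    return candidates

# Toy 2-D index (all values are made up for illustration).
data = [(0.0, 0.0), (4.9, 0.0), (10.0, 0.0), (11.0, 0.0), (0.0, 10.0)]
centroids = [(0.0, 0.0), (10.0, 0.0), (0.0, 10.0)]
# Each point is indexed under its nearest centroid.
inverted_lists = {0: [0, 1], 1: [2, 3], 2: [4]}

query = (6.0, 0.0)
# Exhaustive search gives the true nearest neighbor for comparison.
true_nn = min(range(len(data)), key=lambda i: math.dist(query, data[i]))

distance_ranking = rank_clusters_by_distance(query, centroids)
distance_candidates = retrieve_candidates(distance_ranking, inverted_lists, n_probe=1)

# Placeholder probabilities standing in for a learned model's output (hypothetical).
placeholder_probs = [0.6, 0.3, 0.1]
probability_ranking = rank_clusters_by_probability(placeholder_probs)
probability_candidates = retrieve_candidates(probability_ranking, inverted_lists, n_probe=1)
```

In this toy setup the query is closer to centroid 1, but its true nearest neighbor (point 1) is indexed under cluster 0, so probing only the top distance-ranked cluster misses it. Any ranking that places cluster 0 first would recover it, which is the kind of error a probability-based ranking is meant to reduce.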

Taxonomy

Topics: Advanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Data Management and Algorithms