Spreading vectors for similarity search

Alexandre Sablayrolles; Matthijs Douze; Cordelia Schmid; Hervé Jégou

arXiv:1806.03198 · stat.ML · September 2, 2019 · 26 cites

TL;DR

This paper introduces a novel approach to similarity search by training neural networks to adapt data to fixed, parameter-free quantizers, improving performance and flexibility in high-dimensional indexing.

Contribution

It proposes reversing the traditional quantizer training paradigm by adapting data to a fixed quantizer using neural networks with a new uniformity regularizer.

Findings

1. Outperforms most learned quantization methods
2. Competitive with state-of-the-art benchmarks
3. Training without quantization maintains accuracy

Abstract

Discretizing multi-dimensional data distributions is a fundamental step of modern indexing methods. State-of-the-art techniques learn parameters of quantizers on training data for optimal performance, thus adapting quantizers to the data. In this work, we propose to reverse this paradigm and adapt the data to the quantizer: we train a neural net whose last layer forms a fixed parameter-free quantizer, such as pre-defined points of a hyper-sphere. As a proxy objective, we design and train a neural network that favors uniformity in the spherical latent space, while preserving the neighborhood structure after the mapping. We propose a new regularizer derived from the Kozachenko–Leonenko differential entropy estimator to enforce uniformity and combine it with a locality-aware triplet loss. Experiments show that our end-to-end approach outperforms most learned quantization methods, and is…
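
The uniformity regularizer mentioned in the abstract can be illustrated with a small sketch. It assumes the regularizer takes the form of the negative mean log distance from each point in a batch to its nearest neighbor, which is one natural way to derive a spreading term from the Kozachenko–Leonenko estimator. The function name `koleo_regularizer`, the `eps` parameter, and the NumPy implementation are illustrative choices, not taken from this page or from the authors' code.

```python
import numpy as np

def koleo_regularizer(z, eps=1e-12):
    """Negative mean log nearest-neighbor distance over the rows of z.

    z has shape (n, d) and holds one embedded point per row.
    Lower values mean the points are more spread out.
    This form is an assumption made for illustration.
    """
    # Pairwise Euclidean distances via broadcasting, shape (n, n).
    diff = z[:, None, :] - z[None, :, :]
    dist = np.linalg.norm(diff, axis=-1)
    # Exclude each point's zero distance to itself.
    np.fill_diagonal(dist, np.inf)
    # Distance from each point to its nearest other point.
    rho = dist.min(axis=1)
    # eps (hypothetical safeguard) avoids log(0) for duplicate points.
    return -np.mean(np.log(rho + eps))

# Four points evenly spaced on the unit circle.
spread = np.array([[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [0.0, -1.0]])

# Four points crowded near (1, 0), projected back onto the unit circle.
clumped = np.array([[1.0, 0.0], [1.0, 0.05], [1.0, -0.05], [1.0, 0.1]])
clumped = clumped / np.linalg.norm(clumped, axis=1, keepdims=True)

loss_spread = koleo_regularizer(spread)
loss_clumped = koleo_regularizer(clumped)
```

Under this assumed form, the evenly spaced points yield a lower value than the crowded ones, so minimizing the term pushes embedded points apart. In training it would be added, with a weight, to the triplet loss that preserves neighborhood structure.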

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition · Advanced Image and Video Retrieval Techniques