Orientation covariant aggregation of local descriptors with embeddings
Giorgos Tolias (INRIA), Teddy Furon (INRIA), Herv\'e J\'egou (INRIA)

TL;DR
This paper proposes a novel orientation covariant aggregation method for local descriptors that encodes angles continuously, improving image search performance without relying on codebooks, and is compatible with popular encoding techniques.
Contribution
It introduces a continuous angle encoding in local descriptor aggregation, enabling orientation covariance and compatibility with existing encoding methods.
Findings
Effective for image and object retrieval tasks.
Compatible with bag-of-words, VLAD, and Fisher vector.
Improves retrieval accuracy on standard benchmarks.
Abstract
Image search systems based on local descriptors typically achieve orientation invariance by aligning the patches on their dominant orientations. Albeit successful, this choice introduces too much invariance because it does not guarantee that the patches are rotated consistently. This paper introduces an aggregation strategy of local descriptors that achieves this covariance property by jointly encoding the angle in the aggregation stage in a continuous manner. It is combined with an efficient monomial embedding to provide a codebook-free method to aggregate local descriptors into a single vector representation. Our strategy is also compatible and employed with several popular encoding methods, in particular bag-of-words, VLAD and the Fisher vector. Our geometric-aware aggregation strategy is effective for image search, as shown by experiments performed on standard benchmarks for image…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Remote-Sensing Image Classification
