# Local Features and Visual Words Emerge in Activations

**Authors:** Oriane Sim\'eoni, Yannis Avrithis, Ondrej Chum

arXiv: 1905.06358 · 2019-11-19

## TL;DR

This paper introduces a deep spatial matching method that leverages local features and visual words emerging from CNN activations for improved image retrieval, without additional training or feature detection.

## Contribution

The proposed DSM method achieves state-of-the-art image retrieval performance using only CNN activations, without local feature detection, descriptors, or visual vocabularies.

## Key findings

- State-of-the-art performance on standard benchmarks
- Significant gains with diffusion on nearest-neighbor graph
- Effective across different network architectures

## Abstract

We propose a novel method of deep spatial matching (DSM) for image retrieval. Initial ranking is based on image descriptors extracted from convolutional neural network activations by global pooling, as in recent state-of-the-art work. However, the same sparse 3D activation tensor is also approximated by a collection of local features. These local features are then robustly matched to approximate the optimal alignment of the tensors. This happens without any network modification, additional layers or training. No local feature detection happens on the original image. No local feature descriptors and no visual vocabulary are needed throughout the whole process.   We experimentally show that the proposed method achieves the state-of-the-art performance on standard benchmarks across different network architectures and different global pooling methods. The highest gain in performance is achieved when diffusion on the nearest-neighbor graph of global descriptors is initiated from spatially verified images.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.06358/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/1905.06358/full.md

## References

43 references — full list in the complete paper: https://tomesphere.com/paper/1905.06358/full.md

---
Source: https://tomesphere.com/paper/1905.06358