# AnchorNet: A Weakly Supervised Network to Learn Geometry-sensitive   Features For Semantic Matching

**Authors:** David Novotny, Diane Larlus, Andrea Vedaldi

arXiv: 1704.04749 · 2017-04-18

## TL;DR

AnchorNet is a weakly supervised deep network designed to learn geometry-sensitive features that enhance semantic matching across different object instances and categories, overcoming limitations of traditional invariant features.

## Contribution

The paper introduces AnchorNet, a novel deep architecture trained with weak labels that produces geometry-aware features for improved semantic matching.

## Key findings

- Outperforms state-of-the-art methods like deformable spatial pyramid and proposal flow.
- Effective in cross-instance and cross-category semantic matching tasks.
- Learns geometry-sensitive features without extensive supervision.

## Abstract

Despite significant progress of deep learning in recent years, state-of-the-art semantic matching methods still rely on legacy features such as SIFT or HoG. We argue that the strong invariance properties that are key to the success of recent deep architectures on the classification task make them unfit for dense correspondence tasks, unless a large amount of supervision is used. In this work, we propose a deep network, termed AnchorNet, that produces image representations that are well-suited for semantic matching. It relies on a set of filters whose response is geometrically consistent across different object instances, even in the presence of strong intra-class, scale, or viewpoint variations. Trained only with weak image-level labels, the final representation successfully captures information about the object structure and improves results of state-of-the-art semantic matching methods such as the deformable spatial pyramid or the proposal flow methods. We show positive results on the cross-instance matching task where different instances of the same object category are matched as well as on a new cross-category semantic matching task aligning pairs of instances each from a different object class.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1704.04749/full.md

## Figures

14 figures with captions in the complete paper: https://tomesphere.com/paper/1704.04749/full.md

## References

64 references — full list in the complete paper: https://tomesphere.com/paper/1704.04749/full.md

---
Source: https://tomesphere.com/paper/1704.04749