# Discriminate-and-Rectify Encoders: Learning from Image Transformation   Sets

**Authors:** Andrea Tacchetti, Stephen Voinea, Georgios Evangelopoulos

arXiv: 1703.04775 · 2017-03-16

## TL;DR

This paper proposes a novel weakly supervised learning framework that learns transformation-robust image embeddings using orbit sets, deep parametrizations, and a new orbit-based loss, improving recognition tasks under visual variability.

## Contribution

It introduces a new orbit-based loss and a framework for learning transformation-invariant embeddings from sets of transformed images, enhancing weakly supervised recognition.

## Key findings

- Embeddings improve one-shot classification under geometric transformations.
- Enhanced face verification and retrieval under visual variability.
- Orbit sets enable efficient weakly-supervised learning.

## Abstract

The complexity of a learning task is increased by transformations in the input space that preserve class identity. Visual object recognition for example is affected by changes in viewpoint, scale, illumination or planar transformations. While drastically altering the visual appearance, these changes are orthogonal to recognition and should not be reflected in the representation or feature encoding used for learning. We introduce a framework for weakly supervised learning of image embeddings that are robust to transformations and selective to the class distribution, using sets of transforming examples (orbit sets), deep parametrizations and a novel orbit-based loss. The proposed loss combines a discriminative, contrastive part for orbits with a reconstruction error that learns to rectify orbit transformations. The learned embeddings are evaluated in distance metric-based tasks, such as one-shot classification under geometric transformations, as well as face verification and retrieval under more realistic visual variability. Our results suggest that orbit sets, suitably computed or observed, can be used for efficient, weakly-supervised learning of semantically relevant image embeddings.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1703.04775/full.md

## Figures

108 figures with captions in the complete paper: https://tomesphere.com/paper/1703.04775/full.md

## References

43 references — full list in the complete paper: https://tomesphere.com/paper/1703.04775/full.md

---
Source: https://tomesphere.com/paper/1703.04775