Deep Unsupervised Learning of Visual Similarities

Artsiom Sanakoyeu; Miguel A. Bautista; Bj\"orn Ommer

arXiv:1802.08562·cs.CV·February 26, 2018

Deep Unsupervised Learning of Visual Similarities

Artsiom Sanakoyeu, Miguel A. Bautista, Bj\"orn Ommer

PDF

TL;DR

This paper introduces an unsupervised deep learning method for visual similarity that groups similar samples and learns a unified representation without labels, addressing challenges of data imbalance and unreliable relationships.

Contribution

It proposes a novel optimization framework that uses weak similarity estimates to form consistent sample batches, enabling CNNs to learn visual similarities without supervision.

Findings

01

Achieved competitive results in posture analysis

02

Performed well in object classification tasks

03

Effectively groups similar samples without labels

Abstract

Exemplar learning of visual similarities in an unsupervised manner is a problem of paramount importance to Computer Vision. In this context, however, the recent breakthrough in deep learning could not yet unfold its full potential. With only a single positive sample, a great imbalance between one positive and many negatives, and unreliable relationships between most samples, training of Convolutional Neural networks is impaired. In this paper we use weak estimates of local similarities and propose a single optimization problem to extract batches of samples with mutually consistent relations. Conflicting relations are distributed over different batches and similar samples are grouped into compact groups. Learning visual similarities is then framed as a sequence of categorization tasks. The CNN then consolidates transitivity relations within and between groups and learns a single…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.