Representation Learning by Learning to Count

Mehdi Noroozi; Hamed Pirsiavash; Paolo Favaro

arXiv:1708.06734·cs.CV·August 23, 2017

Representation Learning by Learning to Count

Mehdi Noroozi, Hamed Pirsiavash, Paolo Favaro

PDF

2 Repos 1 Video

TL;DR

This paper presents a novel representation learning method that leverages counting visual primitives through equivariance relations, eliminating manual annotations and improving transfer learning performance.

Contribution

It introduces a counting-based supervision signal derived from image transformations, enabling unsupervised representation learning without manual labels.

Findings

01

Achieves state-of-the-art transfer learning results

02

Uses scale and tiling transformations for supervision

03

Demonstrates effectiveness of counting-based supervision

Abstract

We introduce a novel method for representation learning that uses an artificial supervision signal based on counting visual primitives. This supervision signal is obtained from an equivariance relation, which does not require any manual annotation. We relate transformations of images to transformations of the representations. More specifically, we look for the representation that satisfies such relation rather than the transformations that match a given representation. In this paper, we use two image transformations in the context of counting: scaling and tiling. The first transformation exploits the fact that the number of visual primitives should be invariant to scale. The second transformation allows us to equate the total number of visual primitives in each tile to that in the whole image. These two transformations are combined in one constraint and used to train a neural network…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Representation Learning by Learning to Count· youtube