Boosting Self-Supervised Learning via Knowledge Transfer

Mehdi Noroozi; Ananth Vinjimoor; Paolo Favaro; Hamed Pirsiavash

arXiv:1805.00385·cs.CV·May 2, 2018

Boosting Self-Supervised Learning via Knowledge Transfer

Mehdi Noroozi, Ananth Vinjimoor, Paolo Favaro, Hamed Pirsiavash

PDF

TL;DR

This paper introduces a new framework for self-supervised learning that enables effective knowledge transfer between models of different architectures and depths, significantly improving performance on standard benchmarks.

Contribution

The framework decouples model structure from the target task, allowing for better model comparison, deeper representations, and knowledge transfer to shallower models, leading to state-of-the-art results.

Findings

01

Deeper models learn better representations from the same pretext task.

02

Knowledge transfer boosts shallow model performance.

03

Achieved state-of-the-art results on PASCAL VOC 2007, ILSVRC12, and Places.

Abstract

In self-supervised learning, one trains a model to solve a so-called pretext task on a dataset without the need for human annotation. The main objective, however, is to transfer this model to a target domain and task. Currently, the most effective transfer strategy is fine-tuning, which restricts one to use the same model or parts thereof for both pretext and target tasks. In this paper, we present a novel framework for self-supervised learning that overcomes limitations in designing and comparing different tasks, models, and data domains. In particular, our framework decouples the structure of the self-supervised model from the final task-specific fine-tuned model. This allows us to: 1) quantitatively assess previously incompatible models including handcrafted features; 2) show that deeper neural network models can learn better representations from the same pretext task; 3) transfer…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.