Distilling Visual Priors from Self-Supervised Learning

Bingchen Zhao; Xin Wen

arXiv:2008.00261·cs.CV·August 4, 2020

Distilling Visual Priors from Self-Supervised Learning

Bingchen Zhao, Xin Wen

PDF

Open Access 1 Repo

TL;DR

This paper introduces a two-phase approach combining self-supervised learning and knowledge distillation to enhance CNN generalization on small datasets, featuring a novel margin loss for contrastive learning.

Contribution

It proposes a new pipeline that distills visual priors from self-supervised models into CNNs, improving performance in data-limited image classification tasks.

Findings

01

Achieves competitive results in VIPriors challenge.

02

Introduces a novel margin loss for contrastive learning.

03

Demonstrates improved generalization on small datasets.

Abstract

Convolutional Neural Networks (CNNs) are prone to overfit small training datasets. We present a novel two-phase pipeline that leverages self-supervised learning and knowledge distillation to improve the generalization ability of CNN models for image classification under the data-deficient setting. The first phase is to learn a teacher model which possesses rich and generalizable visual representations via self-supervised learning, and the second phase is to distill the representations into a student model in a self-distillation manner, and meanwhile fine-tune the student model for the image classification task. We also propose a novel margin loss for the self-supervised contrastive learning proxy task to better learn the representation under the data-deficient scenario. Together with other tricks, we achieve competitive performance in the VIPriors image classification challenge.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

DTennant/distill_visual_priors
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications

MethodsContrastive Learning · Knowledge Distillation