No Reason for No Supervision: Improved Generalization in Supervised   Models

Mert Bulent Sariyildiz; Yannis Kalantidis; Karteek Alahari; Diane; Larlus

arXiv:2206.15369·cs.CV·March 13, 2023·6 cites

No Reason for No Supervision: Improved Generalization in Supervised Models

Mert Bulent Sariyildiz, Yannis Kalantidis, Karteek Alahari, Diane, Larlus

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a supervised training method that improves transfer learning performance by optimizing data augmentation and model architecture, achieving state-of-the-art results on ImageNet-1K and transfer tasks.

Contribution

The authors propose a novel supervised learning setup with multi-scale crops and a flexible projector, enhancing transferability without sacrificing in-distribution accuracy.

Findings

01

t-ReX achieves new state-of-the-art transfer learning performance.

02

t-ReX* matches top IN1K accuracy while excelling in transfer tasks.

03

The projector design controls the trade-off between training and transfer performance.

Abstract

We consider the problem of training a deep neural network on a given classification task, e.g., ImageNet-1K (IN1K), so that it excels at both the training task as well as at other (future) transfer tasks. These two seemingly contradictory properties impose a trade-off between improving the model's generalization and maintaining its performance on the original task. Models trained with self-supervised learning tend to generalize better than their supervised counterparts for transfer learning; yet, they still lag behind supervised models on IN1K. In this paper, we propose a supervised learning setup that leverages the best of both worlds. We extensively analyze supervised training using multi-scale crops for data augmentation and an expendable projector head, and reveal that the design of the projector allows us to control the trade-off between performance on the training task and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

naver/trex
pytorch

Videos

No Reason for No Supervision: Improved Generalization in Supervised Models· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI · Advanced Neural Network Applications

MethodsMulti-Head Attention · Attention Is All You Need · Softmax · Layer Normalization · Linear Layer · Dense Connections · Residual Connection · Vision Transformer