Investigating Self-Supervised Methods for Label-Efficient Learning

Srinivasa Rao Nandam; Sara Atito; Zhenhua Feng; Josef Kittler,; Muhammad Awais

arXiv:2406.17460·cs.CV·June 26, 2024

Investigating Self-Supervised Methods for Label-Efficient Learning

Srinivasa Rao Nandam, Sara Atito, Zhenhua Feng, Josef Kittler,, Muhammad Awais

PDF

Open Access

TL;DR

This paper systematically compares self-supervised learning methods for vision transformers in low-shot scenarios, proposing a combined framework that improves performance across various downstream tasks.

Contribution

It introduces a novel framework combining mask image modelling and clustering, enhancing low-shot learning capabilities of vision transformers.

Findings

01

Combined framework outperforms individual methods in low-shot tasks

02

Centring, ME-MAX, sinkhorn improve downstream performance

03

Performance gains observed on full-scale datasets across tasks

Abstract

Vision transformers combined with self-supervised learning have enabled the development of models which scale across large datasets for several downstream tasks like classification, segmentation and detection. The low-shot learning capability of these models, across several low-shot downstream tasks, has been largely under explored. We perform a system level study of different self supervised pretext tasks, namely contrastive learning, clustering, and masked image modelling for their low-shot capabilities by comparing the pretrained models. In addition we also study the effects of collapse avoidance methods, namely centring, ME-MAX, sinkhorn, on these downstream tasks. Based on our detailed analysis, we introduce a framework involving both mask image modelling and clustering as pretext tasks, which performs better across all low-shot downstream tasks, including multi-class…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText and Document Classification Technologies