Self-supervised visual learning in the low-data regime: a comparative   evaluation

Sotirios Konstantakos; Jorgen Cani; Ioannis Mademlis; Despina Ioanna; Chalkiadaki; Yuki M. Asano; Efstratios Gavves; Georgios Th. Papadopoulos

arXiv:2404.17202·cs.CV·December 30, 2024

Self-supervised visual learning in the low-data regime: a comparative evaluation

Sotirios Konstantakos, Jorgen Cani, Ioannis Mademlis, Despina Ioanna, Chalkiadaki, Yuki M. Asano, Efstratios Gavves, Georgios Th. Papadopoulos

PDF

Open Access

TL;DR

This paper evaluates the effectiveness of self-supervised learning (SSL) in low-data scenarios for visual tasks, showing that domain-specific SSL pretraining can outperform large-scale general dataset pretraining.

Contribution

It provides a comparative analysis of modern SSL methods in low-data regimes, highlighting their performance and behavior in domain-specific applications.

Findings

01

In-domain low-data SSL pretraining often surpasses large-scale general dataset pretraining.

02

Different SSL categories exhibit varied behaviors in low-data settings.

03

SSL can effectively learn representations even with limited data in domain-specific tasks.

Abstract

Self-Supervised Learning (SSL) is a valuable and robust training methodology for contemporary Deep Neural Networks (DNNs), enabling unsupervised pretraining on a 'pretext task' that does not require ground-truth labels/annotation. This allows efficient representation learning from massive amounts of unlabeled training data, which in turn leads to increased accuracy in a 'downstream task' by exploiting supervised transfer learning. Despite the relatively straightforward conceptualization and applicability of SSL, it is not always feasible to collect and/or to utilize very large pretraining datasets, especially when it comes to real-world application settings. In particular, in cases of specialized and domain-specific application scenarios, it may not be achievable or practical to assemble a relevant image pretraining dataset in the order of millions of instances or it could be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOnline Learning and Analytics