On the Discriminability of Self-Supervised Representation Learning

Zeen Song; Wenwen Qiang; Changwen Zheng; Fuchun Sun; Hui Xiong

arXiv:2407.13541 · cs.CV · August 5, 2025 · 1 citation

Open Access

TL;DR

This paper analyzes the discriminability issues in self-supervised learning, identifies the crowding problem, and proposes a novel Dynamic Semantic Adjuster to improve feature separation and overall performance.

Contribution

The paper introduces a theoretical framework linking SSL objectives to cross-entropy risk bounds and proposes DSA, a learnable regulator that improves the discriminability of SSL features.

Findings

01. DSA significantly improves SSL performance on benchmark datasets.

02. Theoretical analysis explains how reducing intra-class variance benefits generalization.

03. Addressing the crowding problem narrows the gap between SSL and supervised learning.

Abstract

Self-supervised learning (SSL) has recently shown notable success in various visual tasks. However, in terms of discriminability, SSL is still not on par with supervised learning (SL). This paper identifies a key issue, the "crowding problem," where features from different classes are not well-separated, and there is high intra-class variance. In contrast, SL ensures clear class separation. Our analysis reveals that SSL objectives do not adequately constrain the relationships between samples and their augmentations, leading to poorer performance in complex tasks. We further establish a theoretical framework that connects SSL objectives to cross-entropy risk bounds, explaining how reducing intra-class variance and increasing inter-class separation can improve generalization. To address this, we propose the Dynamic Semantic Adjuster (DSA), a learnable regulator that enhances feature…
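
The two quantities the abstract refers to, intra-class variance and inter-class separation, can be illustrated with a generic diagnostic. The sketch below is not the paper's metric and does not implement DSA. It assumes features are plain sequences of floats with known class labels, and the names `class_scatter` and `make_blobs` are hypothetical helpers introduced only for this illustration.

```python
import math
import random
from collections import defaultdict


def class_scatter(features, labels):
    """Return (intra, inter) for a labelled set of feature vectors.

    intra: mean squared distance from each sample to its class centroid.
    inter: mean Euclidean distance between centroids of distinct classes.
    Assumes at least two classes are present.
    """
    groups = defaultdict(list)
    for x, y in zip(features, labels):
        groups[y].append(x)

    # Centroid of each class, computed coordinate by coordinate.
    centroids = {
        y: [sum(col) / len(xs) for col in zip(*xs)]
        for y, xs in groups.items()
    }

    # Intra-class variance: average squared distance to the own-class centroid.
    intra = sum(
        sum((a - b) ** 2 for a, b in zip(x, centroids[y]))
        for x, y in zip(features, labels)
    ) / len(labels)

    # Inter-class separation: average distance over all centroid pairs.
    keys = list(centroids)
    pair_dists = [
        math.dist(centroids[keys[i]], centroids[keys[j]])
        for i in range(len(keys))
        for j in range(i + 1, len(keys))
    ]
    inter = sum(pair_dists) / len(pair_dists)
    return intra, inter


def make_blobs(spread, seed=0, per_class=50):
    """Synthetic 2-D features: three Gaussian blobs with a given spread (toy data)."""
    rng = random.Random(seed)
    centers = [(0.0, 0.0), (4.0, 0.0), (0.0, 4.0)]
    features, labels = [], []
    for y, (cx, cy) in enumerate(centers):
        for _ in range(per_class):
            features.append((cx + rng.gauss(0, spread), cy + rng.gauss(0, spread)))
            labels.append(y)
    return features, labels


tight_intra, tight_inter = class_scatter(*make_blobs(spread=0.2))
crowded_intra, crowded_inter = class_scatter(*make_blobs(spread=2.0))
print(f"tight:   intra={tight_intra:.3f}, inter={tight_inter:.3f}")
print(f"crowded: intra={crowded_intra:.3f}, inter={crowded_inter:.3f}")
```

In this toy setup the "crowded" blobs have a much larger intra-class variance relative to the distance between their centroids, which is the kind of overlap the abstract describes as the crowding problem.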

Taxonomy

Topics: Face and Expression Recognition · Text and Document Classification Technologies · Domain Adaptation and Few-Shot Learning