Towards the Sparseness of Projection Head in Self-Supervised Learning

Zeen Song; Xingzhe Su; Jingyao Wang; Wenwen Qiang; Changwen Zheng,; Fuchun Sun

arXiv:2307.08913·cs.LG·July 20, 2023·1 cites

Towards the Sparseness of Projection Head in Self-Supervised Learning

Zeen Song, Xingzhe Su, Jingyao Wang, Wenwen Qiang, Changwen Zheng,, Fuchun Sun

PDF

Open Access

TL;DR

This paper investigates the internal mechanisms of the projection head in self-supervised contrastive learning, revealing that a sparse projection head can improve generalization and representation quality, and introduces SparseHead regularization.

Contribution

It provides theoretical and empirical evidence that a sparse projection head enhances SSL performance and introduces SparseHead, a regularization method to enforce sparsity in the projection head.

Findings

01

Sparse projection head improves contrastive learning performance.

02

SparseHead regularization effectively constrains the projection head.

03

Theoretical analysis links sparsity with better generalization.

Abstract

In recent years, self-supervised learning (SSL) has emerged as a promising approach for extracting valuable representations from unlabeled data. One successful SSL method is contrastive learning, which aims to bring positive examples closer while pushing negative examples apart. Many current contrastive learning approaches utilize a parameterized projection head. Through a combination of empirical analysis and theoretical investigation, we provide insights into the internal mechanisms of the projection head and its relationship with the phenomenon of dimensional collapse. Our findings demonstrate that the projection head enhances the quality of representations by performing contrastive loss in a projected subspace. Therefore, we propose an assumption that only a subset of features is necessary when minimizing the contrastive loss of a mini-batch of data. Theoretical analysis further…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Cancer-related molecular mechanisms research · Mycobacterium research and diagnosis

MethodsContrastive Learning