SINCERE: Supervised Information Noise-Contrastive Estimation REvisited

Patrick Feeney; Michael C. Hughes

arXiv:2309.14277·cs.CV·November 11, 2024·1 cites

SINCERE: Supervised Information Noise-Contrastive Estimation REvisited

Patrick Feeney, Michael C. Hughes

PDF

Open Access 1 Repo

TL;DR

This paper revisits supervised contrastive learning, identifying issues with existing methods and proposing SINCERE, a theoretically justified loss that improves class separation and transfer learning performance.

Contribution

The paper introduces SINCERE, a new supervised InfoNCE loss that removes intra-class repulsion and is backed by theoretical analysis.

Findings

01

SINCERE improves class separation in embeddings.

02

SINCERE enhances transfer learning accuracy.

03

Theoretical bounds relate SINCERE to KL divergence.

Abstract

The information noise-contrastive estimation (InfoNCE) loss function provides the basis of many self-supervised deep learning methods due to its strong empirical results and theoretic motivation. Previous work suggests a supervised contrastive (SupCon) loss to extend InfoNCE to learn from available class labels. This SupCon loss has been widely-used due to reports of good empirical performance. However, in this work we find that the prior SupCon loss formulation has questionable justification because it can encourage some images from the same class to repel one another in the learned embedding space. This problematic intra-class repulsion gets worse as the number of images sharing one class label increases. We propose the Supervised InfoNCE REvisited (SINCERE) loss as a theoretically-justified supervised extension of InfoNCE that eliminates intra-class repulsion. Experiments show that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tufts-ml/supcontrast
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsInfoNCE