Generalized Supervised Contrastive Learning
Jaewon Kim, Hyukjong Lee, Jooyoung Chang, Sang Min Park

TL;DR
This paper introduces a generalized supervised contrastive loss that leverages label distributions, enabling improved training of neural networks and achieving state-of-the-art results on ImageNet, CIFAR10, and CIFAR100 datasets.
Contribution
We propose a novel generalized supervised contrastive loss that utilizes label distributions, enhancing existing contrastive learning methods and enabling better integration of techniques like CutMix and knowledge distillation.
Findings
Achieves 77.3% top-1 accuracy on ImageNet with ResNet50 and Momentum Contrast.
Sets new state-of-the-art accuracies of 98.2% on CIFAR10 and 87.0% on CIFAR100.
Outperforms traditional supervised contrastive learning by 4.1% on ImageNet.
Abstract
With the recent promising results of contrastive learning in the self-supervised learning paradigm, supervised contrastive learning has successfully extended these contrastive approaches to supervised contexts, outperforming cross-entropy on various datasets. However, supervised contrastive learning inherently employs label information in a binary form--either positive or negative--using a one-hot target vector. This structure struggles to adapt to methods that exploit label information as a probability distribution, such as CutMix and knowledge distillation. In this paper, we introduce a generalized supervised contrastive loss, which measures cross-entropy between label similarity and latent similarity. This concept enhances the capabilities of supervised contrastive loss by fully utilizing the label distribution and enabling the adaptation of various existing techniques for training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Image Enhancement Techniques
MethodsCutMix · Residual Connection · Max Pooling · Average Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization · Residual Block · 1x1 Convolution · Global Average Pooling · Bottleneck Residual Block
