Contrastive Learning with Stronger Augmentations
Xiao Wang, Guo-Jun Qi

TL;DR
This paper introduces CLSA, a contrastive learning framework that leverages stronger data augmentations to improve image representation, achieving near-supervised accuracy on ImageNet by supervising retrieval with distribution divergence.
Contribution
The paper proposes a novel contrastive learning framework, CLSA, that effectively utilizes stronger augmentations by supervising retrieval through distribution divergence, enhancing representation quality.
Findings
CLSA achieves 76.2% top-1 accuracy on ImageNet with ResNet-50.
Stronger augmentations, when supervised properly, significantly boost downstream performance.
The approach approaches the accuracy of supervised learning on large-scale datasets.
Abstract
Representation learning has significantly been developed with the advance of contrastive learning methods. Most of those methods have benefited from various data augmentations that are carefully designated to maintain their identities so that the images transformed from the same instance can still be retrieved. However, those carefully designed transformations limited us to further explore the novel patterns exposed by other transformations. Meanwhile, as found in our experiments, the strong augmentations distorted the images' structures, resulting in difficult retrieval. Thus, we propose a general framework called Contrastive Learning with Stronger Augmentations~(CLSA) to complement current contrastive learning approaches. Here, the distribution divergence between the weakly and strongly augmented images over the representation bank is adopted to supervise the retrieval of strongly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications
MethodsContrastive Learning
