Contrastive Learning with Stronger Augmentations

Xiao Wang; Guo-Jun Qi

arXiv:2104.07713·cs.CV·January 24, 2022·37 cites

Contrastive Learning with Stronger Augmentations

Xiao Wang, Guo-Jun Qi

PDF

Open Access 1 Repo

TL;DR

This paper introduces CLSA, a contrastive learning framework that leverages stronger data augmentations to improve image representation, achieving near-supervised accuracy on ImageNet by supervising retrieval with distribution divergence.

Contribution

The paper proposes a novel contrastive learning framework, CLSA, that effectively utilizes stronger augmentations by supervising retrieval through distribution divergence, enhancing representation quality.

Findings

01

CLSA achieves 76.2% top-1 accuracy on ImageNet with ResNet-50.

02

Stronger augmentations, when supervised properly, significantly boost downstream performance.

03

The approach approaches the accuracy of supervised learning on large-scale datasets.

Abstract

Representation learning has significantly been developed with the advance of contrastive learning methods. Most of those methods have benefited from various data augmentations that are carefully designated to maintain their identities so that the images transformed from the same instance can still be retrieved. However, those carefully designed transformations limited us to further explore the novel patterns exposed by other transformations. Meanwhile, as found in our experiments, the strong augmentations distorted the images' structures, resulting in difficult retrieval. Thus, we propose a general framework called Contrastive Learning with Stronger Augmentations~(CLSA) to complement current contrastive learning approaches. Here, the distribution divergence between the weakly and strongly augmented images over the representation bank is adopted to supervise the retrieval of strongly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

maple-research-lab/CLSA
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications

MethodsContrastive Learning