Saliency Guided Contrastive Learning on Scene Images

Meilin Chen; Yizhou Wang; Shixiang Tang; Feng Zhu; Haiyang Yang; Lei; Bai; Rui Zhao; Donglian Qi; Wanli Ouyang

arXiv:2302.11461·cs.CV·February 24, 2023

Saliency Guided Contrastive Learning on Scene Images

Meilin Chen, Yizhou Wang, Shixiang Tang, Feng Zhu, Haiyang Yang, Lei, Bai, Rui Zhao, Donglian Qi, Wanli Ouyang

PDF

Open Access

TL;DR

This paper introduces a saliency-guided contrastive learning method that enhances self-supervised learning on complex scene images by focusing on discriminative regions, leading to improved performance in various evaluation settings.

Contribution

It proposes using saliency maps to identify and emphasize important regions in scene images during contrastive learning, a novel approach for better representation learning from less-curated data.

Findings

01

Achieved +1.1% Top1 accuracy in ImageNet linear evaluation.

02

Improved semi-supervised learning accuracy by +4.3% with 1% labels.

03

Enhanced contrastive learning performance on scene images.

Abstract

Self-supervised learning holds promise in leveraging large numbers of unlabeled data. However, its success heavily relies on the highly-curated dataset, e.g., ImageNet, which still needs human cleaning. Directly learning representations from less-curated scene images is essential for pushing self-supervised learning to a higher level. Different from curated images which include simple and clear semantic information, scene images are more complex and mosaic because they often include complex scenes and multiple objects. Despite being feasible, recent works largely overlooked discovering the most discriminative regions for contrastive learning to object representations in scene images. In this work, we leverage the saliency map derived from the model's output during learning to highlight these discriminative regions and guide the whole contrastive learning. Specifically, the saliency map…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications

MethodsContrastive Learning