One Leaf Reveals the Season: Occlusion-Based Contrastive Learning with Semantic-Aware Views for Efficient Visual Representation
Xiaoyu Yang, Lijian Xu, Hongsheng Li, Shaoting Zhang

TL;DR
This paper introduces occluded image contrastive learning (OCL), a simple and scalable pre-training method that masks image patches to generate diverse semantic views, enabling efficient high-level visual representation learning without complex augmentations.
Contribution
The paper presents a novel occlusion-based contrastive learning framework that emphasizes semantic-level view generation and high-level feature extraction, improving efficiency and scalability over existing methods.
Findings
OCL achieves 85.8% accuracy on downstream tasks.
Pre-training with ViT-L/16 takes only 133 hours on 4 GPUs.
OCL outperforms traditional augmentation-based contrastive methods.
Abstract
This paper proposes a scalable and straightforward pre-training paradigm for efficient visual conceptual representation called occluded image contrastive learning (OCL). Our OCL approach is simple: we randomly mask patches to generate different views within an image and contrast them among a mini-batch of images. The core idea behind OCL consists of two designs. First, masked tokens have the potential to significantly diminish the conceptual redundancy inherent in images, and create distinct views with substantial fine-grained differences at the semantic concept level instead of the instance level. Second, contrastive learning is adept at extracting high-level semantic conceptual features during pre-training, circumventing the high-frequency interference and additional costs associated with image reconstruction. Importantly, OCL learns highly semantic conceptual representations…
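The recipe described in the abstract (mask random patches to form two views of each image, then contrast the views across a mini-batch) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the symmetric InfoNCE loss, the 0.75 mask ratio, the 0.1 temperature, the mean-pool "encoder", and every function name here are assumptions chosen for the example, since the summary above does not specify them.

```python
import numpy as np


def random_mask_view(patches, mask_ratio, rng):
    """Keep a random subset of patch tokens; the rest are treated as occluded."""
    num_patches = patches.shape[0]
    num_keep = max(1, int(round(num_patches * (1.0 - mask_ratio))))
    keep_idx = np.sort(rng.permutation(num_patches)[:num_keep])
    return patches[keep_idx]


def toy_encoder(visible_patches, weight):
    """Hypothetical stand-in for a ViT: mean-pool, project, L2-normalise."""
    pooled = visible_patches.mean(axis=0)
    z = pooled @ weight
    return z / (np.linalg.norm(z) + 1e-12)


def info_nce(z_a, z_b, temperature=0.1):
    """Symmetric InfoNCE (assumed loss): row i of z_a pairs with row i of z_b."""
    logits = (z_a @ z_b.T) / temperature

    def cross_entropy_on_diagonal(l):
        l = l - l.max(axis=1, keepdims=True)  # numerical stability
        log_prob = l - np.log(np.exp(l).sum(axis=1, keepdims=True))
        return -np.mean(np.diag(log_prob))

    return 0.5 * (cross_entropy_on_diagonal(logits)
                  + cross_entropy_on_diagonal(logits.T))


def ocl_style_loss(batch_patches, weight, mask_ratio=0.75,
                   temperature=0.1, seed=0):
    """Two independently masked views per image, contrasted within the batch."""
    rng = np.random.default_rng(seed)
    z_a = np.stack([toy_encoder(random_mask_view(p, mask_ratio, rng), weight)
                    for p in batch_patches])
    z_b = np.stack([toy_encoder(random_mask_view(p, mask_ratio, rng), weight)
                    for p in batch_patches])
    return info_nce(z_a, z_b, temperature)


# Toy data: 8 images, 196 patches each (14 x 14), 32-dim patch embeddings.
data_rng = np.random.default_rng(0)
batch = data_rng.normal(size=(8, 196, 32))
weight = data_rng.normal(size=(32, 16))
loss = ocl_style_loss(batch, weight)
```

In this sketch the encoder only ever sees the visible patches, which is one plausible reading of why masking could reduce pre-training cost; whether OCL drops masked tokens or replaces them is not stated in the text above.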
Taxonomy
Topics: Advanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Domain Adaptation and Few-Shot Learning
Methods: Contrastive Learning
