Seed the Views: Hierarchical Semantic Alignment for Contrastive Representation Learning
Haohang Xu, Xiaopeng Zhang, Hao Li, Lingxi Xie, Hongkai Xiong, Qi Tian

TL;DR
This paper introduces CsMl, a hierarchical semantic alignment method for contrastive learning that expands positive samples and aligns semantically similar images at multiple levels, improving representation quality and achieving state-of-the-art results.
Contribution
It proposes a novel hierarchical semantic alignment strategy that extends contrastive loss to incorporate multiple positives and multi-level semantic relationships, enhancing existing contrastive learning methods.
Findings
Achieves 76.6% top-1 accuracy with linear evaluation on ResNet-50.
Improves semi-supervised learning accuracy with only 1% and 10% labels.
Sets new state-of-the-art performance in contrastive representation learning.
Abstract
Self-supervised learning based on instance discrimination has shown remarkable progress. In particular, contrastive learning, which regards each image as well as its augmentations as an individual class and tries to distinguish them from all other images, has been verified effective for representation learning. However, pushing away two images that are de facto similar is suboptimal for general representation. In this paper, we propose a hierarchical semantic alignment strategy via expanding the views generated by a single image to \textbf{Cross-samples and Multi-level} representation, and models the invariance to semantically similar images in a hierarchical way. This is achieved by extending the contrastive loss to allow for multiple positives per anchor, and explicitly pulling semantically similar images/patches together at different layers of the network. Our method, termed as CsMl,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications
MethodsContrastive Learning · Batch Normalization · InfoNCE · Momentum Contrast
