Semantic Positive Pairs for Enhancing Visual Representation Learning of Instance Discrimination Methods
Mohammad Alkhalefi, Georgios Leontidis, Mingjun Zhong

TL;DR
This paper introduces a method to identify and utilize semantically similar positive pairs in self-supervised learning, improving visual representations by capturing more meaningful features beyond traditional data augmentation.
Contribution
The proposed approach enhances instance discrimination SSL methods by incorporating semantic positive pairs, leading to better representations across multiple datasets and tasks.
Findings
Consistent performance improvements over baseline methods.
4.1% accuracy gain on ImageNet with MoCo-v2.
Effective in semi-supervised, transfer learning, and object detection tasks.
Abstract
Self-supervised learning algorithms (SSL) based on instance discrimination have shown promising results, performing competitively or even outperforming supervised learning counterparts in some downstream tasks. Such approaches employ data augmentation to create two views of the same instance (i.e., positive pairs) and encourage the model to learn good representations by attracting these views closer in the embedding space without collapsing to the trivial solution. However, data augmentation is limited in representing positive pairs, and the repulsion process between the instances during contrastive learning may discard important features for instances that have similar categories. To address this issue, we propose an approach to identify those images with similar semantic content and treat them as positive instances, thereby reducing the chance of discarding important features during…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Image Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques
MethodsBitcoin Customer Service Number +1-833-534-1729 · InfoNCE · Contrastive Learning · Momentum Contrast · *Communicated@Fast*How Do I Communicate to Expedia? · Average Pooling · Residual Connection · Batch Normalization · Global Average Pooling · Kaiming Initialization
