Semantic Positive Pairs for Enhancing Visual Representation Learning of   Instance Discrimination Methods

Mohammad Alkhalefi; Georgios Leontidis; Mingjun Zhong

arXiv:2306.16122·cs.CV·May 1, 2025·1 cites

Semantic Positive Pairs for Enhancing Visual Representation Learning of Instance Discrimination Methods

Mohammad Alkhalefi, Georgios Leontidis, Mingjun Zhong

PDF

Open Access

TL;DR

This paper introduces a method to identify and utilize semantically similar positive pairs in self-supervised learning, improving visual representations by capturing more meaningful features beyond traditional data augmentation.

Contribution

The proposed approach enhances instance discrimination SSL methods by incorporating semantic positive pairs, leading to better representations across multiple datasets and tasks.

Findings

01

Consistent performance improvements over baseline methods.

02

4.1% accuracy gain on ImageNet with MoCo-v2.

03

Effective in semi-supervised, transfer learning, and object detection tasks.

Abstract

Self-supervised learning algorithms (SSL) based on instance discrimination have shown promising results, performing competitively or even outperforming supervised learning counterparts in some downstream tasks. Such approaches employ data augmentation to create two views of the same instance (i.e., positive pairs) and encourage the model to learn good representations by attracting these views closer in the embedding space without collapsing to the trivial solution. However, data augmentation is limited in representing positive pairs, and the repulsion process between the instances during contrastive learning may discard important features for instances that have similar categories. To address this issue, we propose an approach to identify those images with similar semantic content and treat them as positive instances, thereby reducing the chance of discarding important features during…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Image Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques

MethodsBitcoin Customer Service Number +1-833-534-1729 · InfoNCE · Contrastive Learning · Momentum Contrast · *Communicated@Fast*How Do I Communicate to Expedia? · Average Pooling · Residual Connection · Batch Normalization · Global Average Pooling · Kaiming Initialization