BECLR: Batch Enhanced Contrastive Few-Shot Learning

Stylianos Poulakakis-Daktylidis; Hadi Jamali-Rad

arXiv:2402.02444·cs.CV·February 6, 2024·2 cites

BECLR: Batch Enhanced Contrastive Few-Shot Learning

Stylianos Poulakakis-Daktylidis, Hadi Jamali-Rad

PDF

Open Access 1 Repo 1 Video 3 Reviews

TL;DR

BECLR introduces a novel end-to-end framework for unsupervised few-shot learning that enhances representation separability and addresses sample bias, achieving state-of-the-art results across benchmarks.

Contribution

The paper proposes DyCE and OpTA modules that improve unsupervised contrastive learning and inference, forming a new end-to-end approach called BECLR for few-shot learning.

Findings

01

Sets new state-of-the-art on all U-FSL benchmarks.

02

Significantly outperforms existing baselines.

03

Demonstrates effectiveness of DyCE and OpTA modules.

Abstract

Learning quickly from very few labeled samples is a fundamental attribute that separates machines and humans in the era of deep representation learning. Unsupervised few-shot learning (U-FSL) aspires to bridge this gap by discarding the reliance on annotations at training time. Intrigued by the success of contrastive learning approaches in the realm of U-FSL, we structurally approach their shortcomings in both pretraining and downstream inference stages. We propose a novel Dynamic Clustered mEmory (DyCE) module to promote a highly separable latent representation space for enhancing positive sampling at the pretraining phase and infusing implicit class-level insights into unsupervised contrastive learning. We then tackle the, somehow overlooked yet critical, issue of sample bias at the few-shot inference stage. We propose an iterative Optimal Transport-based distribution Alignment (OpTA)…

Peer Reviews

Decision·ICLR 2024 spotlight

Reviewer 01Rating 8· accept, good paperConfidence 3

Strengths

The paper tried to address several issues within a single framework. It benefits from contrastive pre-training, tries to address and distribution shift and address some of the issues around the memory queue concept. It seems that empirical results are strong.

Weaknesses

-SAMPTransfer (Shirekar et al 2023) is also based on membership but its performance is reported only on the miniImageNet-->CDFSL task (Table 3). It is missing from other experiments. -Prior FSL works that are related to the distribution shift (sample bias) issue are not discussed. -Table 1: For ResNet-50 and Wide ResNet backbones, some of the comparison methods are missing. Again, in Table 3 some of the comparison methods are missing. It seems that the subset of methods used in each experiment

Reviewer 02Rating 6· marginally above the acceptance thresholdConfidence 5

Strengths

**Originality**: The paper showcases a distinct approach by addressing recognized limitations in U-FSL and introducing the BECLR solution, merging existing concepts innovatively and presenting fresh modules like DyCE and OpTA. **Quality**: The research is robust, with DyCE and OpTA being methodically developed and their effectiveness demonstrated through comparisons with established methods like SwaV, SimSiam, and NNCLR. **Clarity**: The authors articulate their findings and methodologies cle

Weaknesses

1. The overarching idea and structure of BECLR bear a striking resemblance to PsCo, particularly when observing Figure 2. It appears that the primary distinction is the concatenation of different views of 'X' from PsCo and the addition of a dynamic clustered memory. To differentiate their work more effectively, the authors should provide a comprehensive comparison with PsCo in both the introduction and related work sections, highlighting the unique aspects of their approach. 2. The notation use

Reviewer 03Rating 6· marginally above the acceptance thresholdConfidence 4

Strengths

The paper is detailed and relatively easy to follow. The underlying method is well explained and motivated and is substantiated by several experiments and ablations. \ The results are promising and impressive and the method is well grounded.

Weaknesses

The evaluation should have considered the various components in separation and with respect to other methods. E.g. is OPTA that improves performance or perhaps adding another base line method might have had a similar performance. \ I think in such multi-component frameworks this is an issue usually. \

Code & Models

Repositories

stypoumic/beclr
pytorchOfficial

Videos

BECLR: Batch Enhanced Contrastive Few-Shot Learning· slideslive

Taxonomy

TopicsCOVID-19 diagnosis using AI · Geophysical Methods and Applications · Domain Adaptation and Few-Shot Learning

MethodsContrastive Learning