BECLR: Batch Enhanced Contrastive Few-Shot Learning
Stylianos Poulakakis-Daktylidis, Hadi Jamali-Rad

TL;DR
BECLR introduces a novel end-to-end framework for unsupervised few-shot learning that enhances representation separability and addresses sample bias, achieving state-of-the-art results across benchmarks.
Contribution
The paper proposes DyCE and OpTA modules that improve unsupervised contrastive learning and inference, forming a new end-to-end approach called BECLR for few-shot learning.
Findings
Sets new state-of-the-art on all U-FSL benchmarks.
Significantly outperforms existing baselines.
Demonstrates effectiveness of DyCE and OpTA modules.
Abstract
Learning quickly from very few labeled samples is a fundamental attribute that separates machines and humans in the era of deep representation learning. Unsupervised few-shot learning (U-FSL) aspires to bridge this gap by discarding the reliance on annotations at training time. Intrigued by the success of contrastive learning approaches in the realm of U-FSL, we structurally approach their shortcomings in both pretraining and downstream inference stages. We propose a novel Dynamic Clustered mEmory (DyCE) module to promote a highly separable latent representation space for enhancing positive sampling at the pretraining phase and infusing implicit class-level insights into unsupervised contrastive learning. We then tackle the, somehow overlooked yet critical, issue of sample bias at the few-shot inference stage. We propose an iterative Optimal Transport-based distribution Alignment (OpTA)…
Peer Reviews
Decision·ICLR 2024 spotlight
The paper tried to address several issues within a single framework. It benefits from contrastive pre-training, tries to address and distribution shift and address some of the issues around the memory queue concept. It seems that empirical results are strong.
-SAMPTransfer (Shirekar et al 2023) is also based on membership but its performance is reported only on the miniImageNet-->CDFSL task (Table 3). It is missing from other experiments. -Prior FSL works that are related to the distribution shift (sample bias) issue are not discussed. -Table 1: For ResNet-50 and Wide ResNet backbones, some of the comparison methods are missing. Again, in Table 3 some of the comparison methods are missing. It seems that the subset of methods used in each experiment
**Originality**: The paper showcases a distinct approach by addressing recognized limitations in U-FSL and introducing the BECLR solution, merging existing concepts innovatively and presenting fresh modules like DyCE and OpTA. **Quality**: The research is robust, with DyCE and OpTA being methodically developed and their effectiveness demonstrated through comparisons with established methods like SwaV, SimSiam, and NNCLR. **Clarity**: The authors articulate their findings and methodologies cle
1. The overarching idea and structure of BECLR bear a striking resemblance to PsCo, particularly when observing Figure 2. It appears that the primary distinction is the concatenation of different views of 'X' from PsCo and the addition of a dynamic clustered memory. To differentiate their work more effectively, the authors should provide a comprehensive comparison with PsCo in both the introduction and related work sections, highlighting the unique aspects of their approach. 2. The notation use
The paper is detailed and relatively easy to follow. The underlying method is well explained and motivated and is substantiated by several experiments and ablations. \ The results are promising and impressive and the method is well grounded.
The evaluation should have considered the various components in separation and with respect to other methods. E.g. is OPTA that improves performance or perhaps adding another base line method might have had a similar performance. \ I think in such multi-component frameworks this is an issue usually. \
Code & Models
Videos
Taxonomy
TopicsCOVID-19 diagnosis using AI · Geophysical Methods and Applications · Domain Adaptation and Few-Shot Learning
MethodsContrastive Learning
