Enabling Validation for Robust Few-Shot Recognition
Hanxin Wang, Tian Liu, Shu Kong

TL;DR
This paper introduces VEST, a validation-based framework that improves the robustness of few-shot recognition by balancing in-distribution and out-of-distribution data during model tuning, leading to state-of-the-art results.
Contribution
The paper proposes a novel validation strategy integrated into a stage-wise tuning framework to enhance few-shot recognition robustness against OOD data.
Findings
VEST outperforms existing methods on ImageNet OOD benchmarks.
The validation strategy effectively balances ID and OOD performance.
State-of-the-art results achieved on both ID and OOD data.
Abstract
Few-Shot Recognition (FSR) tackles classification tasks by training with minimal task-specific labeled data. Prevailing methods adapt or finetune a pretrained Vision-Language Model (VLM) and augment the scarce training data by retrieving task-relevant but noisy samples from open data sources. The finetuned VLM generalizes decently well to the task-specific in-distribution (ID) test data but struggles with out-of-distribution (OOD) test data. This motivates our study of robust FSR with VLM finetuning. The core challenge of FSR is data scarcity, extending beyond limited training data to a complete lack of validation data. We identify a key paradox as a potential solution: repurposing the retrieved open data for validation. As such retrieved data are inherently OOD compared with the task-specific ID training data, finetuned VLMs yield degraded performance on the retrieved data. This causes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI · Advanced Neural Network Applications
