A$_3$B$_2$: Adaptive Asymmetric Adapter for Alleviating Branch Bias in Vision-Language Image Classification with Few-Shot Learning
Yiyun Zhou, Zhonghua Jiang, Wenkang Han, Kunxi Li, Mingjing Xu, Chang Yao, Jingyuan Chen

TL;DR
This paper introduces A3B2, an adaptive asymmetric adapter that reduces branch bias in vision-language models for few-shot image classification, improving performance across multiple datasets.
Contribution
The paper proposes A3B2, a novel adaptive adapter with uncertainty-aware dampening and asymmetric design to address branch bias in vision-language few-shot learning.
Findings
A3B2 outperforms 11 baselines on 11 datasets.
UAAD automatically suppresses image-branch adaptation under high uncertainty.
Extensive experiments validate the effectiveness of A3B2.
Abstract
Efficient transfer learning methods for large-scale vision-language models (, CLIP) enable strong few-shot transfer, yet existing adaptation methods follow a fixed fine-tuning paradigm that implicitly assumes a uniform importance of the image and text branches, which has not been systematically studied in image classification. Through extensive analysis, we reveal a Branch Bias issue in vision-language image classification: adapting the image encoder does not always improve performance under out-of-distribution settings. Motivated by this observation, we propose AB, an Adaptive Asymmetric Adapter that alleviates Branch Bias in few-shot learning. AB introduces Uncertainty-Aware Adapter Dampening (UAAD), which automatically suppresses image-branch adaptation when prediction uncertainty is high, enabling soft and data-driven control without manual intervention.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
