AmPLe: Supporting Vision-Language Models via Adaptive-Debiased Ensemble Multi-Prompt Learning

Fei Song; Yi Li; Jiangmeng Li; Rui Wang; Changwen Zheng; Fanjiang Xu; Hui Xiong

arXiv:2512.18411·cs.CV·December 23, 2025

AmPLe: Supporting Vision-Language Models via Adaptive-Debiased Ensemble Multi-Prompt Learning

Fei Song, Yi Li, Jiangmeng Li, Rui Wang, Changwen Zheng, Fanjiang Xu, Hui Xiong

PDF

Open Access

TL;DR

AmPLe introduces an adaptive ensemble approach that mitigates model-prompt and sample-prompt biases in vision-language models, significantly improving performance across various tasks by leveraging information theory-based semantics analysis.

Contribution

The paper proposes AmPLe, a novel method that adaptively debiases ensemble weights in multi-prompt learning, addressing biases caused by model-prompt and sample-prompt mismatches.

Findings

01

AmPLe outperforms existing methods on multiple tasks.

02

It effectively reduces biases in prompt-based vision-language models.

03

Theoretical analysis confirms the causal validity of the approach.

Abstract

Multi-prompt learning methods have emerged as an effective approach for facilitating the rapid adaptation of vision-language models to downstream tasks with limited resources. Existing multi-prompt learning methods primarily focus on utilizing various meticulously designed prompts within a single foundation vision-language model to achieve superior performance. However, the overlooked model-prompt matching bias hinders the development of multi-prompt learning, i.e., the same prompt can convey different semantics across distinct vision-language models, such as CLIP-ViT-B/16 and CLIP-ViT-B/32, resulting in inconsistent predictions of identical prompt. To mitigate the impact of this bias on downstream tasks, we explore an ensemble learning approach to sufficiently aggregate the benefits of diverse predictions. Additionally, we further disclose the presence of sample-prompt matching bias,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Topic Modeling