The Role of Model Confidence on Bias Effects in Measured Uncertainties for Vision-Language Models
Xinyi Liu, Weiguang Wang, Hangfeng He

TL;DR
This paper investigates how model confidence influences bias effects on epistemic and aleatoric uncertainty measurements in vision-language models, revealing that lower confidence amplifies bias effects and causes overconfidence in uncertainty estimates.
Contribution
It provides a detailed analysis of bias impacts on uncertainty quantification in vision-language models, highlighting the importance of bias mitigation for reliable model confidence assessment.
Findings
Bias effects are stronger at lower bias-free confidence levels.
Lower confidence leads to greater bias-induced underestimation of epistemic uncertainty.
Bias does not significantly affect aleatoric uncertainty direction.
Abstract
With the growing adoption of Large Language Models (LLMs) for open-ended tasks, accurately assessing epistemic uncertainty, which reflects a model's lack of knowledge, has become crucial to ensuring reliable outcomes. However, quantifying epistemic uncertainty in such tasks is challenging due to the presence of aleatoric uncertainty, which arises from multiple valid answers. While bias can introduce noise into epistemic uncertainty estimation, it may also reduce noise from aleatoric uncertainty. To investigate this trade-off, we conduct experiments on Visual Question Answering (VQA) tasks and find that mitigating prompt-introduced bias improves uncertainty quantification in GPT-4o. Building on prior work showing that LLMs tend to copy input information when model confidence is low, we further analyze how these prompt biases affect measured epistemic and aleatoric uncertainty across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Topic Modeling
