Handling Imbalanced Pseudolabels for Vision-Language Models with Concept   Alignment and Confusion-Aware Calibrated Margin

Yuchen Wang; Xuefeng Bai; Xiucheng Li; Weili Guan; Liqiang Nie,; Xinyang Chen

arXiv:2505.02056·cs.CV·May 6, 2025

Handling Imbalanced Pseudolabels for Vision-Language Models with Concept Alignment and Confusion-Aware Calibrated Margin

Yuchen Wang, Xuefeng Bai, Xiucheng Li, Weili Guan, Liqiang Nie,, Xinyang Chen

PDF

Open Access

TL;DR

This paper addresses the challenge of imbalanced pseudolabels in vision-language models by identifying key causes and proposing a novel framework with concept alignment and confusion-aware calibration, significantly improving label balance and accuracy.

Contribution

It introduces a new framework that mitigates pseudolabel imbalance by tackling concept mismatch and confusion, with mechanisms for concept alignment and calibrated margins.

Findings

01

Achieves a 6.29% relative improvement over state-of-the-art methods.

02

Effectively enhances pseudolabel accuracy and class balance across six datasets.

03

Demonstrates robustness across three learning paradigms.

Abstract

Adapting vision-language models (VLMs) to downstream tasks with pseudolabels has gained increasing attention. A major obstacle is that the pseudolabels generated by VLMs tend to be imbalanced, leading to inferior performance. While existing methods have explored various strategies to address this, the underlying causes of imbalance remain insufficiently investigated. To fill this gap, we delve into imbalanced pseudolabels and identify two primary contributing factors: concept mismatch and concept confusion. To mitigate these two issues, we propose a novel framework incorporating concept alignment and confusion-aware calibrated margin mechanisms. The core of our approach lies in enhancing underperforming classes and promoting balanced predictions across categories, thus mitigating imbalance. Extensive experiments on six benchmark datasets with three learning paradigms demonstrate that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Natural Language Processing Techniques · Web Data Mining and Analysis