Candidate Pseudolabel Learning: Enhancing Vision-Language Models by   Prompt Tuning with Unlabeled Data

Jiahan Zhang; Qi Wei; Feng Liu; Lei Feng

arXiv:2406.10502·cs.LG·June 18, 2024

Candidate Pseudolabel Learning: Enhancing Vision-Language Models by Prompt Tuning with Unlabeled Data

Jiahan Zhang, Qi Wei, Feng Liu, Lei Feng

PDF

Open Access 1 Repo

TL;DR

This paper introduces CPL, a novel method for fine-tuning vision-language models using candidate pseudolabels generated through a confidence-based selection process, improving performance with unlabeled data.

Contribution

We propose a candidate pseudolabel learning approach that refines pseudolabels via intra- and inter-instance selection, enhancing unlabeled data utilization in VLM fine-tuning.

Findings

01

Significant performance gains on nine benchmark datasets.

02

Effective handling of low zero-shot performance scenarios.

03

Improved class-balanced instance selection.

Abstract

Fine-tuning vision-language models (VLMs) with abundant unlabeled data recently has attracted increasing attention. Existing methods that resort to the pseudolabeling strategy would suffer from heavily incorrect hard pseudolabels when VLMs exhibit low zero-shot performance in downstream tasks. To alleviate this issue, we propose a Candidate Pseudolabel Learning method, termed CPL, to fine-tune VLMs with suitable candidate pseudolabels of unlabeled data in downstream tasks. The core of our method lies in the generation strategy of candidate pseudolabels, which progressively generates refined candidate pseudolabels by both intra- and inter-instance label selection, based on a confidence score matrix for all unlabeled data. This strategy can result in better performance in true label inclusion and class-balanced instance selection. In this way, we can directly apply existing loss functions…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vanillaer/cpl-icml2024
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques