Enhancing CLIP with CLIP: Exploring Pseudolabeling for Limited-Label Prompt Tuning
Cristina Menghini, Andrew Delworth, Stephen H. Bach

TL;DR
This paper explores using zero-shot pseudolabeling to improve CLIP's prompt tuning for image classification, achieving significant accuracy gains across various learning paradigms while reducing class bias.
Contribution
It introduces a unified framework for pseudolabeling with CLIP, leveraging zero-shot capabilities to enhance prompt tuning across multiple learning paradigms.
Findings
Prompt tuning with pseudolabels improves CLIP accuracy significantly.
Iterative pseudolabel refinement enhances performance across paradigms.
Prompt tuning reduces class bias compared to traditional pseudolabeling.
Abstract
Fine-tuning vision-language models (VLMs) like CLIP to downstream tasks is often necessary to optimize their performance. However, a major obstacle is the limited availability of labeled data. We study the use of pseudolabels, i.e., heuristic labels for unlabeled data, to enhance CLIP via prompt tuning. Conventional pseudolabeling trains a model on labeled data and then generates labels for unlabeled data. VLMs' zero-shot capabilities enable a "second generation" of pseudolabeling approaches that do not require task-specific training on labeled data. By using zero-shot pseudolabels as a source of supervision, we observe that learning paradigms such as semi-supervised, transductive zero-shot, and unsupervised learning can all be seen as optimizing the same loss function. This unified view enables the development of versatile training strategies that are applicable across learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsText Readability and Simplification · Open Education and E-Learning · Mathematics, Computing, and Information Processing
MethodsContrastive Language-Image Pre-training
