NLPrompt: Noise-Label Prompt Learning for Vision-Language Models

Bikang Pan; Qun Li; Xiaoying Tang; Wei Huang; Zhen Fang; Feng Liu; Jingya Wang; Jingyi Yu; Ye Shi

arXiv:2412.01256·cs.CV·January 29, 2026

NLPrompt: Noise-Label Prompt Learning for Vision-Language Models

Bikang Pan, Qun Li, Xiaoying Tang, Wei Huang, Zhen Fang, Feng Liu, Jingya Wang, Jingyi Yu, Ye Shi

PDF

Open Access 1 Repo

TL;DR

NLPrompt introduces a robust prompt learning framework for vision-language models that effectively handles noisy labels by combining MAE loss and optimal transport-based data purification, leading to improved accuracy and noise resilience.

Contribution

The paper proposes NLPrompt, a novel noise-label prompt learning method that integrates MAE loss and optimal transport for dataset purification, enhancing robustness in vision-language models.

Findings

01

MAE loss improves robustness against noisy labels in prompt learning.

02

PromptOT effectively partitions datasets into clean and noisy subsets.

03

NLPrompt significantly outperforms existing methods across various noise conditions.

Abstract

The emergence of vision-language foundation models, such as CLIP, has revolutionized image-text representation, enabling a broad range of applications via prompt learning. Despite its promise, real-world datasets often contain noisy labels that can degrade prompt learning performance. In this paper, we demonstrate that using mean absolute error (MAE) loss in prompt learning, named PromptMAE, significantly enhances robustness against noisy labels while maintaining high accuracy. Though MAE is straightforward and recognized for its robustness, it is rarely used in noisy-label learning due to its slow convergence and poor performance outside prompt learning scenarios. To elucidate the robustness of PromptMAE, we leverage feature learning theory to show that MAE can suppress the influence of noisy samples, thereby improving the signal-to-noise ratio and enhancing overall robustness.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

qunovo/NLPrompt
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Natural Language Processing Techniques · Topic Modeling

MethodsMasked autoencoder · Contrastive Language-Image Pre-training