TL;DR
This paper introduces Contrastive Logit Score (CLS), a simple post-hoc method that enhances near out-of-distribution detection in vision-language prompt models like CLIP, without retraining or architecture changes.
Contribution
The paper proposes CLS, a novel plug-and-play scoring function that improves near OOD detection for vision-language prompt learning models without retraining.
Findings
Up to 11.67% AUROC improvement in near OOD detection
Effective without modifying model architectures or retraining
Validated through extensive evaluations
Abstract
Prompt learning has emerged as an efficient and effective method for fine-tuning vision-language models such as CLIP. While many studies have explored generalisation abilities of these models in few-shot classification tasks and a few studies have addressed far out-of-distribution (OOD) of the models, their potential for addressing near OOD detection remains underexplored. Existing methods either require training from scratch, need fine-tuning, or are not designed for vision-language prompt learning. To address this, we introduce the Contrastive Logit Score (CLS), a novel post-hoc, plug-and-play scoring function. CLS significantly improves near OOD detection of pre-trained vision-language prompt learning methods without modifying their model architectures or requiring retraining. Our method achieves up to an 11.67% improvement in AUROC for near OOD detection with minimal computational…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
