Self-Calibrated Tuning of Vision-Language Models for Out-of-Distribution   Detection

Geng Yu; Jianing Zhu; Jiangchao Yao; Bo Han

arXiv:2411.03359·cs.CV·November 7, 2024

Self-Calibrated Tuning of Vision-Language Models for Out-of-Distribution Detection

Geng Yu, Jianing Zhu, Jiangchao Yao, Bo Han

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Self-Calibrated Tuning (SCT), a novel framework that improves out-of-distribution detection in vision-language models by adaptively balancing regularization and calibration during prompt tuning with limited ID data.

Contribution

The paper proposes SCT, a new method that dynamically calibrates OOD regularization in prompt tuning, enhancing detection performance with minimal ID data.

Findings

01

SCT improves OOD detection accuracy across multiple benchmarks.

02

The method effectively balances regularization and calibration during training.

03

Extensive experiments validate the robustness of SCT in various scenarios.

Abstract

Out-of-distribution (OOD) detection is crucial for deploying reliable machine learning models in open-world applications. Recent advances in CLIP-based OOD detection have shown promising results via regularizing prompt tuning with OOD features extracted from ID data. However, the irrelevant context mined from ID data can be spurious due to the inaccurate foreground-background decomposition, thus limiting the OOD detection performance. In this work, we propose a novel framework, namely, Self-Calibrated Tuning (SCT), to mitigate this problem for effective OOD detection with only the given few-shot ID data. Specifically, SCT introduces modulating factors respectively on the two components of the original learning objective. It adaptively directs the optimization process between the two tasks during training on data with different prediction uncertainty to calibrate the influence of OOD…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tmlr-group/sct
pytorchOfficial

Videos

Self-Calibrated Tuning of Vision-Language Models for Out-of-Distribution Detection· slideslive

Taxonomy

TopicsRemote-Sensing Image Classification