TagOOD: A Novel Approach to Out-of-Distribution Detection via   Vision-Language Representations and Class Center Learning

Jinglun Li; Xinyu Zhou; Kaixun Jiang; Lingyi Hong; Pinxue Guo; Zhaoyu; Chen; Weifeng Ge; Wenqiang Zhang

arXiv:2408.15566·cs.CV·August 29, 2024

TagOOD: A Novel Approach to Out-of-Distribution Detection via Vision-Language Representations and Class Center Learning

Jinglun Li, Xinyu Zhou, Kaixun Jiang, Lingyi Hong, Pinxue Guo, Zhaoyu, Chen, Weifeng Ge, Wenqiang Zhang

PDF

Open Access 1 Repo

TL;DR

TagOOD introduces a vision-language based method for out-of-distribution detection that focuses on object semantics and class centers, improving detection accuracy over traditional image-level feature approaches.

Contribution

The paper proposes TagOOD, a novel multimodal approach that decouples object features from images and learns class centers to enhance OOD detection performance.

Findings

01

Outperforms existing OOD detection methods on benchmark datasets

02

Effective in isolating object semantics from irrelevant image features

03

Utilizes distance metrics for efficient OOD scoring

Abstract

Multimodal fusion, leveraging data like vision and language, is rapidly gaining traction. This enriched data representation improves performance across various tasks. Existing methods for out-of-distribution (OOD) detection, a critical area where AI models encounter unseen data in real-world scenarios, rely heavily on whole-image features. These image-level features can include irrelevant information that hinders the detection of OOD samples, ultimately limiting overall performance. In this paper, we propose \textbf{TagOOD}, a novel approach for OOD detection that leverages vision-language representations to achieve label-free object feature decoupling from whole images. This decomposition enables a more focused analysis of object semantics, enhancing OOD detection performance. Subsequently, TagOOD trains a lightweight network on the extracted object features to learn representative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Jarvisgivemeasuit/tagood
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText and Document Classification Technologies · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning