TagOOD: A Novel Approach to Out-of-Distribution Detection via Vision-Language Representations and Class Center Learning
Jinglun Li, Xinyu Zhou, Kaixun Jiang, Lingyi Hong, Pinxue Guo, Zhaoyu, Chen, Weifeng Ge, Wenqiang Zhang

TL;DR
TagOOD introduces a vision-language based method for out-of-distribution detection that focuses on object semantics and class centers, improving detection accuracy over traditional image-level feature approaches.
Contribution
The paper proposes TagOOD, a novel multimodal approach that decouples object features from images and learns class centers to enhance OOD detection performance.
Findings
Outperforms existing OOD detection methods on benchmark datasets
Effective in isolating object semantics from irrelevant image features
Utilizes distance metrics for efficient OOD scoring
Abstract
Multimodal fusion, leveraging data like vision and language, is rapidly gaining traction. This enriched data representation improves performance across various tasks. Existing methods for out-of-distribution (OOD) detection, a critical area where AI models encounter unseen data in real-world scenarios, rely heavily on whole-image features. These image-level features can include irrelevant information that hinders the detection of OOD samples, ultimately limiting overall performance. In this paper, we propose \textbf{TagOOD}, a novel approach for OOD detection that leverages vision-language representations to achieve label-free object feature decoupling from whole images. This decomposition enables a more focused analysis of object semantics, enhancing OOD detection performance. Subsequently, TagOOD trains a lightweight network on the extracted object features to learn representative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning
