Exploring Large Language Models for Multi-Modal Out-of-Distribution Detection
Yi Dai, Hao Lang, Kaisheng Zeng, Fei Huang, Yongbin Li

TL;DR
This paper proposes a novel method for multi-modal out-of-distribution detection that leverages large language models' world knowledge through selective generation and uncertainty calibration, significantly improving detection performance.
Contribution
It introduces a consistency-based uncertainty calibration technique to selectively utilize LLM-generated features, enhancing multi-modal OOD detection accuracy.
Findings
Outperforms state-of-the-art OOD detection methods
Selective LLM feature generation improves reliability
Uncertainty calibration effectively reduces hallucination effects
Abstract
Out-of-distribution (OOD) detection is essential for reliable and trustworthy machine learning. Recent multi-modal OOD detection leverages textual information from in-distribution (ID) class names for visual OOD detection, yet it currently neglects the rich contextual information of ID classes. Large language models (LLMs) encode a wealth of world knowledge and can be prompted to generate descriptive features for each class. Indiscriminately using such knowledge causes catastrophic damage to OOD detection due to LLMs' hallucinations, as is observed by our analysis. In this paper, we propose to apply world knowledge to enhance OOD detection performance through selective generation from LLMs. Specifically, we introduce a consistency-based uncertainty calibration method to estimate the confidence score of each generation. We further extract visual objects from each image to fully…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · COVID-19 diagnosis using AI · Multimodal Machine Learning Applications
