Synthesizing Near-Boundary OOD Samples for Out-of-Distribution Detection
Jinglun Li, Kaixun Jiang, Zhaoyu Chen, Bo Lin, Yao Tang, Weifeng Ge, Wenqiang Zhang

TL;DR
SynOOD leverages foundation models to generate challenging boundary-aligned OOD samples, which are used to fine-tune CLIP models, significantly improving out-of-distribution detection near the in-distribution boundary.
Contribution
The paper introduces SynOOD, a novel method that uses foundation models to synthesize boundary-aware OOD samples for enhanced OOD detection in vision-language models.
Findings
Achieves state-of-the-art OOD detection on ImageNet benchmark.
Effectively generates nuanced boundary-aligned OOD samples.
Significantly improves boundary-level discrimination between InD and OOD.
Abstract
Pre-trained vision-language models have exhibited remarkable abilities in detecting out-of-distribution (OOD) samples. However, some challenging OOD samples, which lie close to in-distribution (InD) data in image feature space, can still lead to misclassification. The emergence of foundation models like diffusion models and multimodal large language models (MLLMs) offers a potential solution to this issue. In this work, we propose SynOOD, a novel approach that harnesses foundation models to generate synthetic, challenging OOD data for fine-tuning CLIP models, thereby enhancing boundary-level discrimination between InD and OOD samples. Our method uses an iterative in-painting process guided by contextual prompts from MLLMs to produce nuanced, boundary-aligned OOD samples. These samples are refined through noise adjustments based on gradients from OOD scores like the energy score,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
