Synthesizing Near-Boundary OOD Samples for Out-of-Distribution Detection

Jinglun Li; Kaixun Jiang; Zhaoyu Chen; Bo Lin; Yao Tang; Weifeng Ge; Wenqiang Zhang

arXiv:2507.10225·cs.CV·August 22, 2025

Synthesizing Near-Boundary OOD Samples for Out-of-Distribution Detection

Jinglun Li, Kaixun Jiang, Zhaoyu Chen, Bo Lin, Yao Tang, Weifeng Ge, Wenqiang Zhang

PDF

1 Datasets

TL;DR

SynOOD leverages foundation models to generate challenging boundary-aligned OOD samples, which are used to fine-tune CLIP models, significantly improving out-of-distribution detection near the in-distribution boundary.

Contribution

The paper introduces SynOOD, a novel method that uses foundation models to synthesize boundary-aware OOD samples for enhanced OOD detection in vision-language models.

Findings

01

Achieves state-of-the-art OOD detection on ImageNet benchmark.

02

Effectively generates nuanced boundary-aligned OOD samples.

03

Significantly improves boundary-level discrimination between InD and OOD.

Abstract

Pre-trained vision-language models have exhibited remarkable abilities in detecting out-of-distribution (OOD) samples. However, some challenging OOD samples, which lie close to in-distribution (InD) data in image feature space, can still lead to misclassification. The emergence of foundation models like diffusion models and multimodal large language models (MLLMs) offers a potential solution to this issue. In this work, we propose SynOOD, a novel approach that harnesses foundation models to generate synthetic, challenging OOD data for fine-tuning CLIP models, thereby enhancing boundary-level discrimination between InD and OOD samples. Our method uses an iterative in-painting process guided by contextual prompts from MLLMs to produce nuanced, boundary-aligned OOD samples. These samples are refined through noise adjustments based on gradients from OOD scores like the energy score,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Grey0423/nearOOD_of_SynOOD
dataset· 6 dl
6 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.