Can OOD Object Detectors Learn from Foundation Models?

Jiahui Liu; Xin Wen; Shizhen Zhao; Yingxian Chen; Xiaojuan Qi

arXiv:2409.05162·cs.CV·September 10, 2024

Can OOD Object Detectors Learn from Foundation Models?

Jiahui Liu, Xin Wen, Shizhen Zhao, Yingxian Chen, Xiaojuan Qi

PDF

Open Access 1 Repo

TL;DR

This paper explores using large foundation models to generate synthetic out-of-distribution samples, improving OOD object detection performance by augmenting training data with minimal synthetic data.

Contribution

It introduces SyncOOD, a novel data curation method leveraging foundation models to synthesize OOD samples for enhanced detection accuracy.

Findings

01

SyncOOD outperforms existing methods on multiple benchmarks.

02

Synthetic OOD data improves ID/OOD decision boundary.

03

Minimal synthetic data suffices for significant gains.

Abstract

Out-of-distribution (OOD) object detection is a challenging task due to the absence of open-set OOD data. Inspired by recent advancements in text-to-image generative models, such as Stable Diffusion, we study the potential of generative models trained on large-scale open-set data to synthesize OOD samples, thereby enhancing OOD object detection. We introduce SyncOOD, a simple data curation method that capitalizes on the capabilities of large foundation models to automatically extract meaningful OOD data from text-to-image generative models. This offers the model access to open-world knowledge encapsulated within off-the-shelf foundation models. The synthetic OOD samples are then employed to augment the training of a lightweight, plug-and-play OOD detector, thus effectively optimizing the in-distribution (ID)/OOD decision boundaries. Extensive experiments across multiple benchmarks…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cvmi-lab/syncood
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIndustrial Vision Systems and Defect Detection

MethodsDiffusion