TL;DR
COOkeD is an ensemble approach combining multiple classifiers, including zero-shot CLIP, to significantly improve out-of-distribution detection performance and robustness across various challenging scenarios.
Contribution
This work introduces COOkeD, a modular ensemble method that leverages pre-trained vision-language models for enhanced OOD detection in diverse settings.
Findings
Achieves state-of-the-art OOD detection performance.
Demonstrates robustness under label noise and covariate shifts.
Effective in zero-shot and realistic challenging scenarios.
Abstract
Out-of-distribution (OOD) detection is an important building block in trustworthy image recognition systems as unknown classes may arise at test-time. OOD detection methods typically revolve around a single classifier, leading to a split in the research field between the classical supervised setting (e.g. ResNet18 classifier trained on CIFAR100) vs. the zero-shot setting (class names fed as prompts to CLIP). In both cases, an overarching challenge is that the OOD detection performance is implicitly constrained by the classifier's capabilities on in-distribution (ID) data. In this work, we show that given a little open-mindedness from both ends, remarkable OOD detection can be achieved by instead creating a heterogeneous ensemble - COOkeD combines the predictions of a closed-world classifier trained end-to-end on a specific dataset, a zero-shot CLIP classifier, and a linear probe…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
