Is it all a cluster game? -- Exploring Out-of-Distribution Detection based on Clustering in the Embedding Space
Poulami Sinhamahapatra, Rajat Koner, Karsten Roscher, Stephan, G\"unnemann

TL;DR
This paper investigates out-of-distribution detection for image classification by analyzing clustering structures in embedding spaces, comparing various training and clustering strategies, and highlighting the dependence on model architecture and datasets.
Contribution
It provides a comprehensive analysis of clustering-based OOD detection methods, comparing contrastive and cross-entropy training, and evaluates their effectiveness across different datasets and models.
Findings
Supervised contrastive learning yields well-separated clusters.
No single clustering approach is universally best; effectiveness varies with architecture and dataset.
Cross-entropy with cosine similarity often outperforms contrastive methods on certain datasets.
Abstract
It is essential for safety-critical applications of deep neural networks to determine when new inputs are significantly different from the training distribution. In this paper, we explore this out-of-distribution (OOD) detection problem for image classification using clusters of semantically similar embeddings of the training data and exploit the differences in distance relationships to these clusters between in- and out-of-distribution data. We study the structure and separation of clusters in the embedding space and find that supervised contrastive learning leads to well-separated clusters while its self-supervised counterpart fails to do so. In our extensive analysis of different training methods, clustering strategies, distance metrics, and thresholding approaches, we observe that there is no clear winner. The optimal approach depends on the model architecture and selected datasets…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
MethodsContrastive Learning
