Out-of-Distribution Detection Should Use Conformal Prediction (and Vice-versa?)
Paul Novello, Joseba Dalmau, L\'eo Andeol

TL;DR
This paper explores the synergy between Out-of-Distribution detection and Conformal Prediction, proposing new metrics and methods that leverage their interplay to improve reliability and interpretability of anomaly detection.
Contribution
It introduces conformal AUROC and conformal FRP@TPR95 metrics, and demonstrates how combining OOD scores with CP enhances detection performance and guarantees.
Findings
New conformal metrics provide probabilistic conservativeness.
Using OOD scores as non-conformity scores improves CP methods.
Applying these techniques on benchmark datasets shows performance gains.
Abstract
Research on Out-Of-Distribution (OOD) detection focuses mainly on building scores that efficiently distinguish OOD data from In Distribution (ID) data. On the other hand, Conformal Prediction (CP) uses non-conformity scores to construct prediction sets with probabilistic coverage guarantees. In this work, we propose to use CP to better assess the efficiency of OOD scores. Specifically, we emphasize that in standard OOD benchmark settings, evaluation metrics can be overly optimistic due to the finite sample size of the test dataset. Based on the work of (Bates et al., 2022), we define new conformal AUROC and conformal FRP@TPR95 metrics, which are corrections that provide probabilistic conservativeness guarantees on the variability of these metrics. We show the effect of these corrections on two reference OOD and anomaly detection benchmarks, OpenOOD (Yang et al., 2022) and ADBench (Han…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Forecasting Techniques and Applications · Air Quality Monitoring and Forecasting
