Testing for Outliers with Conformal p-values
Stephen Bates, Emmanuel Cand\`es, Lihua Lei, Yaniv Romano, Matteo, Sesia

TL;DR
This paper develops conformal p-value methods for nonparametric outlier detection, providing finite-sample guarantees and a new approach for valid, independent p-values to improve false discovery rate control.
Contribution
It introduces a novel conformal inference framework that produces valid, independent p-values for outlier detection, enabling stronger error control and uniform false positive bounds.
Findings
Proposed conformal p-values are positively dependent and enable FDR control.
Developed a new method for valid, independent p-values conditioned on training data.
Numerical experiments demonstrate effectiveness on real and simulated data.
Abstract
This paper studies the construction of p-values for nonparametric outlier detection, taking a multiple-testing perspective. The goal is to test whether new independent samples belong to the same distribution as a reference data set or are outliers. We propose a solution based on conformal inference, a broadly applicable framework which yields p-values that are marginally valid but mutually dependent for different test points. We prove these p-values are positively dependent and enable exact false discovery rate control, although in a relatively weak marginal sense. We then introduce a new method to compute p-values that are both valid conditionally on the training data and independent of each other for different test points; this paves the way to stronger type-I error guarantees. Our results depart from classical conformal inference as we leverage concentration inequalities rather than…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Process Monitoring · Adversarial Robustness in Machine Learning · Machine Learning and Algorithms
