Testing for Outliers with Conformal p-values

Stephen Bates; Emmanuel Candès; Lihua Lei; Yaniv Romano; Matteo Sesia

arXiv:2104.08279·stat.ME·March 12, 2024·6 cites

TL;DR

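This paper develops conformal p-values for nonparametric outlier detection with finite-sample guarantees. It also introduces a new method whose p-values are valid conditionally on the training data and independent across test points, which supports stronger type-I error guarantees than marginal false discovery rate control.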

Contribution

It introduces a conformal inference method that computes p-values that are valid conditionally on the training data and independent across test points, enabling stronger type-I error control and uniform false positive bounds.

Findings

01

Standard (marginal) conformal p-values are positively dependent and enable exact false discovery rate (FDR) control, though only in a relatively weak marginal sense.

02

A new method computes p-values that are valid conditionally on the training data and independent of each other across test points.

03

Numerical experiments demonstrate effectiveness on real and simulated data.

Abstract

This paper studies the construction of p-values for nonparametric outlier detection, taking a multiple-testing perspective. The goal is to test whether new independent samples belong to the same distribution as a reference data set or are outliers. We propose a solution based on conformal inference, a broadly applicable framework which yields p-values that are marginally valid but mutually dependent for different test points. We prove these p-values are positively dependent and enable exact false discovery rate control, although in a relatively weak marginal sense. We then introduce a new method to compute p-values that are both valid conditionally on the training data and independent of each other for different test points; this paves the way to stronger type-I error guarantees. Our results depart from classical conformal inference as we leverage concentration inequalities rather than…
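The marginal construction described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes a score function has already been fitted on a separate training split, that larger scores mean "more outlying", and that `cal_scores` are scores of held-out inliers. The function names `conformal_pvalues` and `benjamini_hochberg` and the toy absolute-value score are hypothetical choices made for this example. The conditionally valid p-values introduced in the paper require an additional adjustment that is not shown here.

```python
import numpy as np


def conformal_pvalues(cal_scores, test_scores):
    """Marginal conformal p-values (assumes larger score = more outlying).

    p_j = (1 + #{i : cal_i >= test_j}) / (n + 1), with n calibration scores.
    """
    cal = np.sort(np.asarray(cal_scores, dtype=float))
    test = np.asarray(test_scores, dtype=float)
    n = cal.size
    # searchsorted(side="left") gives the number of calibration scores
    # strictly below each test score, so n minus that is the count >= it.
    n_ge = n - np.searchsorted(cal, test, side="left")
    return (1.0 + n_ge) / (n + 1.0)


def benjamini_hochberg(pvals, alpha=0.1):
    """Boolean rejection mask from the Benjamini-Hochberg step-up procedure."""
    p = np.asarray(pvals, dtype=float)
    m = p.size
    order = np.argsort(p)
    # Compare the k-th smallest p-value with alpha * k / m.
    below = p[order] <= alpha * np.arange(1, m + 1) / m
    reject = np.zeros(m, dtype=bool)
    if below.any():
        k = np.max(np.nonzero(below)[0])
        reject[order[: k + 1]] = True
    return reject


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Toy setting (an assumption of this sketch): inliers are N(0, 1),
    # outliers are N(4, 1), and the score is the absolute value.
    cal_scores = np.abs(rng.normal(0.0, 1.0, size=1000))
    test_points = np.concatenate(
        [rng.normal(0.0, 1.0, size=90), rng.normal(4.0, 1.0, size=10)]
    )
    pvals = conformal_pvalues(cal_scores, np.abs(test_points))
    flagged = benjamini_hochberg(pvals, alpha=0.1)
    print("flagged test points:", int(flagged.sum()))
```

Because every test p-value is computed against the same calibration scores, the p-values are mutually dependent, which is why the positive-dependence result matters for applying Benjamini-Hochberg to them.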

Code & Models

Repositories

msesia/conditional-conformal-pvalues (Official)

Taxonomy

Topics: Advanced Statistical Process Monitoring · Adversarial Robustness in Machine Learning · Machine Learning and Algorithms