MMDCP: A Distribution-free Approach to Outlier Detection and Classification with Coverage Guarantees and SCW-FDR Control
Youwu Lin, Xiaoyu Qian, Jinru Wu, Qi Liu, Pei Wang

TL;DR
MMDCP is a novel, distribution-free conformal prediction framework that effectively detects outliers and classifies data under label shift, with theoretical guarantees on coverage and false discovery rate control.
Contribution
It introduces MMDCP, combining class-specific distances with conformal prediction, providing the first theoretical analysis of the gap between oracle and empirical conformal p-values.
Findings
Valid coverage under mild conditions
Effective control of class-wise FDR
Superior performance in simulations and real data
Abstract
We propose the Modified Mahalanobis Distance Conformal Prediction (MMDCP), a unified framework for multi-class classification and outlier detection under label shift, where the training and test distributions may differ. In such settings, many existing methods construct nonconformity scores based on empirical cumulative or density functions combined with data-splitting strategies. However, these approaches are often computationally expensive due to their heavy reliance on resampling procedures and tend to produce overly conservative prediction sets with unstable coverage, especially in small samples. To address these challenges, MMDCP combines class-specific distance measures with full conformal prediction to construct a score function, thereby producing adaptive prediction sets that effectively capture both inlier and outlier structures. Under mild regularity conditions, we establish…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Imbalanced Data Classification Techniques · Advanced Statistical Methods and Models
