The False Discovery Rate for Statistical Pattern Recognition

Clayton Scott; Gowtham Bellala; Rebecca Willett

arXiv:0901.4184·math.ST·January 28, 2009

The False Discovery Rate for Statistical Pattern Recognition

Clayton Scott, Gowtham Bellala, Rebecca Willett

PDF

Open Access

TL;DR

This paper extends the analysis of false discovery rate (FDR) and false nondiscovery rate (FNDR) to classification, providing finite sample bounds and consistency results for classifiers learned from labeled data.

Contribution

It introduces a novel distribution-free analysis of empirical FDR and FNDR as ratios of binomials, with new bounds and consistency guarantees.

Findings

01

Derived uniform deviation bounds for FDR and FNDR

02

Established finite sample bounds for classifiers using FDR and FNDR

03

Proved strong universal consistency of the proposed methods

Abstract

The false discovery rate (FDR) and false nondiscovery rate (FNDR) have received considerable attention in the literature on multiple testing. These performance measures are also appropriate for classification, and in this work we develop generalization error analyses for FDR and FNDR when learning a classifier from labeled training data. Unlike more conventional classification performance measures, the empirical FDR and FNDR are not binomial random variables but rather a ratio of binomials, which introduces challenges not addressed in conventional analyses. We develop distribution-free uniform deviation bounds and apply these to obtain finite sample bounds and strong universal consistency.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods in Clinical Trials · Statistical Methods and Bayesian Inference · Statistical Methods and Inference