Estimating the False Discovery Rate of Variable Selection

Yixiang Luo; William Fithian; Lihua Lei

arXiv:2408.07231·stat.ME·February 25, 2026

Estimating the False Discovery Rate of Variable Selection

Yixiang Luo, William Fithian, Lihua Lei

PDF

Open Access 1 Repo

TL;DR

This paper presents a universal estimator for the false discovery rate in variable selection, applicable across various statistical models, and offers methods to evaluate its bias and standard error.

Contribution

It introduces a new estimator for false discovery rate applicable to multiple model selection procedures, with theoretical bias guarantees and bootstrap-based error assessment.

Findings

01

Estimator is conservative with non-negative bias under standard assumptions.

02

Provides a bootstrap method for standard error estimation.

03

Helps balance prediction accuracy and variable selection in practice.

Abstract

We introduce a generic estimator for the false discovery rate of any model selection procedure, in common statistical modeling settings including the Gaussian linear model, Gaussian graphical model, and model-X setting. We prove that our method has a conservative (non-negative) bias in finite samples under standard statistical assumptions, and provide a bootstrap method for assessing its standard error. For methods like the Lasso, forward-stepwise regression, and the graphical Lasso, our estimator serves as a valuable companion to cross-validation, illuminating the tradeoff between prediction error and variable selection accuracy as a function of the model complexity parameter.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yixiangLuo/hFDR
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenetic and phenotypic traits in livestock