Uncertainty quantification for robust variable selection and multiple testing

Eduard Belitser; Nurzhan Nurushev

arXiv:2109.09239·math.ST·September 24, 2021

Open Access

TL;DR

This paper introduces a single procedure for both robust variable selection and multiple testing with non-normal, possibly dependent observations. It adds uncertainty quantification and assesses the procedure's quality and optimality under several risk measures.

Contribution

It proposes a novel, versatile method based on risk hull minimization for simultaneous variable selection and multiple testing in robust, dependent data settings, including uncertainty quantification.

Findings

01

The procedure effectively controls FDR, FPR, FWER, and other risks.

02

Introduces the first uncertainty quantification approach in robust variable selection.

03

Discusses a weak optimality of the results for the proposed method.

Abstract

We study the problem of identifying the set of active variables, termed in the literature variable selection or multiple hypothesis testing, depending on the pursued criteria. For a general robust setting of non-normal, possibly dependent observations and a generalized notion of active set, we propose a procedure that is used simultaneously for both tasks, variable selection and multiple testing. The procedure is based on the risk hull minimization method, but can also be obtained as a result of an empirical Bayes approach or a penalization strategy. We address its quality via various criteria: the Hamming risk, FDR, FPR, FWER, NDR, FNR, and various multiple testing risks, e.g., MTR=FDR+NDR; and discuss a weak optimality of our results. Finally, we introduce and study, for the first time, the uncertainty quantification…
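To make the criteria named in the abstract concrete, here is a minimal sketch of the per-sample quantities whose expectations give the Hamming risk, FDR, NDR and MTR=FDR+NDR. It assumes the common textbook definitions (false discovery proportion as false positives over the number of selected variables, non-discovery proportion as missed active variables over the number of active variables); the paper's generalized notion of active set may lead to different definitions. The function name `selection_errors` and the example index sets are hypothetical, chosen only for illustration.

```python
def selection_errors(selected, active):
    """Per-sample error proportions for a selected set versus a true active set.

    Assumed (textbook) definitions, not taken from the paper:
      hamming = #false positives + #false negatives
      fdp     = #false positives / max(#selected, 1)
      ndp     = #false negatives / max(#active, 1)
      mtp     = fdp + ndp   (its expectation would be MTR = FDR + NDR)
    """
    selected, active = set(selected), set(active)
    false_pos = len(selected - active)  # selected but not active
    false_neg = len(active - selected)  # active but not selected
    fdp = false_pos / max(len(selected), 1)
    ndp = false_neg / max(len(active), 1)
    return {
        "hamming": false_pos + false_neg,
        "fdp": fdp,
        "ndp": ndp,
        "mtp": fdp + ndp,
    }


# Hypothetical example: variables 0..3 are active, the procedure selects 0, 1, 2, 5.
errors = selection_errors(selected=[0, 1, 2, 5], active=[0, 1, 2, 3])
print(errors)
```

Averaging these proportions over repeated samples would estimate the corresponding risks; the guards `max(..., 1)` only avoid division by zero when nothing is selected or nothing is active.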

Taxonomy

Topics: Advanced Statistical Process Monitoring · Fault Detection and Control Systems · Statistical Methods in Clinical Trials