Robust High-dimensional Tuning Free Multiple Testing

Jianqing Fan; Zhipeng Lou; Mengxin Yu

arXiv:2211.11959·math.ST·November 24, 2022

Robust High-dimensional Tuning Free Multiple Testing

Jianqing Fan, Zhipeng Lou, Mengxin Yu

PDF

Open Access

TL;DR

This paper introduces a robust, tuning-free, and moment-free high-dimensional multiple testing method based on the Hodges-Lehmann estimator, suitable for heavy-tailed data, with proven FDR control and validated through simulations.

Contribution

It develops a non-asymptotic theory for the Hodges-Lehmann estimator and extends it to large-scale, tuning-free multiple testing procedures with false discovery proportion control.

Findings

01

Methods control false discovery proportion at a prescribed level

02

Proposed procedures are robust to heavy-tailed data

03

Simulation studies support theoretical results

Abstract

A stylized feature of high-dimensional data is that many variables have heavy tails, and robust statistical inference is critical for valid large-scale statistical inference. Yet, the existing developments such as Winsorization, Huberization and median of means require the bounded second moments and involve variable-dependent tuning parameters, which hamper their fidelity in applications to large-scale problems. To liberate these constraints, this paper revisits the celebrated Hodges-Lehmann (HL) estimator for estimating location parameters in both the one- and two-sample problems, from a non-asymptotic perspective. Our study develops Berry-Esseen inequality and Cram\'{e}r type moderate deviation for the HL estimator based on newly developed non-asymptotic Bahadur representation, and builds data-driven confidence intervals via a weighted bootstrap approach. These results allow us to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Statistical Methods and Bayesian Inference · Statistical Methods in Clinical Trials