Finite Sample t-Tests for High-Dimensional Means

Jun Li

arXiv:2203.08786·stat.ME·March 17, 2022·J. Multivar. Anal.

Finite Sample t-Tests for High-Dimensional Means

Jun Li

PDF

Open Access

TL;DR

This paper develops finite-sample t-tests for high-dimensional mean vectors that remain accurate with very small, fixed sample sizes, addressing size distortion issues in traditional asymptotic tests.

Contribution

It establishes asymptotic t-distributions for U-statistics in high-dimensional, small-sample settings, enabling more reliable testing.

Findings

01

Proposed tests maintain accurate sizes across various dimensions and sample sizes.

02

Simulation studies validate the theoretical asymptotic distributions.

03

Application to fMRI data demonstrates practical utility.

Abstract

Size distortion can occur if an asymptotic testing procedure requiring diverging sample sizes, is implemented to data with very small sample sizes. In this paper, we consider one-sample and two-sample tests for mean vectors when data are high-dimensional but sample sizes are very small. We establish asymptotic t-distributions of one-sample and two-sample U-statistics, which only require data dimensionality to diverge but sample sizes to be fixed and no less than 3. Simulation studies confirm the theoretical results that the proposed tests maintain accurate empirical sizes for a wide range of sample sizes and data dimensionalities. We apply the proposed tests to an fMRI dataset to demonstrate the practical implementation of the methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Gene expression and cancer classification · Bayesian Methods and Mixture Models