Dimension-agnostic inference using cross U-statistics

Ilmun Kim; Aaditya Ramdas

arXiv:2011.05068·math.ST·May 14, 2024·5 cites

Dimension-agnostic inference using cross U-statistics

Ilmun Kim, Aaditya Ramdas

PDF

Open Access

TL;DR

This paper develops a dimension-agnostic statistical inference method using cross U-statistics, enabling valid tests regardless of the relationship between sample size and dimensionality, and achieves minimax optimal power.

Contribution

It introduces a novel approach combining variational representations, sample splitting, and self-normalization to create a test statistic with a Gaussian limit independent of dimensionality.

Findings

01

The method provides valid inference for any dimension-to-sample size ratio.

02

It achieves minimax rate-optimal power against local alternatives.

03

Matches high-dimensional power of traditional U-statistics up to a factor of rac{rac{1}{2}}.

Abstract

Classical asymptotic theory for statistical inference usually involves calibrating a statistic by fixing the dimension $d$ while letting the sample size $n$ increase to infinity. Recently, much effort has been dedicated towards understanding how these methods behave in high-dimensional settings, where $d$ and $n$ both increase to infinity together. This often leads to different inference procedures, depending on the assumptions about the dimensionality, leaving the practitioner in a bind: given a dataset with 100 samples in 20 dimensions, should they calibrate by assuming $n ≫ d$ , or $d / n \approx 0.2$ ? This paper considers the goal of dimension-agnostic inference; developing methods whose validity does not depend on any assumption on $d$ versus $n$ . We introduce an approach that uses variational representations of existing test statistics along with sample splitting and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Machine Learning and Algorithms · Bayesian Methods and Mixture Models