Dimension-agnostic inference using cross U-statistics
Ilmun Kim, Aaditya Ramdas

TL;DR
This paper develops a dimension-agnostic statistical inference method using cross U-statistics, enabling valid tests regardless of the relationship between sample size and dimensionality, and achieves minimax optimal power.
Contribution
It introduces a novel approach combining variational representations, sample splitting, and self-normalization to create a test statistic with a Gaussian limit independent of dimensionality.
Findings
The method provides valid inference for any dimension-to-sample size ratio.
It achieves minimax rate-optimal power against local alternatives.
Matches high-dimensional power of traditional U-statistics up to a factor of rac{rac{1}{2}}.
Abstract
Classical asymptotic theory for statistical inference usually involves calibrating a statistic by fixing the dimension while letting the sample size increase to infinity. Recently, much effort has been dedicated towards understanding how these methods behave in high-dimensional settings, where and both increase to infinity together. This often leads to different inference procedures, depending on the assumptions about the dimensionality, leaving the practitioner in a bind: given a dataset with 100 samples in 20 dimensions, should they calibrate by assuming , or ? This paper considers the goal of dimension-agnostic inference; developing methods whose validity does not depend on any assumption on versus . We introduce an approach that uses variational representations of existing test statistics along with sample splitting and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Machine Learning and Algorithms · Bayesian Methods and Mixture Models
