Incomplete U-Statistics of Equireplicate Designs: Berry-Esseen Bound and Efficient Construction
Cesare Miglioli, Jordan Awan

TL;DR
This paper develops a new theoretical framework for incomplete U-statistics using hypergraph theory, providing Berry-Esseen bounds and efficient construction methods, especially for degenerate cases and equireplicate designs, with applications in nonparametric testing.
Contribution
It introduces a novel hypergraph-based approach to analyze incomplete U-statistics, deriving Berry-Esseen bounds and proposing efficient algorithms for their construction in deterministic designs.
Findings
Berry-Esseen bounds for incomplete U-statistics in degenerate cases
Conditions for Gaussian limiting distributions in diverging order scenarios
Efficient algorithms for constructing equireplicate incomplete U-statistics
Abstract
U-statistics are a fundamental class of estimators that generalize the sample mean and underpin much of nonparametric statistics. Although extensively studied in both statistics and probability, key challenges remain: their high computational cost - addressed partly through incomplete U-statistics - and their non-standard asymptotic behavior in the degenerate case, which typically requires resampling methods for hypothesis testing. This paper presents a novel perspective on U-statistics, grounded in hypergraph theory and combinatorial designs. Our approach bypasses the traditional Hoeffding decomposition, the main analytical tool in this literature but one that is highly sensitive to degeneracy. By characterizing the dependence structure of a U-statistic, we derive a Berry-Esseen bound valid for incomplete U-statistics of deterministic designs, yielding conditions under which Gaussian…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimal Experimental Design Methods · Statistical Methods in Clinical Trials · Bayesian Methods and Mixture Models
