$V$-statistics and Variance Estimation
Zhengze Zhou, Lucas Mentch, Giles Hooker

TL;DR
This paper introduces a comprehensive framework for analyzing the asymptotic behavior of V-statistics, extending existing results to cases where kernel size grows with sample size, and offers improved variance estimation methods.
Contribution
It provides a novel reduction of V-statistics to U-statistics for asymptotic analysis and proposes a more accurate variance estimator applicable to both types.
Findings
Asymptotic normality holds when kernel size grows with sample size.
Unified variance estimation method for U- and V-statistics.
Application to ensemble methods like random forests.
Abstract
This paper develops a general framework for analyzing asymptotics of -statistics. Previous literature on limiting distribution mainly focuses on the cases when with fixed kernel size . Under some regularity conditions, we demonstrate asymptotic normality when grows with by utilizing existing results for -statistics. The key in our approach lies in a mathematical reduction to -statistics by designing an equivalent kernel for -statistics. We also provide a unified treatment on variance estimation for both - and -statistics by observing connections to existing methods and proposing an empirically more accurate estimator. Ensemble methods such as random forests, where multiple base learners are trained and aggregated for prediction purposes, serve as a running example throughout the paper because they are a natural and flexible application of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Statistical Methods and Bayesian Inference · Advanced Statistical Methods and Models
