Learning U-Statistics with Active Inference
Xiaoning Wang, Yuyang Huo, Liuhua Peng, and Changliang Zou

TL;DR
This paper introduces an active inference framework for U-statistics that improves estimation efficiency by selectively querying labels, applicable to statistical inference and empirical risk minimization.
Contribution
It develops a novel active inference approach for U-statistics, including optimal sampling strategies and extensions to empirical risk minimization, with demonstrated empirical benefits.
Findings
Significant efficiency gains over baseline methods in real datasets
Optimal sampling rule minimizes variance of the augmented inverse probability weighting U-statistic
Framework maintains valid statistical inference and coverage
Abstract
-statistics play a central role in statistical inference. In many modern applications, however, acquiring the labels required for -statistics is costly. Motivated by recent advances in active inference, we develop an active inference framework for -statistics that selectively queries informative labels to improve estimation efficiency under a fixed labeling budget, while preserving valid statistical inference. Our approach is built on the augmented inverse probability weighting -statistic, which is designed to incorporate the sampling rule and machine learning predictions. We characterize the optimal sampling rule that minimizes its variance and design practical sampling strategies. We further extend the framework to -statistic-based empirical risk minimization. Experiments on real datasets demonstrate substantial gains in estimation efficiency over baseline methods,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
