Prediction-Powered Conditional Inference
Yang Sui, Jin Zhou, Hua Zhou, and Xiaowu Dai

TL;DR
This paper introduces a novel prediction-powered inference method that leverages machine learning predictions and unlabeled data to improve the accuracy and efficiency of conditional functional estimation without assuming parametric models.
Contribution
It develops a kernel-based localization technique combined with prediction correction to produce valid, efficient confidence intervals for conditional functionals in scarce-label settings.
Findings
Achieves minimax-optimal convergence rates.
Provides valid confidence intervals with improved precision.
Demonstrates effectiveness on real and simulated data.
Abstract
We study prediction-powered conditional inference in the setting where labeled data are scarce, unlabeled covariates are abundant, and a black-box machine-learning predictor is available. The goal is to perform statistical inference on conditional functionals evaluated at a fixed test point, such as conditional means, without imposing a parametric model for the conditional relationship. Our approach combines localization with prediction-based variance reduction. First, we introduce a reproducing kernel-based localization method that learns a data-adaptive weight function from covariates and reformulates the target conditional moment at the test point as a weighted unconditional moment. Second, we incorporate machine-learning predictions through a correction-based decomposition of this localized moment, yielding a prediction-powered estimator and confidence interval that reduce variance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning
