Do More Predictions Improve Statistical Inference? Filtered Prediction-Powered Inference
Shirong Xu, Will Wei Sun

TL;DR
This paper introduces Filtered Prediction-Powered Inference (FPPI), a method that selectively uses predictions to improve statistical inference, especially when prediction quality varies, leading to more accurate and efficient results with less reliance on labeled data.
Contribution
FPPI is a novel framework that adaptively filters predictions based on data, improving inference accuracy and efficiency over existing methods in heterogeneous prediction quality settings.
Findings
FPPI achieves faster convergence rates under a margin condition.
It reduces reliance on labeled data while maintaining accuracy.
Numerical studies confirm improved inference with heterogeneous predictions.
Abstract
Recent advances in artificial intelligence have enabled the generation of large-scale, low-cost predictions with increasingly high fidelity. As a result, the primary challenge in statistical inference has shifted from data scarcity to data reliability. Prediction-powered inference methods seek to exploit such predictions to improve efficiency when labeled data are limited. However, existing approaches implicitly adopt a use-all philosophy, under which incorporating more predictions is presumed to improve inference. When prediction quality is heterogeneous, this assumption can fail, and indiscriminate use of unlabeled data may dilute informative signals and degrade inferential accuracy. In this paper, we propose Filtered Prediction-Powered Inference (FPPI), a framework that selectively incorporates predictions by identifying a data-adaptive filtered region in which predictions are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Machine Learning and Algorithms · Artificial Intelligence in Healthcare and Education
