Prediction-Powered Inference with Inverse Probability Weighting

Jyotishka Datta; Nicholas G. Polson

arXiv:2508.10149·stat.ML·March 25, 2026

Prediction-Powered Inference with Inverse Probability Weighting

Jyotishka Datta, Nicholas G. Polson

PDF

TL;DR

This paper introduces a new interpretation of prediction-powered inference (PPI) that integrates survey sampling techniques, allowing for valid inference with estimated inclusion probabilities and partially labeled data.

Contribution

It provides a direct design-based interpretation of PPI, connecting it with survey sampling methods like Horvitz--Thompson corrections, and demonstrates its effectiveness with estimated propensities.

Findings

01

IPW-adjusted PPI with estimated propensities maintains nominal coverage.

02

Performance closely matches the known-probability case in simulations.

03

Retains variance reduction benefits of PPI.

Abstract

Prediction-powered inference (PPI) is a recent framework for valid statistical inference with partially labeled data, combining model-based predictions on a large unlabeled set with bias correction from a smaller labeled subset. Building on existing PPI results under covariate shift, we show that PPI rectification admits a direct design-based interpretation, and that informative labeling can be handled naturally by Horvitz--Thompson and H\'ajek-style corrections. This connection unites design-based survey sampling ideas with modern prediction-assisted inference, yielding estimators that remain valid when labeling probabilities vary across units. We consider the common setting where the inclusion probabilities are not known but estimated from a correctly specified model. In simulations, the performance of IPW-adjusted PPI with estimated propensities closely matches the known-probability…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.