Power Analysis for Prediction-Powered Inference

Yiqun T. Chen; Moran Guo; Shengy Li

arXiv:2603.16041·stat.ME·March 18, 2026

Power Analysis for Prediction-Powered Inference

Yiqun T. Chen, Moran Guo, Shengy Li

PDF

Open Access

TL;DR

This paper develops power analysis formulas for prediction-powered inference (PPI), enabling researchers to determine the labeled sample size needed for desired statistical power when using high-predictive AI/ML models in various biomedical applications.

Contribution

It derives closed-form power formulas for PPI, accounting for the predictive power of AI/ML models, and validates these formulas through simulations and real-world biomedical examples.

Findings

01

Sample size reduction scales with R2 between predictions and ground truth.

02

Formulas are validated via Monte Carlo simulations.

03

Framework applied to biomedical data including transcriptomics and imaging.

Abstract

Modern studies increasingly leverage outcomes predicted by machine learning and artificial intelligence (AI/ML) models, and recent work, such as prediction-powered inference (PPI), has developed valid downstream statistical inference procedures. However, classical power and sample size formulas do not readily account for these predictions. In this work, we tackle a simple yet practical question: given a new AI/ML model with high predictive power, how many labeled samples are needed to achieve a desired level of statistical power? We derive closed-form power formulas by characterizing the asymptotic variance of the PPI estimator and applying Wald test inversion to obtain the required labeled sample size. Our results cover widely used settings including two-sample comparisons and risk measures in 2x2 tables. We find that a useful rule of thumb is that the reduction in required labeled…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSingle-cell and spatial transcriptomics · Statistical Methods and Inference · Explainable Artificial Intelligence (XAI)