Active Multiple-Prediction-Powered Inference
Nicholas Brawand, Nima Leclerc, Anhthy Ngo, Matthew Peterson, Sriram Vishwanath, Laith Alhussein, Ben Wellner

TL;DR
This paper introduces AM-PPI, a method that efficiently combines multiple predictors with different costs and accuracies for healthcare AI monitoring, optimizing label use and inference accuracy.
Contribution
AM-PPI extends prediction-powered inference to multiple predictors with adaptive routing, providing a globally optimal, variance-minimizing solution under a fixed budget.
Findings
AM-PPI achieves 10-40% narrower confidence intervals than single-predictor methods.
It generalizes ASI to multiple predictors with adaptive, per-instance routing.
The method performs well on synthetic and healthcare data, optimizing label efficiency.
Abstract
Post-deployment monitoring of healthcare AI requires statistically valid, label-efficient methods, but gold-standard labels from clinician chart review are expensive. Prediction-powered inference (PPI) and active statistical inference (ASI) reduce label cost by combining a small labeled sample with abundant model predictions, but both are restricted to a single predictor, a poor fit for modern clinical pipelines that have multiple predictors of differing cost and accuracy available at inference time. We propose Active Multiple-Prediction-Powered Inference (AM-PPI), which routes each instance to a cost-appropriate predictor subset, samples gold-standard labels in proportion to the chosen subset's residual uncertainty, and reweights predictions to minimize estimator variance, all under a single deployment-time budget. AM-PPI generalizes ASI to leverage multiple predictors and extends…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
