Semi-Supervised Risk Control via Prediction-Powered Inference

Bat-Sheva Einbinder; Liran Ringel; Yaniv Romano

arXiv:2412.11174·cs.LG·July 29, 2025

Semi-Supervised Risk Control via Prediction-Powered Inference

Bat-Sheva Einbinder, Liran Ringel, Yaniv Romano

PDF

Open Access

TL;DR

This paper introduces a semi-supervised calibration method for risk-controlling prediction sets that uses unlabeled data to improve error rate tuning, demonstrated through real-data experiments.

Contribution

It proposes a novel semi-supervised calibration procedure for RCPS that leverages unlabeled data to reduce conservativeness and improve error control.

Findings

01

Improved error rate control in limited data scenarios

02

Effective application to few-shot image classification

03

Enhanced early time series classification performance

Abstract

The risk-controlling prediction sets (RCPS) framework is a general tool for transforming the output of any machine learning model to design a predictive rule with rigorous error rate control. The key idea behind this framework is to use labeled hold-out calibration data to tune a hyper-parameter that affects the error rate of the resulting prediction rule. However, the limitation of such a calibration scheme is that with limited hold-out data, the tuned hyper-parameter becomes noisy and leads to a prediction rule with an error rate that is often unnecessarily conservative. To overcome this sample-size barrier, we introduce a semi-supervised calibration procedure that leverages unlabeled data to rigorously tune the hyper-parameter without compromising statistical validity. Our procedure builds upon the prediction-powered inference framework, carefully tailoring it to risk-controlling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Machine Learning in Healthcare