Stochastic Online Conformal Prediction with Semi-Bandit Feedback

Haosen Ge; Hamsa Bastani; Osbert Bastani

arXiv:2405.13268·cs.LG·May 28, 2025

Stochastic Online Conformal Prediction with Semi-Bandit Feedback

Haosen Ge, Hamsa Bastani, Osbert Bastani

PDF

Open Access

TL;DR

This paper introduces a new online conformal prediction method that operates under semi-bandit feedback, providing high-probability label set guarantees with sublinear regret in dynamic data environments.

Contribution

It proposes a novel conformal prediction algorithm for online semi-bandit feedback settings, with theoretical regret guarantees and empirical validation across multiple tasks.

Findings

01

Achieves sublinear regret compared to the optimal conformal predictor.

02

Performs well empirically on retrieval, image classification, and auction tasks.

03

Outperforms several baseline methods in experiments.

Abstract

Conformal prediction has emerged as an effective strategy for uncertainty quantification by modifying a model to output sets of labels instead of a single label. These prediction sets come with the guarantee that they contain the true label with high probability. However, conformal prediction typically requires a large calibration dataset of i.i.d. examples. We consider the online learning setting, where examples arrive over time, and the goal is to construct prediction sets dynamically. Departing from existing work, we assume semi-bandit feedback, where we only observe the true label if it is contained in the prediction set. For instance, consider calibrating a document retrieval model to a new domain; in this setting, a user would only be able to provide the true label if the target document is in the prediction set of retrieved documents. We propose a novel conformal prediction…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Data Stream Mining Techniques · Machine Learning and ELM

MethodsSparse Evolutionary Training