Adapting Prediction Sets to Distribution Shifts Without Labels
Kevin Kasa, Zhiyu Zhang, Heng Yang, Graham W. Taylor

TL;DR
This paper introduces ECP and EACP, two methods that adapt conformal prediction sets to distribution shifts using only unlabeled data, significantly improving their robustness without requiring labels.
Contribution
The paper proposes two novel methods, ECP and EACP, for improving conformal prediction under distribution shifts using unlabeled data, without needing ground truth labels.
Findings
ECP and EACP outperform existing baselines.
Methods nearly match fully supervised performance.
Consistent improvements across large-scale datasets.
Abstract
Recently there has been a surge of interest to deploy confidence set predictions rather than point predictions in machine learning. Unfortunately, the effectiveness of such prediction sets is frequently impaired by distribution shifts in practice, and the challenge is often compounded by the lack of ground truth labels at test time. Focusing on a standard set-valued prediction framework called conformal prediction (CP), this paper studies how to improve its practical performance using only unlabeled data from the shifted test domain. This is achieved by two new methods called ECP and EACP, whose main idea is to adjust the score function in CP according to its base model's own uncertainty evaluation. Through extensive experiments on a number of large-scale datasets and neural network architectures, we show that our methods provide consistent improvement over existing baselines and nearly…
Peer Reviews
Decision·UAI 2025 Poster
The investigation presented in this paper on Conformal Prediction (CP) under distribution shifts is of considerable value, addressing an area that has received relatively little attention in existing studies.
The motivation behind this paper is that, under distribution shifts the overall output confidence of models tends to decrease, thereby affecting the performance of CP. However, there are several aspects in the method design that currently fail to convince me, such as: 1. Figure 1 observes the relationship between the model's output confidence and entropy, using this relationship as a key observation to guide the method design. But isn't this correlation trivial? Entropy and confidence are both
This paper attacks an important problem of learning conformal sets under distribution shifts. The observation is simple and intuitive. It demonstrated the efficacy of the proposed method by using at least 6 shifted datasets.
I think the main concern of this paper is that the proposed method does not provide any theoretical guarantee, which is the crux of conformal prediction. Including this, see the following for additional concerns. * The proposed method is heuristic, which does not align with paper trends of conformal prediction. Can you provide a theoretical coverage guarantee (i.e., the coverage probability bound) of the proposed method under the covariate shift assumption? * For achieving a desired coverage,
The preliminaries are well formulated.
Please refer to the questions below
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Time Series Analysis and Forecasting · Neural Networks and Applications
MethodsBalanced Selection
