CBMA: Improving conformal prediction through Bayesian model averaging
Pankaj Bhagwat, Linglong Kong, Bei Jiang

TL;DR
This paper introduces CBMA, a hybrid conformal prediction method combining Bayesian model averaging to enhance robustness and efficiency in predictive inference, especially under model misspecification.
Contribution
It proposes a novel Bayesian model averaging approach integrated with conformal prediction, ensuring asymptotic optimality even when models are misspecified.
Findings
Prediction sets converge to optimal efficiency if the true model is included.
CBMA outperforms traditional conformal methods under model uncertainty.
Theoretical guarantees of convergence and robustness are established.
Abstract
Conformal prediction has emerged as a popular technique for facilitating valid predictive inference across a spectrum of machine learning models, under minimal assumption of exchangeability. Recently, Hoff (2023) showed that full conformal Bayes provides the most efficient prediction sets (smallest by expected volume) among all prediction sets that are valid at the level if the model is correctly specified. However, a critical issue arises when the Bayesian model itself may be mis-specified, resulting in prediction set that might be suboptimal, even though it still enjoys the frequentist coverage guarantee. To address this limitation, we propose an innovative solution that combines Bayesian model averaging (BMA) with conformal prediction. This hybrid not only leverages the strengths of Bayesian conformal prediction but also introduces a layer of robustness through model…
Peer Reviews
Decision·ICLR 2025 Poster
I found the article quite well written. Moreover, the proposal is of clear theoretical and applied interest, and developed with care and mathematical soundness.
I fear that the authors have oversold their claim. In fact, in the literature proposals of conformal model averaging can be found (https://www.sciencedirect.com/science/article/pii/S0925231219316108, https://arxiv.org/abs/2408.06642#:~:text=Unlike%20traditional%20methods%2C%20conformal%20ensembling,%2Dto%2Dinterpret%20uncertainty%20estimates. to name a few). I found the simulation quite lacking in terms of depth, as well and presentation of the results
While several works in the field of conformal prediction have applied a Bayesian perspective, this paper is meaningful since it is the first to attempt combining “multiple models” through Bayesian model averaging, marking a novel contribution. The proposed framework is straightforward, integrating traditional Bayesian model averaging directly, yet it is powerful as it incorporates model uncertainty more fully into the framework.
This work is limited by a lack of discussion comparing its approach to existing model aggregation methods in conformal prediction including frequentist’s perspective. Although Section 2.5 touches on related works, the primary advantage highlighted is the ability to use the full data, which may be better understood as a contribution from Fong & Holmes (2021) rather than an original novelty here. To justify the method empirically, it would be more reasonable to benchmark a few of these existing me
The results presented in the paper confirm the effectiveness of the proposed method.
1.The novelty of the work is not very significant, as it simply combines several existing methods. 2.The names of the subsections under the "Simulation" section are not entirely appropriate, since quadratic model is a type of non-linear model. It may be more appropriate to title whether the model is polynomial. 3.The paper seems not explain why the results for model 10 and model 11 are missing when $n=50$ in Table 2. 4.The paper contains a few typographical errors. For example, the last sen
Videos
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning
