Pitfalls of Conformal Predictions for Medical Image Classification

Hendrik Mehrtens; Tabea Bucher; Titus J. Brinker

arXiv:2506.18162·cs.LG·June 24, 2025

Pitfalls of Conformal Predictions for Medical Image Classification

Hendrik Mehrtens, Tabea Bucher, Titus J. Brinker

PDF

TL;DR

This paper critically examines conformal predictions in medical image classification, highlighting their unreliability under distributional shifts and limitations in small-class scenarios, which are common in medical applications.

Contribution

It identifies key pitfalls and limitations of conformal predictions in medical imaging, emphasizing the need for caution in safety-critical applications.

Findings

01

Unreliable under distributional shifts in medical images

02

Not suitable for subset predictions like individual classes

03

Limited practical value in small-class classification tasks

Abstract

Reliable uncertainty estimation is one of the major challenges for medical classification tasks. While many approaches have been proposed, recently the statistical framework of conformal predictions has gained a lot of attention, due to its ability to provide provable calibration guarantees. Nonetheless, the application of conformal predictions in safety-critical areas such as medicine comes with pitfalls, limitations and assumptions that practitioners need to be aware of. We demonstrate through examples from dermatology and histopathology that conformal predictions are unreliable under distributional shifts in input and label variables. Additionally, conformal predictions should not be used for selecting predictions to improve accuracy and are not reliable for subsets of the data, such as individual classes or patient attributes. Moreover, in classification settings with a small number…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.