Pitfalls of Conformal Predictions for Medical Image Classification
Hendrik Mehrtens, Tabea Bucher, Titus J. Brinker

TL;DR
This paper critically examines conformal predictions in medical image classification, highlighting their unreliability under distributional shifts and limitations in small-class scenarios, which are common in medical applications.
Contribution
It identifies key pitfalls and limitations of conformal predictions in medical imaging, emphasizing the need for caution in safety-critical applications.
Findings
Unreliable under distributional shifts in medical images
Not suitable for subset predictions like individual classes
Limited practical value in small-class classification tasks
Abstract
Reliable uncertainty estimation is one of the major challenges for medical classification tasks. While many approaches have been proposed, recently the statistical framework of conformal predictions has gained a lot of attention, due to its ability to provide provable calibration guarantees. Nonetheless, the application of conformal predictions in safety-critical areas such as medicine comes with pitfalls, limitations and assumptions that practitioners need to be aware of. We demonstrate through examples from dermatology and histopathology that conformal predictions are unreliable under distributional shifts in input and label variables. Additionally, conformal predictions should not be used for selecting predictions to improve accuracy and are not reliable for subsets of the data, such as individual classes or patient attributes. Moreover, in classification settings with a small number…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
