Validation of Conformal Prediction in Cervical Atypia Classification
Misgina Tsighe Hagos, Antti Suutala, Dmitrii Bychkov, Hakan K\"uc\"ukel, Joar von Bahr, Milda Poceviciute, Johan Lundin, Nina Linder, Claes Lundstr\"om

TL;DR
This study validates conformal prediction methods for cervical atypia classification, highlighting their limitations in aligning with human labels and their potential in identifying ambiguous or out-of-distribution cases.
Contribution
It provides a comprehensive validation of conformal prediction in medical diagnosis, emphasizing the importance of truthful prediction sets aligned with human expectations.
Findings
Conventional coverage metrics overestimate performance.
Current methods often produce misaligned prediction sets.
Conformal prediction can identify ambiguous and out-of-distribution data.
Abstract
Deep learning based cervical cancer classification can potentially increase access to screening in low-resource regions. However, deep learning models are often overconfident and do not reliably reflect diagnostic uncertainty. Moreover, they are typically optimized to generate maximum-likelihood predictions, which fail to convey uncertainty or ambiguity in their results. Such challenges can be addressed using conformal prediction, a model-agnostic framework for generating prediction sets that contain likely classes for trained deep-learning models. The size of these prediction sets indicates model uncertainty, contracting as model confidence increases. However, existing conformal prediction evaluation primarily focuses on whether the prediction set includes or covers the true class, often overlooking the presence of extraneous classes. We argue that prediction sets should be truthful…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Imaging and Analysis
MethodsSparse Evolutionary Training · ALIGN
