Trustworthy image-to-image translation: evaluating uncertainty calibration in unpaired training scenarios
Ciaran Bench, Emir Ahmed, Spencer A. Thomas

TL;DR
This paper evaluates the calibration and trustworthiness of unpaired image-to-image translation models, specifically cycleGAN and SynDiff, in medical imaging, using uncertainty quantification to improve their reliability without ground truth data.
Contribution
It introduces a scheme to assess calibration quality of unpaired translation models, enhancing trustworthiness in medical imaging applications.
Findings
Uncertainty quantification improves model trustworthiness.
Calibration evaluation scheme is effective without ground truths.
CycleGAN and SynDiff show different calibration behaviors.
Abstract
Mammographic screening is an effective method for detecting breast cancer, facilitating early diagnosis. However, the current need to manually inspect images places a heavy burden on healthcare systems, spurring a desire for automated diagnostic protocols. Techniques based on deep neural networks have been shown effective in some studies, but their tendency to overfit leaves considerable risk for poor generalisation and misdiagnosis, preventing their widespread adoption in clinical settings. Data augmentation schemes based on unpaired neural style transfer models have been proposed that improve generalisability by diversifying the representations of training image features in the absence of paired training data (images of the same tissue in either image style). But these models are similarly prone to various pathologies, and evaluating their performance is challenging without ground…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI)
