TL;DR
This paper introduces a novel method for generating counterfactual explanations in medical imaging by leveraging a Diffusion Autoencoder's latent space, improving interpretability and enabling visualization of decision boundaries without requiring labeled data.
Contribution
The method operates directly on the DAE's latent space to produce both binary and ordinal counterfactual explanations, enhancing interpretability in medical image classification and regression tasks.
Findings
Effective in classifying medical conditions like VCF and DR.
Supports visualization of continuous decision boundaries.
Demonstrates improved interpretability over existing methods.
Abstract
Counterfactual explanations (CEs) aim to enhance the interpretability of machine learning models by illustrating how alterations in input features would affect the resulting predictions. Common CE approaches require an additional model and are typically constrained to binary counterfactuals. In contrast, we propose a novel method that operates directly on the latent space of a generative model, specifically a Diffusion Autoencoder (DAE). This approach offers inherent interpretability by enabling the generation of CEs and the continuous visualization of the model's internal representation across decision boundaries. Our method leverages the DAE's ability to encode images into a semantically rich latent space in an unsupervised manner, eliminating the need for labeled data or separate feature extraction models. We show that these latent representations are helpful for medical condition…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDiffusion
