Counterfactual Explanations for Medical Image Classification and Regression using Diffusion Autoencoder

Matan Atad; David Schinz; Hendrik Moeller; Robert Graf; Benedikt Wiestler; Daniel Rueckert; Nassir Navab; Jan S. Kirschke; Matthias Keicher

arXiv:2408.01571·cs.CV·October 2, 2024

TL;DR

This paper generates counterfactual explanations for medical images directly in the latent space of a Diffusion Autoencoder (DAE). Because the DAE learns its latent space without labels, no labeled data or separate feature extraction model is needed for encoding, and the model's internal representation can be visualized continuously across decision boundaries.

Contribution

The method operates directly on the DAE's latent space to produce both binary and ordinal counterfactual explanations, enhancing interpretability in medical image classification and regression tasks.

Findings

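01

The DAE's latent representations are useful for classifying medical conditions such as vertebral compression fractures (VCF) and diabetic retinopathy (DR).

02

The method supports continuous visualization of the model's representation across decision boundaries.

03

The authors report improved interpretability compared with existing counterfactual explanation methods.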

Abstract

Counterfactual explanations (CEs) aim to enhance the interpretability of machine learning models by illustrating how alterations in input features would affect the resulting predictions. Common CE approaches require an additional model and are typically constrained to binary counterfactuals. In contrast, we propose a novel method that operates directly on the latent space of a generative model, specifically a Diffusion Autoencoder (DAE). This approach offers inherent interpretability by enabling the generation of CEs and the continuous visualization of the model's internal representation across decision boundaries. Our method leverages the DAE's ability to encode images into a semantically rich latent space in an unsupervised manner, eliminating the need for labeled data or separate feature extraction models. We show that these latent representations are helpful for medical condition…
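
The idea of moving a latent code across a decision boundary can be illustrated with a minimal sketch. It assumes a linear classifier on top of the latent codes, which this excerpt does not specify, and it uses toy vectors instead of real DAE encodings. The names `logit`, `shift_to_logit`, and `sweep` are hypothetical and are not taken from the authors' repository. Decoding the shifted codes back to images with the DAE is omitted.

```python
import math

def logit(z, w, b):
    """Score of a linear classifier (assumed here) on latent code z."""
    return sum(wi * zi for wi, zi in zip(w, z)) + b

def shift_to_logit(z, w, b, target):
    """Move z along the weight direction w until its logit equals target.

    This is the smallest Euclidean shift that reaches the target logit
    for a linear model; it is an illustrative choice, not the paper's
    confirmed procedure.
    """
    norm_sq = sum(wi * wi for wi in w)
    step = (target - logit(z, w, b)) / norm_sq
    return [zi + step * wi for zi, wi in zip(z, w)]

def sweep(z, w, b, targets):
    """Latent codes for a series of target logits (a continuous traversal)."""
    return [shift_to_logit(z, w, b, t) for t in targets]

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

# Toy latent code and classifier parameters (placeholders, not real data).
z = [0.5, -1.0, 2.0]
w = [1.0, 2.0, -1.0]
b = 0.5

# Binary counterfactual: flip the sign of the logit.
z_cf = shift_to_logit(z, w, b, -logit(z, w, b))

# Continuous traversal from the original logit to the flipped one.
start = logit(z, w, b)
path = sweep(z, w, b, [start + k * (-2 * start) / 4 for k in range(5)])

for code in path:
    print(round(sigmoid(logit(code, w, b)), 3))
```

In a real pipeline each code in `path` would be passed to the DAE decoder, so the intermediate images show how the input changes as the prediction crosses the boundary. For an ordinal or regression target, the same shift could be applied with the target set to the desired grade or value rather than a flipped logit.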

Code & Models

Repositories

matanat/dae_counterfactual (PyTorch, official)

Taxonomy

Methods: Diffusion