DeCoDEx: Confounder Detector Guidance for Improved Diffusion-based Counterfactual Explanations
Nima Fathi, Amar Kumar, Brennan Nichyporuk, Mohammad Havaei, Tal Arbel

TL;DR
DeCoDEx introduces a framework that uses an external artifact detector to guide diffusion-based counterfactual image generation, improving explainability and bias mitigation in medical image classifiers.
Contribution
The paper proposes a novel method combining artifact detection with diffusion models to enhance counterfactual explanations and reduce bias in deep learning classifiers.
Findings
Successfully generates counterfactual images that alter causal markers while ignoring artifacts.
Improves classifier performance on underrepresented, out-of-distribution groups.
Demonstrates effectiveness on CheXpert dataset with synthetic and real artifacts.
Abstract
Deep learning classifiers are prone to latching onto dominant confounders present in a dataset rather than on the causal markers associated with the target class, leading to poor generalization and biased predictions. Although explainability via counterfactual image generation has been successful at exposing the problem, bias mitigation strategies that permit accurate explainability in the presence of dominant and diverse artifacts remain unsolved. In this work, we propose the DeCoDEx framework and show how an external, pre-trained binary artifact detector can be leveraged during inference to guide a diffusion-based counterfactual image generator towards accurate explainability. Experiments on the CheXpert dataset, using both synthetic artifacts and real visual artifacts (support devices), show that the proposed method successfully synthesizes the counterfactual images that change the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Scientific Computing and Data Management · Explainable Artificial Intelligence (XAI)
