CoLa-DCE -- Concept-guided Latent Diffusion Counterfactual Explanations
Franz Motzkus, Christian Hellert, Ute Schmid

TL;DR
CoLa-DCE introduces a novel method for generating transparent, concept-guided counterfactual explanations for image classifiers, enhancing interpretability by controlling feature changes and visualizing their spatial localization.
Contribution
It presents a new diffusion-based approach that improves the transparency and granularity of counterfactual explanations in computer vision models.
Findings
Enhanced control over concept selection and spatial conditioning.
Counterfactuals with minimal feature changes and better interpretability.
Insights into model errors like misclassification cases.
Abstract
Recent advancements in generative AI have introduced novel prospects and practical implementations. Especially diffusion models show their strength in generating diverse and, at the same time, realistic features, positioning them well for generating counterfactual explanations for computer vision models. Answering "what if" questions of what needs to change to make an image classifier change its prediction, counterfactual explanations align well with human understanding and consequently help in making model behavior more comprehensible. Current methods succeed in generating authentic counterfactuals, but lack transparency as feature changes are not directly perceivable. To address this limitation, we introduce Concept-guided Latent Diffusion Counterfactual Explanations (CoLa-DCE). CoLa-DCE generates concept-guided counterfactuals for any classifier with a high degree of control…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBusiness Process Modeling and Analysis · Topic Modeling · Explainable Artificial Intelligence (XAI)
MethodsALIGN · Counterfactuals Explanations · Diffusion
