Leveraging Conditional Generative Models in a General Explanation Framework of Classifier Decisions
Martin Charachon, Paul-Henry Courn\`ede, C\'eline Hudelot, Roberto, Ardon

TL;DR
This paper introduces a novel framework for visual explanations of classifier decisions using conditional generative models, producing more accurate and human-aligned explanations than existing methods.
Contribution
It proposes a general explanation approach based on two trained generative models that produce contrasting images, improving explanation quality and consistency.
Findings
Significant improvements over state-of-the-art methods.
Generated explanations align well with human annotations.
Framework applicable across multiple datasets.
Abstract
Providing a human-understandable explanation of classifiers' decisions has become imperative to generate trust in their use for day-to-day tasks. Although many works have addressed this problem by generating visual explanation maps, they often provide noisy and inaccurate results forcing the use of heuristic regularization unrelated to the classifier in question. In this paper, we propose a new general perspective of the visual explanation problem overcoming these limitations. We show that visual explanation can be produced as the difference between two generated images obtained via two specific conditional generative models. Both generative models are trained using the classifier to explain and a database to enforce the following properties: (i) All images generated by the first generator are classified similarly to the input image, whereas the second generator's outputs are classified…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning
