Counterfactual Explanations via Riemannian Latent Space Traversal
Paraskevas Pegios, Aasa Feragen, Andreas Abildtrup Hansen, Georgios, Arvanitidis

TL;DR
This paper proposes a novel method for generating counterfactual explanations by leveraging a Riemannian metric in the latent space of generative models, accounting for the data's geometric complexity to produce more natural and reliable counterfactuals.
Contribution
It introduces a Riemannian metric-based approach for counterfactual explanations that considers the nonlinear decoder's geometry, improving the realism of generated counterfactuals.
Findings
Produces high-fidelity counterfactual trajectories
Outperforms naive latent space traversal methods
Effective on real-world tabular datasets
Abstract
The adoption of increasingly complex deep models has fueled an urgent need for insight into how these models make predictions. Counterfactual explanations form a powerful tool for providing actionable explanations to practitioners. Previously, counterfactual explanation methods have been designed by traversing the latent space of generative models. Yet, these latent spaces are usually greatly simplified, with most of the data distribution complexity contained in the decoder rather than the latent embedding. Thus, traversing the latent space naively without taking the nonlinear decoder into account can lead to unnatural counterfactual trajectories. We introduce counterfactual explanations obtained using a Riemannian metric pulled back via the decoder and the classifier under scrutiny. This metric encodes information about the complex geometric structure of the data and the learned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Generative Adversarial Networks and Image Synthesis · Machine Learning in Healthcare
