VAE-CE: Visual Contrastive Explanation using Disentangled VAEs
Yoeri Poels, Vlado Menkovski

TL;DR
This paper introduces VAE-CE, a model that uses disentangled VAEs to represent high-level concepts for classification and contrastive explanations, improving interpretability over existing methods.
Contribution
The paper proposes a novel VAE-based model that generates contrastive explanations using disentangled representations, with a new supervised disentangling method.
Findings
Effective disentanglement on synthetic data and MNIST
Contrastive explanations improve interpretability
Outperforms other explanation methods
Abstract
The goal of a classification model is to assign the correct labels to data. In most cases, this data is not fully described by the given set of labels. Often a rich set of meaningful concepts exist in the domain that can much more precisely describe each datapoint. Such concepts can also be highly useful for interpreting the model's classifications. In this paper we propose a model, denoted as Variational Autoencoder-based Contrastive Explanation (VAE-CE), that represents data with high-level concepts and uses this representation for both classification and generating explanations. The explanations are produced in a contrastive manner, conveying why a datapoint is assigned to one class rather than an alternative class. An explanation is specified as a set of transformations of the input datapoint, with each step depicting a concept changing towards the contrastive class. We build the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCell Image Analysis Techniques · Explainable Artificial Intelligence (XAI) · Anomaly Detection Techniques and Applications
