CX-ToM: Counterfactual Explanations with Theory-of-Mind for Enhancing   Human Trust in Image Recognition Models

Arjun R. Akula; Keze Wang; Changsong Liu; Sari Saba-Sadiya; Hongjing; Lu; Sinisa Todorovic; Joyce Chai; and Song-Chun Zhu

arXiv:2109.01401·cs.AI·December 6, 2021

CX-ToM: Counterfactual Explanations with Theory-of-Mind for Enhancing Human Trust in Image Recognition Models

Arjun R. Akula, Keze Wang, Changsong Liu, Sari Saba-Sadiya, Hongjing, Lu, Sinisa Todorovic, Joyce Chai, and Song-Chun Zhu

PDF

Open Access 1 Repo

TL;DR

CX-ToM introduces an interactive, theory-of-mind-based framework for generating counterfactual explanations in image recognition, improving human trust and understanding of CNN decisions through iterative dialogue.

Contribution

It presents a novel, dialog-based XAI framework using counterfactual explanations and theory of mind, surpassing traditional attention-based methods in fostering trust.

Findings

01

CX-ToM outperforms existing XAI models in experiments.

02

Counterfactual explanations enhance user understanding.

03

Iterative explanations foster greater trust in CNNs.

Abstract

We propose CX-ToM, short for counterfactual explanations with theory-of mind, a new explainable AI (XAI) framework for explaining decisions made by a deep convolutional neural network (CNN). In contrast to the current methods in XAI that generate explanations as a single shot response, we pose explanation as an iterative communication process, i.e. dialog, between the machine and human user. More concretely, our CX-ToM framework generates sequence of explanations in a dialog by mediating the differences between the minds of machine and human user. To do this, we use Theory of Mind (ToM) which helps us in explicitly modeling human's intention, machine's mind as inferred by the human as well as human's mind as inferred by the machine. Moreover, most state-of-the-art XAI frameworks provide attention (or heat map) based explanations. In our work, we show that these attention based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

arjunakula/faultline_explainer
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Artificial Intelligence in Healthcare and Education