CX-ToM: Counterfactual Explanations with Theory-of-Mind for Enhancing Human Trust in Image Recognition Models
Arjun R. Akula, Keze Wang, Changsong Liu, Sari Saba-Sadiya, Hongjing, Lu, Sinisa Todorovic, Joyce Chai, and Song-Chun Zhu

TL;DR
CX-ToM introduces an interactive, theory-of-mind-based framework for generating counterfactual explanations in image recognition, improving human trust and understanding of CNN decisions through iterative dialogue.
Contribution
It presents a novel, dialog-based XAI framework using counterfactual explanations and theory of mind, surpassing traditional attention-based methods in fostering trust.
Findings
CX-ToM outperforms existing XAI models in experiments.
Counterfactual explanations enhance user understanding.
Iterative explanations foster greater trust in CNNs.
Abstract
We propose CX-ToM, short for counterfactual explanations with theory-of mind, a new explainable AI (XAI) framework for explaining decisions made by a deep convolutional neural network (CNN). In contrast to the current methods in XAI that generate explanations as a single shot response, we pose explanation as an iterative communication process, i.e. dialog, between the machine and human user. More concretely, our CX-ToM framework generates sequence of explanations in a dialog by mediating the differences between the minds of machine and human user. To do this, we use Theory of Mind (ToM) which helps us in explicitly modeling human's intention, machine's mind as inferred by the human as well as human's mind as inferred by the machine. Moreover, most state-of-the-art XAI frameworks provide attention (or heat map) based explanations. In our work, we show that these attention based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Artificial Intelligence in Healthcare and Education
