X-ToM: Explaining with Theory-of-Mind for Gaining Justified Human Trust
Arjun R. Akula, Changsong Liu, Sari Saba-Sadiya, Hongjing Lu, Sinisa Todorovic, Joyce Y. Chai, Song-Chun Zhu

TL;DR
This paper introduces X-ToM, an explainable AI framework that uses Theory of Mind to generate iterative, dialog-based explanations, enhancing justified human trust and understanding across visual recognition tasks.
Contribution
It is the first to incorporate Theory of Mind into XAI for modeling human and machine mental states, improving explanation naturalness and trust.
Findings
Significantly outperforms state-of-the-art XAI methods in trust and reliance metrics.
Effective across multiple visual recognition tasks.
Human studies confirm improved explanation satisfaction.
Abstract
We present a new explainable AI (XAI) framework aimed at increasing justified human trust and reliance in the AI machine through explanations. We pose explanation as an iterative communication process, i.e., dialog, between the machine and human user. More concretely, the machine generates a sequence of explanations in a dialog that takes into account three important aspects at each dialog turn: (a) the human's intention (or curiosity); (b) the human's understanding of the machine; and (c) the machine's understanding of the human user. To do this, we use Theory of Mind (ToM), which helps us explicitly model the human's intention, the machine's mind as inferred by the human, and the human's mind as inferred by the machine. In other words, these explicit mental representations in ToM are incorporated to learn an optimal explanation policy that takes into account the human's perception and beliefs.…
Taxonomy
Topics: Explainable Artificial Intelligence (XAI) · Machine Learning in Healthcare · Artificial Intelligence in Healthcare and Education
