It Takes Two to Tango: Towards Theory of AI's Mind
Arjun Chandrasekaran, Deshraj Yadav, Prithvijit Chattopadhyay, Viraj, Prabhu, Devi Parikh

TL;DR
This paper emphasizes the importance of humans developing a theory of AI's mind for effective collaboration, demonstrating that minimal training can improve prediction of AI responses in Visual Question Answering, despite current interpretability tools.
Contribution
It introduces the concept of humans developing a theory of AI's mind (ToAIM) and empirically shows that brief training improves prediction of AI behavior in VQA tasks, challenging the effectiveness of existing interpretability methods.
Findings
Few-shot training (50 examples) improves human prediction of AI responses.
Existing interpretability tools like confidence scores and attention maps do not enhance prediction accuracy.
Humans can learn to anticipate AI failures with minimal training.
Abstract
Theory of Mind is the ability to attribute mental states (beliefs, intents, knowledge, perspectives, etc.) to others and recognize that these mental states may differ from one's own. Theory of Mind is critical to effective communication and to teams demonstrating higher collective performance. To effectively leverage the progress in Artificial Intelligence (AI) to make our lives more productive, it is important for humans and AI to work well together in a team. Traditionally, there has been much emphasis on research to make AI more accurate, and (to a lesser extent) on having it better understand human intentions, tendencies, beliefs, and contexts. The latter involves making AI more human-like and having it develop a theory of our minds. In this work, we argue that for human-AI teams to be effective, humans must also develop a theory of AI's mind (ToAIM) - get to know its strengths,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Visual Attention and Saliency Detection
