How do Humans Understand Explanations from Machine Learning Systems? An Evaluation of the Human-Interpretability of Explanation
Menaka Narayanan, Emily Chen, Jeffrey He, Been Kim, Sam Gershman,, Finale Doshi-Velez

TL;DR
This paper investigates what makes machine learning explanations truly human-interpretable by conducting user studies to assess how explanation complexity affects human verification time.
Contribution
It provides empirical insights into the factors influencing human interpretability of explanations in machine learning, focusing on verification tasks.
Findings
Increased explanation complexity significantly raises verification time.
Some types of explanation complexity have minimal impact on human verification.
The study offers guidelines for designing more interpretable machine learning explanations.
Abstract
Recent years have seen a boom in interest in machine learning systems that can provide a human-understandable rationale for their predictions or decisions. However, exactly what kinds of explanation are truly human-interpretable remains poorly understood. This work advances our understanding of what makes explanations interpretable in the specific context of verification. Suppose we have a machine learning system that predicts X, and we provide rationale for this prediction X. Given an input, an explanation, and an output, is the output consistent with the input and the supposed rationale? Via a series of user-studies, we identify what kinds of increases in complexity have the greatest effect on the time it takes for humans to verify the rationale, and which seem relatively insensitive.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification
