The Promise and Peril of Human Evaluation for Model Interpretability

Bernease Herman

arXiv:1711.07414 · cs.AI · October 31, 2019 · 45 cites

TL;DR

This paper discusses the challenges and potential biases in using human evaluation for model interpretability, emphasizing the need to distinguish between descriptive and persuasive explanations to improve transparency.

Contribution

It introduces a distinction between descriptive and persuasive explanations and highlights the risk of cognitive bias in functional interpretability evaluations.

Findings

01

Functional interpretability may be correlated with cognitive function and user preferences.

02

Evaluating and optimizing explanations with functional metrics could perpetuate implicit cognitive bias.

03

Two research directions are proposed to disambiguate cognitive function and explanation models while retaining control over the tradeoff between accuracy and interpretability.

Abstract

Transparency, user trust, and human comprehension are popular ethical motivations for interpretable machine learning. In support of these goals, researchers evaluate model explanation performance using humans and real-world applications. This alone presents a challenge in many areas of artificial intelligence. In this position paper, we propose a distinction between descriptive and persuasive explanations. We discuss reasoning suggesting that functional interpretability may be correlated with cognitive function and user preferences. If this is indeed the case, evaluation and optimization using functional metrics could perpetuate implicit cognitive bias in explanations that threaten transparency. Finally, we propose two potential research directions to disambiguate cognitive function and explanation models, retaining control over the tradeoff between accuracy and interpretability.

Taxonomy

Topics: Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification

Methods: Interpretability