Quantifying Interpretability and Trust in Machine Learning Systems

Philipp Schmidt; Felix Biessmann

arXiv:1901.08558·cs.LG·January 25, 2019·71 cites

Quantifying Interpretability and Trust in Machine Learning Systems

Philipp Schmidt, Felix Biessmann

PDF

Open Access 1 Repo

TL;DR

This paper introduces quantitative metrics for evaluating interpretability methods and trust in ML decisions, demonstrating their effectiveness through empirical crowdsourcing experiments and highlighting their importance in human-AI collaboration.

Contribution

It proposes novel quantitative measures for interpretability quality and trust, validated by empirical data, to advance evaluation and development of trustworthy ML systems.

Findings

01

The interpretability metric effectively differentiates methods.

02

Explanations significantly increased annotation productivity.

03

The trust metric detects human bias towards ML predictions.

Abstract

Decisions by Machine Learning (ML) models have become ubiquitous. Trusting these decisions requires understanding how algorithms take them. Hence interpretability methods for ML are an active focus of research. A central problem in this context is that both the quality of interpretability methods as well as trust in ML predictions are difficult to measure. Yet evaluations, comparisons and improvements of trust and interpretability require quantifiable measures. Here we propose a quantitative measure for the quality of interpretability methods. Based on that we derive a quantitative measure of trust in ML decisions. Building on previous work we propose to measure intuitive understanding of algorithmic decisions using the information transfer rate at which humans replicate ML model predictions. We provide empirical evidence from crowdsourcing experiments that the proposed metric robustly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

anahid1988/DeepRUL
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Ethics and Social Impacts of AI

MethodsInterpretability