Manipulating and Measuring Model Interpretability

Forough Poursabzi-Sangdeh; Daniel G. Goldstein; Jake M. Hofman,; Jennifer Wortman Vaughan; Hanna Wallach

arXiv:1802.07810·cs.AI·August 17, 2021

Manipulating and Measuring Model Interpretability

Forough Poursabzi-Sangdeh, Daniel G. Goldstein, Jake M. Hofman,, Jennifer Wortman Vaughan, Hanna Wallach

PDF

1 Repo

TL;DR

This study experimentally investigates how different factors like feature count and transparency affect model interpretability, revealing that clearer models improve simulation but may hinder mistake detection due to information overload.

Contribution

It provides empirical evidence that transparency and feature count influence interpretability, challenging assumptions about what makes models more understandable.

Findings

01

Clear models improve prediction simulation

02

Transparency can reduce mistake detection ability

03

More features do not necessarily enhance interpretability

Abstract

With machine learning models being increasingly used to aid decision making even in high-stakes domains, there has been a growing interest in developing interpretable models. Although many supposedly interpretable models have been proposed, there have been relatively few experimental studies investigating whether these models achieve their intended effects, such as making people more closely follow a model's predictions when it is beneficial for them to do so or enabling them to detect when a model has made a mistake. We present a sequence of pre-registered experiments (N=3,800) in which we showed participants functionally identical models that varied only in two factors commonly thought to make machine learning models more or less interpretable: the number of features and the transparency of the model (i.e., whether the model internals are clear or black box). Predictably, participants…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

foroughp/manipulating-and-measuring-model-interpretability
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.