A Case Study on Designing Evaluations of ML Explanations with Simulated   User Studies

Ada Martin; Valerie Chen; S\'ergio Jesus; Pedro Saleiro

arXiv:2302.07444·cs.LG·March 22, 2023

A Case Study on Designing Evaluations of ML Explanations with Simulated User Studies

Ada Martin, Valerie Chen, S\'ergio Jesus, Pedro Saleiro

PDF

Open Access

TL;DR

This paper investigates the use of simulated user evaluations (SimEvals) as a cost-effective method to assess ML explanations in e-commerce fraud detection, finding that SimEvals can replicate key results of real user studies and aid in designing evaluations.

Contribution

First application of SimEvals on a real-world use case demonstrating their potential to replicate user study findings and inform evaluation design.

Findings

01

SimEvals suggested all explainers performed equally, matching user study results.

02

SimEvals indicated explanations did not outperform baseline models.

03

Results support using SimEvals as a preliminary evaluation step.

Abstract

When conducting user studies to ascertain the usefulness of model explanations in aiding human decision-making, it is important to use real-world use cases, data, and users. However, this process can be resource-intensive, allowing only a limited number of explanation methods to be evaluated. Simulated user evaluations (SimEvals), which use machine learning models as a proxy for human users, have been proposed as an intermediate step to select promising explanation methods. In this work, we conduct the first SimEvals on a real-world use case to evaluate whether explanations can better support ML-assisted decision-making in e-commerce fraud detection. We study whether SimEvals can corroborate findings from a user study conducted in this fraud detection context. In particular, we find that SimEvals suggest that all considered explainers are equally performant, and none beat a baseline…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Scientific Computing and Data Management · Machine Learning and Data Classification

MethodsNone