Performance is not enough: the story told by a Rashomon quartet

Przemyslaw Biecek; Hubert Baniecki; Mateusz Krzyzinski; Dianne Cook

arXiv:2302.13356·stat.ML·September 11, 2024·1 cites

Performance is not enough: the story told by a Rashomon quartet

Przemyslaw Biecek, Hubert Baniecki, Mateusz Krzyzinski, Dianne Cook

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates that multiple models with similar predictive accuracy can have fundamentally different explanations of data relationships, highlighting the importance of visualization beyond performance metrics.

Contribution

It introduces the Rashomon Quartet, a set of four models with similar performance but different explanations, emphasizing the need for visualization in model comparison.

Findings

01

Models with similar accuracy can have different data explanations

02

Visual analysis reveals diverse relationships despite comparable performance

03

Encourages use of visualization to understand model differences

Abstract

The usual goal of supervised learning is to find the best model, the one that optimizes a particular performance measure. However, what if the explanation provided by this model is completely different from another model and different again from another model despite all having similarly good fit statistics? Is it possible that the equally effective models put the spotlight on different relationships in the data? Inspired by Anscombe's quartet, this paper introduces a Rashomon Quartet, i.e. a set of four models built on a synthetic dataset which have practically identical predictive performance. However, the visual exploration reveals distinct explanations of the relations in the data. This illustrative example aims to encourage the use of methods for model visualization to compare predictive models beyond their performance.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mi2datalab/rashomon-quartet
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Visualization and Analytics · Time Series Analysis and Forecasting · Data Analysis with R