Mean Opinion Score as a New Metric for User-Evaluation of XAI Methods

Hyeon Yu; Jenny Benois-Pineau; Romain Bourqui; Romain Giot; Alexey; Zhukov

arXiv:2407.20427·cs.CV·July 31, 2024

Mean Opinion Score as a New Metric for User-Evaluation of XAI Methods

Hyeon Yu, Jenny Benois-Pineau, Romain Bourqui, Romain Giot, Alexey, Zhukov

PDF

Open Access

TL;DR

This paper explores using Mean Opinion Score (MOS), a user-centric metric, to evaluate XAI explanation methods, comparing it with automatic metrics and analyzing their correlation through user experiments.

Contribution

It introduces MOS as a novel user-centric evaluation metric for XAI methods and compares its effectiveness with existing automatic metrics.

Findings

01

MOS correlates highly with IAUC and DAUC for MLFEM

02

Limited overall correlation between user-centric and automatic metrics

03

Highlights need for better alignment between automatic and user evaluations

Abstract

This paper investigates the use of Mean Opinion Score (MOS), a common image quality metric, as a user-centric evaluation metric for XAI post-hoc explainers. To measure the MOS, a user experiment is proposed, which has been conducted with explanation maps of intentionally distorted images. Three methods from the family of feature attribution methods - Gradient-weighted Class Activation Mapping (Grad-CAM), Multi-Layered Feature Explanation Method (MLFEM), and Feature Explanation Method (FEM) - are compared with this metric. Additionally, the correlation of this new user-centric metric with automatic metrics is studied via Spearman's rank correlation coefficient. MOS of MLFEM shows the highest correlation with automatic metrics of Insertion Area Under Curve (IAUC) and Deletion Area Under Curve (DAUC). However, the overall correlations are limited, which highlights the lack of consensus…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Visual Attention and Saliency Detection · Image and Video Quality Assessment