A Comparative Study of Faithfulness Metrics for Model Interpretability   Methods

Chun Sik Chan; Huanqi Kong; Guanqing Liang

arXiv:2204.05514·cs.CL·April 13, 2022

A Comparative Study of Faithfulness Metrics for Model Interpretability Methods

Chun Sik Chan, Huanqi Kong, Guanqing Liang

PDF

Open Access

TL;DR

This paper compares various faithfulness metrics for model interpretability, revealing conflicting preferences among them and evaluating their diagnosticity and computational efficiency.

Contribution

It provides a comprehensive comparison of faithfulness metrics, introducing assessment dimensions and highlighting the superior diagnosticity and efficiency of sufficiency and comprehensiveness.

Findings

01

Sufficiency and comprehensiveness metrics have higher diagnosticity.

02

These metrics also exhibit lower time complexity.

03

Conflicting preferences are observed among different faithfulness metrics.

Abstract

Interpretation methods to reveal the internal reasoning processes behind machine learning models have attracted increasing attention in recent years. To quantify the extent to which the identified interpretations truly reflect the intrinsic decision-making mechanisms, various faithfulness evaluation metrics have been proposed. However, we find that different faithfulness metrics show conflicting preferences when comparing different interpretations. Motivated by this observation, we aim to conduct a comprehensive and comparative study of the widely adopted faithfulness metrics. In particular, we introduce two assessment dimensions, namely diagnosticity and time complexity. Diagnosticity refers to the degree to which the faithfulness metric favours relatively faithful interpretations over randomly generated ones, and time complexity is measured by the average number of model forward…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification