On the Evaluation of the Plausibility and Faithfulness of Sentiment   Analysis Explanations

Julia El Zini; Mohamad Mansour; Basel Mousi; and Mariette Awad

arXiv:2210.06916·cs.CL·October 14, 2022

On the Evaluation of the Plausibility and Faithfulness of Sentiment Analysis Explanations

Julia El Zini, Mohamad Mansour, Basel Mousi, and Mariette Awad

PDF

TL;DR

This paper proposes a unified evaluation framework for sentiment analysis explanations, assessing faithfulness and plausibility through new metrics, and compares different models and explanation methods, highlighting the superior explainability of transformer architectures.

Contribution

It introduces novel metrics inspired by information retrieval to evaluate sentiment analysis explanations, and provides empirical insights into the performance of various architectures and explanation methods.

Findings

01

Anchors explanations align better with human judgment.

02

Reasoning complexity reduces explanation quality.

03

Transformers outperform other architectures in explainability.

Abstract

Current Explainable AI (ExAI) methods, especially in the NLP field, are conducted on various datasets by employing different metrics to evaluate several aspects. The lack of a common evaluation framework is hindering the progress tracking of such methods and their wider adoption. In this work, inspired by offline information retrieval, we propose different metrics and techniques to evaluate the explainability of SA models from two angles. First, we evaluate the strength of the extracted "rationales" in faithfully explaining the predicted outcome. Second, we measure the agreement between ExAI methods and human judgment on a homegrown dataset1 to reflect on the rationales plausibility. Our conducted experiments comprise four dimensions: (1) the underlying architectures of SA models, (2) the approach followed by the ExAI method, (3) the reasoning difficulty, and (4) the homogeneity of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.