On the Evaluation of the Plausibility and Faithfulness of Sentiment Analysis Explanations
Julia El Zini, Mohamad Mansour, Basel Mousi, and Mariette Awad

TL;DR
This paper proposes a unified evaluation framework for sentiment analysis explanations, assessing faithfulness and plausibility through new metrics, and compares different models and explanation methods, highlighting the superior explainability of transformer architectures.
Contribution
It introduces novel metrics inspired by information retrieval to evaluate sentiment analysis explanations, and provides empirical insights into the performance of various architectures and explanation methods.
Findings
Anchors explanations align better with human judgment.
Reasoning complexity reduces explanation quality.
Transformers outperform other architectures in explainability.
Abstract
Current Explainable AI (ExAI) methods, especially in the NLP field, are conducted on various datasets by employing different metrics to evaluate several aspects. The lack of a common evaluation framework is hindering the progress tracking of such methods and their wider adoption. In this work, inspired by offline information retrieval, we propose different metrics and techniques to evaluate the explainability of SA models from two angles. First, we evaluate the strength of the extracted "rationales" in faithfully explaining the predicted outcome. Second, we measure the agreement between ExAI methods and human judgment on a homegrown dataset1 to reflect on the rationales plausibility. Our conducted experiments comprise four dimensions: (1) the underlying architectures of SA models, (2) the approach followed by the ExAI method, (3) the reasoning difficulty, and (4) the homogeneity of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
