EvalxNLP: A Framework for Benchmarking Post-Hoc Explainability Methods   on NLP Models

Mahdi Dhaini; Kafaite Zahra Hussain; Efstratios Zaradoukas; Gjergji; Kasneci

arXiv:2505.01238·cs.CL·May 5, 2025

EvalxNLP: A Framework for Benchmarking Post-Hoc Explainability Methods on NLP Models

Mahdi Dhaini, Kafaite Zahra Hussain, Efstratios Zaradoukas, Gjergji, Kasneci

PDF

Open Access 1 Repo

TL;DR

EvalxNLP is a comprehensive Python framework that benchmarks and evaluates various explainability methods for transformer-based NLP models, aiding stakeholders in selecting suitable explanations.

Contribution

It introduces a unified platform integrating eight explainability techniques and human evaluation, facilitating systematic comparison and advancement of XAI methods in NLP.

Findings

01

High user satisfaction with EvalxNLP's explanations

02

Effective benchmarking of multiple explainability methods

03

Enhanced understanding of explanation properties in NLP

Abstract

As Natural Language Processing (NLP) models continue to evolve and become integral to high-stakes applications, ensuring their interpretability remains a critical challenge. Given the growing variety of explainability methods and diverse stakeholder requirements, frameworks that help stakeholders select appropriate explanations tailored to their specific use cases are increasingly important. To address this need, we introduce EvalxNLP, a Python framework for benchmarking state-of-the-art feature attribution methods for transformer-based NLP models. EvalxNLP integrates eight widely recognized explainability techniques from the Explainable AI (XAI) literature, enabling users to generate and evaluate explanations based on key properties such as faithfulness, plausibility, and complexity. Our framework also provides interactive, LLM-based textual explanations, facilitating user…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dmah10/EvalxNLP
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling