On the Granularity of Explanations in Model Agnostic NLP   Interpretability

Yves Rychener; Xavier Renard; Djam\'e Seddah; Pascal Frossard; Marcin; Detyniecki

arXiv:2012.13189·cs.CL·August 9, 2022·1 cites

On the Granularity of Explanations in Model Agnostic NLP Interpretability

Yves Rychener, Xavier Renard, Djam\'e Seddah, Pascal Frossard, Marcin, Detyniecki

PDF

Open Access 1 Repo

TL;DR

This paper proposes using sentence segments instead of individual words for interpreting BERT-based NLP models, improving fidelity and addressing limitations of current word-based explanation methods like LIME and SHAP.

Contribution

It introduces a segmentation-based approach for model-agnostic NLP interpretability, demonstrating significant improvements over word-based methods.

Findings

01

Sentence segmentation improves explanation fidelity.

02

Segment-based explanations are more computationally efficient.

03

The approach outperforms traditional word-based methods on benchmark tasks.

Abstract

Current methods for Black-Box NLP interpretability, like LIME or SHAP, are based on altering the text to interpret by removing words and modeling the Black-Box response. In this paper, we outline limitations of this approach when using complex BERT-based classifiers: The word-based sampling produces texts that are out-of-distribution for the classifier and further gives rise to a high-dimensional search space, which can't be sufficiently explored when time or computation power is limited. Both of these challenges can be addressed by using segments as elementary building blocks for NLP interpretability. As illustration, we show that the simple choice of sentences greatly improves on both of these challenges. As a consequence, the resulting explainer attains much better fidelity on a benchmark classification task.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

axa-rev-research/gutek
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Natural Language Processing Techniques

MethodsLinear Layer · Softmax · WordPiece · Linear Warmup With Linear Decay · Adam · Dense Connections · Refunds@Expedia|||How do I get a full refund from Expedia? · Attention Dropout · Layer Normalization · Attention Is All You Need