Explaining the Deep Natural Language Processing by Mining Textual   Interpretable Features

Francesco Ventura; Salvatore Greco; Daniele Apiletti; Tania; Cerquitelli

arXiv:2106.06697·cs.CL·June 15, 2021

Explaining the Deep Natural Language Processing by Mining Textual Interpretable Features

Francesco Ventura, Salvatore Greco, Daniele Apiletti, Tania, Cerquitelli

PDF

Open Access

TL;DR

This paper introduces T-EBAnO, a new explainability framework for deep NLP models that extracts interpretable features and quantifies their influence to make model decisions more transparent and human-readable.

Contribution

The paper presents T-EBAnO, a novel framework that provides local and global explanations for deep NLP models using new influence indexes and interpretable feature extraction.

Findings

01

Effective explanations for BERT in sentiment analysis

02

Accurate interpretation of LSTM in toxic comment classification

03

Demonstrated improved transparency of deep NLP models

Abstract

Despite the high accuracy offered by state-of-the-art deep natural-language models (e.g. LSTM, BERT), their application in real-life settings is still widely limited, as they behave like a black-box to the end-user. Hence, explainability is rapidly becoming a fundamental requirement of future-generation data-driven systems based on deep-learning approaches. Several attempts to fulfill the existing gap between accuracy and interpretability have been done. However, robust and specialized xAI (Explainable Artificial Intelligence) solutions tailored to deep natural-language models are still missing. We propose a new framework, named T-EBAnO, which provides innovative prediction-local and class-based model-global explanation strategies tailored to black-box deep natural-language models. Given a deep NLP model and the textual input data, T-EBAnO provides an objective, human-readable,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling · Adversarial Robustness in Machine Learning

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Adam · Refunds@Expedia|||How do I get a full refund from Expedia? · Tanh Activation · Linear Warmup With Linear Decay · Residual Connection · WordPiece · Attention Dropout