Explaining the Deep Natural Language Processing by Mining Textual Interpretable Features
Francesco Ventura, Salvatore Greco, Daniele Apiletti, Tania, Cerquitelli

TL;DR
This paper introduces T-EBAnO, a new explainability framework for deep NLP models that extracts interpretable features and quantifies their influence to make model decisions more transparent and human-readable.
Contribution
The paper presents T-EBAnO, a novel framework that provides local and global explanations for deep NLP models using new influence indexes and interpretable feature extraction.
Findings
Effective explanations for BERT in sentiment analysis
Accurate interpretation of LSTM in toxic comment classification
Demonstrated improved transparency of deep NLP models
Abstract
Despite the high accuracy offered by state-of-the-art deep natural-language models (e.g. LSTM, BERT), their application in real-life settings is still widely limited, as they behave like a black-box to the end-user. Hence, explainability is rapidly becoming a fundamental requirement of future-generation data-driven systems based on deep-learning approaches. Several attempts to fulfill the existing gap between accuracy and interpretability have been done. However, robust and specialized xAI (Explainable Artificial Intelligence) solutions tailored to deep natural-language models are still missing. We propose a new framework, named T-EBAnO, which provides innovative prediction-local and class-based model-global explanation strategies tailored to black-box deep natural-language models. Given a deep NLP model and the textual input data, T-EBAnO provides an objective, human-readable,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling · Adversarial Robustness in Machine Learning
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Adam · Refunds@Expedia|||How do I get a full refund from Expedia? · Tanh Activation · Linear Warmup With Linear Decay · Residual Connection · WordPiece · Attention Dropout
