Self-Explaining Structures Improve NLP Models

Zijun Sun; Chun Fan; Qinghong Han; Xiaofei Sun; Yuxian Meng; Fei Wu; and Jiwei Li

arXiv:2012.01786·cs.CL·December 10, 2020·25 cites

Self-Explaining Structures Improve NLP Models

Zijun Sun, Chun Fan, Qinghong Han, Xiaofei Sun, Yuxian Meng, Fei Wu, and Jiwei Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces a self-explaining framework for NLP models that enhances interpretability by adding an interpretation layer, which aggregates span information and improves performance on benchmark datasets.

Contribution

The paper proposes a novel self-explaining architecture that integrates interpretability directly into NLP models without sacrificing accuracy.

Findings

01

Achieved state-of-the-art results on SST-5 with 59.1 accuracy.

02

Achieved state-of-the-art results on SNLI with 92.3 accuracy.

03

Model provides direct importance scores for phrases and sentences.

Abstract

Existing approaches to explaining deep learning models in NLP usually suffer from two major drawbacks: (1) the main model and the explaining model are decoupled: an additional probing or surrogate model is used to interpret an existing model, and thus existing explaining tools are not self-explainable; (2) the probing model is only able to explain a model's predictions by operating on low-level features by computing saliency scores for individual words but are clumsy at high-level text units such as phrases, sentences, or paragraphs. To deal with these two issues, in this paper, we propose a simple yet general and effective self-explaining framework for deep learning models in NLP. The key point of the proposed framework is to put an additional layer, as is called by the interpretation layer, on top of any existing NLP model. This layer aggregates the information for each text span,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ShannonAI/Self_Explaining_Structures_Improve_NLP_Models
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Advanced Graph Neural Networks

MethodsInterpretability · Softmax