SHAP values for Explaining CNN-based Text Classification Models

Wei Zhao; Tarun Joshi; Vijayan N. Nair; and Agus Sudjianto

arXiv:2008.11825·cs.CL·July 12, 2021·28 cites

SHAP values for Explaining CNN-based Text Classification Models

Wei Zhao, Tarun Joshi, Vijayan N. Nair, and Agus Sudjianto

PDF

Open Access

TL;DR

This paper introduces a methodology for computing SHAP values to explain CNN-based text classification models, addressing interpretability challenges in NLP with high-dimensional and structured data, demonstrated on sentiment analysis.

Contribution

It develops a novel approach to compute local and global SHAP scores for CNN models in NLP, enhancing interpretability in complex text classification tasks.

Findings

01

Effective local explanations for CNN text classifiers.

02

Global feature importance scores for NLP models.

03

Application to sentiment analysis demonstrates practical utility.

Abstract

Deep neural networks are increasingly used in natural language processing (NLP) models. However, the need to interpret and explain the results from complex algorithms are limiting their widespread adoption in regulated industries such as banking. There has been recent work on interpretability of machine learning algorithms with structured data. But there are only limited techniques for NLP applications where the problem is more challenging due to the size of the vocabulary, high-dimensional nature, and the need to consider textual coherence and language structure. This paper develops a methodology to compute SHAP values for local explainability of CNN-based text classification models. The approach is also extended to compute global scores to assess the importance of features. The results are illustrated on sentiment analysis of Amazon Electronic Review data.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling · Machine Learning in Healthcare

MethodsInterpretability · Shapley Additive Explanations