Understanding Convolutional Neural Networks for Text Classification

Alon Jacovi; Oren Sar Shalom; Yoav Goldberg

arXiv:1809.08037·cs.CL·April 29, 2020·24 cites

Understanding Convolutional Neural Networks for Text Classification

Alon Jacovi, Oren Sar Shalom, Yoav Goldberg

PDF

Open Access 2 Repos

TL;DR

This paper investigates how CNNs process text for classification, revealing that filters detect multiple semantic n-grams and that max-pooling isolates important features, enhancing interpretability.

Contribution

It provides a detailed analysis of CNN filter behavior in text classification, bridging interpretability gaps between vision and NLP models.

Findings

01

Filters capture multiple semantic classes of n-grams.

02

Global max-pooling separates important n-grams from less relevant ones.

03

Practical interpretability methods for CNNs in NLP are demonstrated.

Abstract

We present an analysis into the inner workings of Convolutional Neural Networks (CNNs) for processing text. CNNs used for computer vision can be interpreted by projecting filters into image space, but for discrete sequence inputs CNNs remain a mystery. We aim to understand the method by which the networks process and classify text. We examine common hypotheses to this problem: that filters, accompanied by global max-pooling, serve as ngram detectors. We show that filters may capture several different semantic classes of ngrams by using different activation patterns, and that global max-pooling induces behavior which separates important ngrams from the rest. Finally, we show practical use cases derived from our findings in the form of model interpretability (explaining a trained model by deriving a concrete identity for each filter, bridging the gap between visualization tools in vision…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling · Anomaly Detection Techniques and Applications

MethodsInterpretability