Beyond Word Importance: Contextual Decomposition to Extract Interactions from LSTMs
W. James Murdoch, Peter J. Liu, Bin Yu

TL;DR
This paper introduces contextual decomposition (CD), a novel interpretation method for LSTMs that reveals how word combinations influence predictions, enabling extraction of complex interactions like negations in sentiment analysis.
Contribution
The paper presents CD as a model-agnostic technique to interpret LSTM predictions by decomposing outputs into contributions of word interactions, including negations.
Findings
CD reliably identifies contrasting sentiment words and phrases.
CD successfully extracts negations from LSTM predictions.
Method improves understanding of LSTM decision processes.
Abstract
The driving force behind the recent success of LSTMs has been their ability to learn complex and non-linear relationships. Consequently, our inability to describe these relationships has led to LSTMs being characterized as black boxes. To this end, we introduce contextual decomposition (CD), an interpretation algorithm for analysing individual predictions made by standard LSTMs, without any changes to the underlying model. By decomposing the output of a LSTM, CD captures the contributions of combinations of words or variables to the final prediction of an LSTM. On the task of sentiment analysis with the Yelp and SST data sets, we show that CD is able to reliably identify words and phrases of contrasting sentiment, and how they are combined to yield the LSTM's final prediction. Using the phrase-level labels in SST, we also demonstrate that CD is able to successfully extract positive and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Sentiment Analysis and Opinion Mining
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
