Implicit N-grams Induced by Recurrence
Xiaobing Sun, Wei Lu

TL;DR
This paper uncovers explainable n-gram-like components within RNN hidden states, demonstrating their role in modeling linguistic phenomena and contributing to RNN performance in NLP tasks, thus enhancing interpretability.
Contribution
It reveals that RNNs contain interpretable n-gram components, offering insights into their internal mechanisms and inspiring new architecture designs for sequential data.
Findings
Extracted n-gram features from RNNs relate to negation and intensification.
N-gram components alone can effectively perform sentiment analysis and language modeling.
These components significantly contribute to RNNs' overall performance.
Abstract
Although self-attention based models such as Transformers have achieved remarkable successes on natural language processing (NLP) tasks, recent studies reveal that they have limitations on modeling sequential transformations (Hahn, 2020), which may prompt re-examinations of recurrent neural networks (RNNs) that demonstrated impressive results on handling sequential data. Despite many prior attempts to interpret RNNs, their internal mechanisms have not been fully understood, and the question on how exactly they capture sequential features remains largely unclear. In this work, we present a study that shows there actually exist some explainable components that reside within the hidden states, which are reminiscent of the classical n-grams features. We evaluated such extracted explainable features from trained RNNs on downstream sentiment analysis tasks and found they could be used to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Sentiment Analysis and Opinion Mining
