Implicit N-grams Induced by Recurrence

Xiaobing Sun; Wei Lu

arXiv:2205.02724·cs.CL·May 6, 2022

Implicit N-grams Induced by Recurrence

Xiaobing Sun, Wei Lu

PDF

Open Access 1 Repo

TL;DR

This paper uncovers explainable n-gram-like components within RNN hidden states, demonstrating their role in modeling linguistic phenomena and contributing to RNN performance in NLP tasks, thus enhancing interpretability.

Contribution

It reveals that RNNs contain interpretable n-gram components, offering insights into their internal mechanisms and inspiring new architecture designs for sequential data.

Findings

01

Extracted n-gram features from RNNs relate to negation and intensification.

02

N-gram components alone can effectively perform sentiment analysis and language modeling.

03

These components significantly contribute to RNNs' overall performance.

Abstract

Although self-attention based models such as Transformers have achieved remarkable successes on natural language processing (NLP) tasks, recent studies reveal that they have limitations on modeling sequential transformations (Hahn, 2020), which may prompt re-examinations of recurrent neural networks (RNNs) that demonstrated impressive results on handling sequential data. Despite many prior attempts to interpret RNNs, their internal mechanisms have not been fully understood, and the question on how exactly they capture sequential features remains largely unclear. In this work, we present a study that shows there actually exist some explainable components that reside within the hidden states, which are reminiscent of the classical n-grams features. We evaluated such extracted explainable features from trained RNNs on downstream sentiment analysis tasks and found they could be used to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

richardsun-voyager/inibr
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Sentiment Analysis and Opinion Mining