Leap-LSTM: Enhancing Long Short-Term Memory for Text Categorization

Ting Huang; Gehui Shen; Zhi-Hong Deng

arXiv:1905.11558·cs.CL·May 29, 2019·5 cites

Leap-LSTM: Enhancing Long Short-Term Memory for Text Categorization

Ting Huang, Gehui Shen, Zhi-Hong Deng

PDF

Open Access 1 Repo

TL;DR

Leap-LSTM is a novel model that improves text categorization by dynamically skipping irrelevant words, leading to faster processing and better accuracy compared to standard LSTM and previous skip models.

Contribution

The paper introduces Leap-LSTM, a dynamic skipping mechanism for LSTM that enhances efficiency and performance in long text processing tasks.

Findings

01

Leap-LSTM reads faster than standard LSTM.

02

Leap-LSTM achieves higher accuracy on multiple datasets.

03

Leap-LSTM offers better performance-efficiency trade-offs.

Abstract

Recurrent Neural Networks (RNNs) are widely used in the field of natural language processing (NLP), ranging from text categorization to question answering and machine translation. However, RNNs generally read the whole text from beginning to end or vice versa sometimes, which makes it inefficient to process long texts. When reading a long document for a categorization task, such as topic categorization, large quantities of words are irrelevant and can be skipped. To this end, we propose Leap-LSTM, an LSTM-enhanced model which dynamically leaps between words while reading texts. At each step, we utilize several feature encoders to extract messages from preceding texts, following texts and the current word, and then determine whether to skip the current word. We evaluate Leap-LSTM on several text categorization tasks: sentiment analysis, news categorization, ontology classification and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ht1221/leap-lstm
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Sentiment Analysis and Opinion Mining · Natural Language Processing Techniques

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory