Depth-Adaptive Graph Recurrent Network for Text Classification
Yijin Liu, Fandong Meng, Yufeng Chen, Jinan Xu, Jie Zhou

TL;DR
This paper introduces a depth-adaptive graph recurrent network for text classification that dynamically allocates computational steps per word and incorporates sequential information, improving performance and efficiency.
Contribution
It proposes a novel depth-adaptive mechanism for S-LSTM that adjusts computation per word and integrates an RNN layer for sequential info, enhancing text classification.
Findings
Significant accuracy improvements over S-LSTM and Transformer.
Effective balance between model speed and accuracy.
Robust performance across 24 diverse datasets.
Abstract
The Sentence-State LSTM (S-LSTM) is a powerful and high efficient graph recurrent network, which views words as nodes and performs layer-wise recurrent steps between them simultaneously. Despite its successes on text representations, the S-LSTM still suffers from two drawbacks. Firstly, given a sentence, certain words are usually more ambiguous than others, and thus more computation steps need to be taken for these difficult words and vice versa. However, the S-LSTM takes fixed computation steps for all words, irrespective of their hardness. The secondary one comes from the lack of sequential information (e.g., word order) that is inherently important for natural language. In this paper, we try to address these issues and propose a depth-adaptive mechanism for the S-LSTM, which allows the model to learn how many computational steps to conduct for different words as required. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Text and Document Classification Technologies · Sentiment Analysis and Opinion Mining
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
