Depth-Adaptive Graph Recurrent Network for Text Classification

Yijin Liu; Fandong Meng; Yufeng Chen; Jinan Xu; Jie Zhou

arXiv:2003.00166·cs.CL·March 3, 2020·1 cites

Depth-Adaptive Graph Recurrent Network for Text Classification

Yijin Liu, Fandong Meng, Yufeng Chen, Jinan Xu, Jie Zhou

PDF

Open Access 1 Repo

TL;DR

This paper introduces a depth-adaptive graph recurrent network for text classification that dynamically allocates computational steps per word and incorporates sequential information, improving performance and efficiency.

Contribution

It proposes a novel depth-adaptive mechanism for S-LSTM that adjusts computation per word and integrates an RNN layer for sequential info, enhancing text classification.

Findings

01

Significant accuracy improvements over S-LSTM and Transformer.

02

Effective balance between model speed and accuracy.

03

Robust performance across 24 diverse datasets.

Abstract

The Sentence-State LSTM (S-LSTM) is a powerful and high efficient graph recurrent network, which views words as nodes and performs layer-wise recurrent steps between them simultaneously. Despite its successes on text representations, the S-LSTM still suffers from two drawbacks. Firstly, given a sentence, certain words are usually more ambiguous than others, and thus more computation steps need to be taken for these difficult words and vice versa. However, the S-LSTM takes fixed computation steps for all words, irrespective of their hardness. The secondary one comes from the lack of sequential information (e.g., word order) that is inherently important for natural language. In this paper, we try to address these issues and propose a depth-adaptive mechanism for the S-LSTM, which allows the model to learn how many computational steps to conduct for different words as required. In…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Adaxry/Depth-Adaptive-GRN
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Text and Document Classification Technologies · Sentiment Analysis and Opinion Mining

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory