DRAGNN: A Transition-based Framework for Dynamically Connected Neural Networks
Lingpeng Kong, Chris Alberti, Daniel Andor, Ivan Bogatyy, David Weiss

TL;DR
DRAGNN introduces a flexible, modular framework for constructing dynamic recurrent neural networks with discrete state dynamics, improving accuracy and efficiency in tasks like parsing and summarization.
Contribution
The paper presents DRAGNN, a novel framework utilizing Transition Based Recurrent Units (TBRUs) for dynamic, modular neural network construction, enabling better multi-task learning and structural flexibility.
Findings
More accurate syntactic dependency parsing
Enhanced multi-task learning for summarization
Significant efficiency improvements over seq2seq models
Abstract
In this work, we present a compact, modular framework for constructing novel recurrent neural architectures. Our basic module is a new generic unit, the Transition Based Recurrent Unit (TBRU). In addition to hidden layer activations, TBRUs have discrete state dynamics that allow network connections to be built dynamically as a function of intermediate activations. By connecting multiple TBRUs, we can extend and combine commonly used architectures such as sequence-to-sequence, attention mechanisms, and re-cursive tree-structured models. A TBRU can also serve as both an encoder for downstream tasks and as a decoder for its own task simultaneously, resulting in more accurate multi-task learning. We call our approach Dynamic Recurrent Acyclic Graphical Neural Networks, or DRAGNN. We show that DRAGNN is significantly more accurate and efficient than seq2seq with attention for syntactic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence
