DRAGNN: A Transition-based Framework for Dynamically Connected Neural   Networks

Lingpeng Kong; Chris Alberti; Daniel Andor; Ivan Bogatyy; David Weiss

arXiv:1703.04474·cs.CL·March 14, 2017·31 cites

DRAGNN: A Transition-based Framework for Dynamically Connected Neural Networks

Lingpeng Kong, Chris Alberti, Daniel Andor, Ivan Bogatyy, David Weiss

PDF

Open Access 1 Repo

TL;DR

DRAGNN introduces a flexible, modular framework for constructing dynamic recurrent neural networks with discrete state dynamics, improving accuracy and efficiency in tasks like parsing and summarization.

Contribution

The paper presents DRAGNN, a novel framework utilizing Transition Based Recurrent Units (TBRUs) for dynamic, modular neural network construction, enabling better multi-task learning and structural flexibility.

Findings

01

More accurate syntactic dependency parsing

02

Enhanced multi-task learning for summarization

03

Significant efficiency improvements over seq2seq models

Abstract

In this work, we present a compact, modular framework for constructing novel recurrent neural architectures. Our basic module is a new generic unit, the Transition Based Recurrent Unit (TBRU). In addition to hidden layer activations, TBRUs have discrete state dynamics that allow network connections to be built dynamically as a function of intermediate activations. By connecting multiple TBRUs, we can extend and combine commonly used architectures such as sequence-to-sequence, attention mechanisms, and re-cursive tree-structured models. A TBRU can also serve as both an encoder for downstream tasks and as a decoder for its own task simultaneously, resulting in more accurate multi-task learning. We call our approach Dynamic Recurrent Acyclic Graphical Neural Networks, or DRAGNN. We show that DRAGNN is significantly more accurate and efficient than seq2seq with attention for syntactic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tensorflow/models/tree/master/research/syntaxnet
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence