How much complexity does an RNN architecture need to learn syntax-sensitive dependencies?

Gantavya Bhatt; Hritik Bansal; Rishubh Singh; Sumeet Agarwal

arXiv:2005.08199·cs.CL·May 26, 2020

TL;DR

This paper introduces the Decay RNN, a biologically inspired architecture that effectively captures syntax-sensitive dependencies, bridging the gap between biological plausibility and linguistic competence, and performing competitively with LSTMs.

Contribution

The paper proposes the Decay RNN, a new architecture incorporating neuronal decay and excitatory/inhibitory dynamics, improving modeling of linguistic dependencies.
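
The summary above does not spell out the update rule, so the following is only a minimal sketch of how a decay term and sign-constrained (excitatory/inhibitory) recurrent weights could be combined in a single recurrent step. The blend factor `alpha`, the `tanh` nonlinearity, the use of `abs` to constrain weight signs, the 80/20 excitatory split, and every function name are assumptions made for illustration, not the authors' implementation; the linked repository is the authoritative reference.

```python
import math
import random


def make_decay_cell(n_in, n_hidden, alpha=0.8, frac_excitatory=0.8, seed=0):
    """Build parameters for a toy decay-style recurrent cell (all values hypothetical)."""
    rng = random.Random(seed)
    n_exc = int(round(frac_excitatory * n_hidden))
    # +1 marks an excitatory unit, -1 an inhibitory one (assumed split).
    signs = [1.0] * n_exc + [-1.0] * (n_hidden - n_exc)
    W = [[rng.uniform(-0.5, 0.5) for _ in range(n_hidden)] for _ in range(n_hidden)]
    U = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_hidden)]
    b = [0.0] * n_hidden
    return {"alpha": alpha, "signs": signs, "W": W, "U": U, "b": b}


def effective_recurrent(cell):
    """Force every outgoing weight of unit j (column j) to carry that unit's sign."""
    return [
        [abs(w_ij) * cell["signs"][j] for j, w_ij in enumerate(row)]
        for row in cell["W"]
    ]


def step(cell, h_prev, x):
    """One update: keep a fraction alpha of the old state, blend in the new drive."""
    W_eff = effective_recurrent(cell)
    alpha = cell["alpha"]
    h_new = []
    for i in range(len(h_prev)):
        drive = cell["b"][i]
        drive += sum(W_eff[i][j] * h_prev[j] for j in range(len(h_prev)))
        drive += sum(cell["U"][i][k] * x[k] for k in range(len(x)))
        # Decayed previous activation plus a (1 - alpha) share of the new activation.
        h_new.append(alpha * h_prev[i] + (1.0 - alpha) * math.tanh(drive))
    return h_new


def run(cell, inputs):
    """Feed a sequence of input vectors through the cell, starting from a zero state."""
    h = [0.0] * len(cell["b"])
    for x in inputs:
        h = step(cell, h, x)
    return h
```

Under these assumptions, `run(make_decay_cell(3, 5), [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]])` returns five hidden activations, each bounded in [-1, 1]. Setting `alpha` close to 1 makes the state change slowly, while `alpha = 0` reduces the step to a plain sign-constrained recurrent update.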

Findings

01

The Decay RNN performs competitively with LSTMs on subject-verb agreement tasks.

02

It achieves competitive results on grammaticality and language modeling.

03

The model offers insights into biologically plausible neural architectures for language.

Abstract

Long short-term memory (LSTM) networks and their variants are capable of encapsulating long-range dependencies, which is evident from their performance on a variety of linguistic tasks. On the other hand, simple recurrent networks (SRNs), which appear more biologically grounded in terms of synaptic connections, have generally been less successful at capturing long-range dependencies as well as the loci of grammatical errors in an unsupervised setting. In this paper, we seek to develop models that bridge the gap between biological plausibility and linguistic competence. We propose a new architecture, the Decay RNN, which incorporates the decaying nature of neuronal activations and models the excitatory and inhibitory connections in a population of neurons. Besides its biological inspiration, our model also shows competitive performance relative to LSTMs on subject-verb agreement,…

Code & Models

Repositories

bhattg/Decay-RNN-ACL-SRW2020 · PyTorch · Official