TL;DR
This paper introduces the Decay RNN, a biologically inspired architecture that effectively captures syntax-sensitive dependencies, bridging the gap between biological plausibility and linguistic competence, and performing competitively with LSTMs.
Contribution
The paper proposes the Decay RNN, a new architecture incorporating neuronal decay and excitatory/inhibitory dynamics, improving modeling of linguistic dependencies.
Findings
Decoy RNN performs well on subject-verb agreement tasks.
It achieves competitive results on grammaticality and language modeling.
The model offers insights into biologically plausible neural architectures for language.
Abstract
Long short-term memory (LSTM) networks and their variants are capable of encapsulating long-range dependencies, which is evident from their performance on a variety of linguistic tasks. On the other hand, simple recurrent networks (SRNs), which appear more biologically grounded in terms of synaptic connections, have generally been less successful at capturing long-range dependencies as well as the loci of grammatical errors in an unsupervised setting. In this paper, we seek to develop models that bridge the gap between biological plausibility and linguistic competence. We propose a new architecture, the Decay RNN, which incorporates the decaying nature of neuronal activations and models the excitatory and inhibitory connections in a population of neurons. Besides its biological inspiration, our model also shows competitive performance relative to LSTMs on subject-verb agreement,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
