RNNs as psycholinguistic subjects: Syntactic state and grammatical dependency
Richard Futrell, Ethan Wilcox, Takashi Morita, Roger Levy

TL;DR
This paper investigates how well RNNs, specifically LSTMs, capture syntactic states and grammatical dependencies in language, revealing they partially mimic human behavior but lack full grammatical understanding.
Contribution
It introduces controlled psycholinguistic experiments to analyze RNNs' syntactic and dependency representations, including a new Japanese LSTM model.
Findings
RNNs represent incremental syntactic state but with differences from humans.
Models fail to learn correct grammatical dependency configurations for reflexives and polarity items.
Japanese LSTM shows similar syntactic state representation as English models.
Abstract
Recurrent neural networks (RNNs) are the state of the art in sequence modeling for natural language. However, it remains poorly understood what grammatical characteristics of natural language they implicitly learn and represent as a consequence of optimizing the language modeling objective. Here we deploy the methods of controlled psycholinguistic experimentation to shed light on to what extent RNN behavior reflects incremental syntactic state and grammatical dependency representations known to characterize human linguistic behavior. We broadly test two publicly available long short-term memory (LSTM) English sequence models, and learn and test a new Japanese LSTM. We demonstrate that these models represent and maintain incremental syntactic state, but that they do not always generalize in the same way as humans. Furthermore, none of our models learn the appropriate grammatical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Neurobiology of Language and Bilingualism
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
