Do Language Models Understand Anything? On the Ability of LSTMs to Understand Negative Polarity Items
Jaap Jumelet, Dieuwke Hupkes

TL;DR
This study investigates whether LSTM language models can understand negative polarity items by analyzing their ability to recognize licensing contexts, linking neural processing to linguistic theory.
Contribution
It demonstrates that LSTMs can identify licensing contexts for negative polarity items and recognize their scope, bridging formal linguistics and neural network understanding.
Findings
LSTMs find relations between licensing contexts and polarity items
The model recognizes the scope of licensing contexts from parse trees
Results suggest some level of linguistic awareness in neural models
Abstract
In this paper, we attempt to link the inner workings of a neural language model to linguistic theory, focusing on a complex phenomenon well discussed in formal linguis- tics: (negative) polarity items. We briefly discuss the leading hypotheses about the licensing contexts that allow negative polarity items and evaluate to what extent a neural language model has the ability to correctly process a subset of such constructions. We show that the model finds a relation between the licensing context and the negative polarity item and appears to be aware of the scope of this context, which we extract from a parse tree of the sentence. With this research, we hope to pave the way for other studies linking formal linguistics to deep learning.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
