Do Language Models Understand Anything? On the Ability of LSTMs to   Understand Negative Polarity Items

Jaap Jumelet; Dieuwke Hupkes

arXiv:1808.10627·cs.CL·September 3, 2018

Do Language Models Understand Anything? On the Ability of LSTMs to Understand Negative Polarity Items

Jaap Jumelet, Dieuwke Hupkes

PDF

TL;DR

This study investigates whether LSTM language models can understand negative polarity items by analyzing their ability to recognize licensing contexts, linking neural processing to linguistic theory.

Contribution

It demonstrates that LSTMs can identify licensing contexts for negative polarity items and recognize their scope, bridging formal linguistics and neural network understanding.

Findings

01

LSTMs find relations between licensing contexts and polarity items

02

The model recognizes the scope of licensing contexts from parse trees

03

Results suggest some level of linguistic awareness in neural models

Abstract

In this paper, we attempt to link the inner workings of a neural language model to linguistic theory, focusing on a complex phenomenon well discussed in formal linguis- tics: (negative) polarity items. We briefly discuss the leading hypotheses about the licensing contexts that allow negative polarity items and evaluate to what extent a neural language model has the ability to correctly process a subset of such constructions. We show that the model finds a relation between the licensing context and the negative polarity item and appears to be aware of the scope of this context, which we extract from a parse tree of the sentence. With this research, we hope to pave the way for other studies linking formal linguistics to deep learning.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.