The Language Model Understood the Prompt was Ambiguous: Probing Syntactic Uncertainty Through Generation
Laura Aina, Tal Linzen

TL;DR
This paper investigates how neural language models handle temporary syntactic ambiguities by generating multiple sentence completions and estimating their uncertainty, revealing their ability to track multiple analyses and respond to cues.
Contribution
It introduces a novel probing method using stochastic decoding to assess syntactic uncertainty in language models, allowing exploration of completions beyond predefined hypotheses.
Findings
Language models can track multiple syntactic analyses simultaneously.
Uncertainty levels vary depending on construction and context.
Models often select correct interpretations when disambiguating cues are present.
Abstract
Temporary syntactic ambiguities arise when the beginning of a sentence is compatible with multiple syntactic analyses. We inspect to which extent neural language models (LMs) exhibit uncertainty over such analyses when processing temporarily ambiguous inputs, and how that uncertainty is modulated by disambiguating cues. We probe the LM's expectations by generating from it: we use stochastic decoding to derive a set of sentence completions, and estimate the probability that the LM assigns to each interpretation based on the distribution of parses across completions. Unlike scoring-based methods for targeted syntactic evaluation, this technique makes it possible to explore completions that are not hypothesized in advance by the researcher. We apply this method to study the behavior of two LMs (GPT2 and an LSTM) on three types of temporary ambiguity, using materials from human sentence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
