Evaluating statistical language models as pragmatic reasoners
Benjamin Lipkin, Lionel Wong, Gabriel Grand, Joshua B, Tenenbaum

TL;DR
This paper evaluates large language models' ability to interpret pragmatic language, especially complex utterances involving context, negation, and composition, revealing their strengths and limitations in semantic inference.
Contribution
It demonstrates that LLMs can approximate human-like pragmatic interpretation but face challenges with negation and complex compositional semantics.
Findings
LLMs can infer context-grounded interpretations of pragmatic utterances
LLMs struggle with negation in pragmatic language
Results inform the use of LLMs in semantic and pragmatic parsing
Abstract
The relationship between communicated language and intended meaning is often probabilistic and sensitive to context. Numerous strategies attempt to estimate such a mapping, often leveraging recursive Bayesian models of communication. In parallel, large language models (LLMs) have been increasingly applied to semantic parsing applications, tasked with inferring logical representations from natural language. While existing LLM explorations have been largely restricted to literal language use, in this work, we evaluate the capacity of LLMs to infer the meanings of pragmatic utterances. Specifically, we explore the case of threshold estimation on the gradable adjective ``strong'', contextually conditioned on a strength prior, then extended to composition with qualification, negation, polarity inversion, and class comparison. We find that LLMs can derive context-grounded, human-like…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems
