Evaluating statistical language models as pragmatic reasoners

Benjamin Lipkin; Lionel Wong; Gabriel Grand; Joshua B; Tenenbaum

arXiv:2305.01020·cs.CL·May 3, 2023·5 cites

Evaluating statistical language models as pragmatic reasoners

Benjamin Lipkin, Lionel Wong, Gabriel Grand, Joshua B, Tenenbaum

PDF

Open Access 1 Repo

TL;DR

This paper evaluates large language models' ability to interpret pragmatic language, especially complex utterances involving context, negation, and composition, revealing their strengths and limitations in semantic inference.

Contribution

It demonstrates that LLMs can approximate human-like pragmatic interpretation but face challenges with negation and complex compositional semantics.

Findings

01

LLMs can infer context-grounded interpretations of pragmatic utterances

02

LLMs struggle with negation in pragmatic language

03

Results inform the use of LLMs in semantic and pragmatic parsing

Abstract

The relationship between communicated language and intended meaning is often probabilistic and sensitive to context. Numerous strategies attempt to estimate such a mapping, often leveraging recursive Bayesian models of communication. In parallel, large language models (LLMs) have been increasingly applied to semantic parsing applications, tasked with inferring logical representations from natural language. While existing LLM explorations have been largely restricted to literal language use, in this work, we evaluate the capacity of LLMs to infer the meanings of pragmatic utterances. Specifically, we explore the case of threshold estimation on the gradable adjective ``strong'', contextually conditioned on a strength prior, then extended to composition with qualification, negation, polarity inversion, and class comparison. We find that LLMs can derive context-grounded, human-like…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

benlipkin/probsem
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems