The Pragmatic Mind of Machines: Tracing the Emergence of Pragmatic Competence in Large Language Models

Kefan Yu; Qingcheng Zeng; Weihao Xuan; Wanxin Li; Jingyi Wu; Rob Voigt

arXiv:2505.18497·cs.CL·January 13, 2026

The Pragmatic Mind of Machines: Tracing the Emergence of Pragmatic Competence in Large Language Models

Kefan Yu, Qingcheng Zeng, Weihao Xuan, Wanxin Li, Jingyi Wu, Rob Voigt

PDF

Open Access 1 Datasets 1 Video

TL;DR

This paper investigates how large language models develop pragmatic understanding during training, introducing a new dataset and evaluating models at different stages to reveal emergent pragmatic competence aligned with human communication.

Contribution

The study introduces ALTPRAG, a novel dataset for assessing pragmatic inference in LLMs, and systematically evaluates how pragmatic skills emerge and improve through training stages.

Findings

01

Pragmatic competence improves with model size and training data scale.

02

Supervised fine-tuning and reinforcement learning further enhance pragmatic understanding.

03

Base models already show sensitivity to pragmatic cues, which grows with training.

Abstract

Current large language models (LLMs) have demonstrated emerging capabilities in social intelligence tasks, including implicature resolution and theory-of-mind reasoning, both of which require substantial pragmatic understanding. However, how LLMs acquire this pragmatic competence throughout the training process remains poorly understood. In this work, we introduce ALTPRAG, a dataset grounded in the pragmatic concept of alternatives, to evaluate whether LLMs at different training stages can accurately infer nuanced speaker intentions. Each instance pairs two equally plausible yet pragmatically divergent continuations and requires the model to (i) infer the speaker's intended meaning and (ii) explain when and why a speaker would choose one utterance over its alternative, thus directly probing pragmatic competence through contrastive reasoning. We systematically evaluate 22 LLMs across 3…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Huangtubaye233/AltPrag
dataset· 10 dl
10 dl

Videos

The Pragmatic Mind of Machines: Tracing the Emergence of Pragmatic Competence in Large Language Models· underline

Taxonomy

TopicsNatural Language Processing Techniques

MethodsShrink and Fine-Tune · Balanced Selection