Do Language Models Know Theo Has a Wife? Investigating the Proviso Problem
Tara Azin, Daniel Dumitrescu, Diana Inkpen, Raj Singh

TL;DR
This paper examines how language models handle the proviso problem in pragmatics, reformulating it as an NLI task, and finds they rely on shallow patterns rather than true pragmatic reasoning.
Contribution
Introduces the first computational framework and diagnostic dataset for evaluating the proviso problem in language models.
Findings
Models broadly align with human judgments
Models rely on shallow pattern matching
Highlights need for diagnostic approaches
Abstract
We investigate how language models handle the proviso problem, an unresolved issue in pragmatics where presuppositions in conditional sentences diverge between theoretical and human interpretations. We reformulate this phenomenon as a Natural Language Inference task and introduce a diagnostic dataset designed to probe presupposition projection in conditionals. We evaluate RoBERTa, DeBERTa, LLaMA, and Gemma using explainability analyses. The results show that models broadly align with human judgments but rely on shallow pattern matching rather than semantic or pragmatic reasoning. Our work provides the first computational evaluation framework for the proviso problem and highlights the need for diagnostic, multi-method approaches to assess pragmatic competence and context-dependent meaning in language models.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications
