PragWorld: A Benchmark Evaluating LLMs' Local World Model under Minimal Linguistic Alterations and Conversational Dynamics
Sachin Vashistha, Aryan Bibhuti, Atharva Naik, Martin Tutek, Somak Aditya

TL;DR
This paper evaluates how well language models can maintain and update internal world models during conversations, especially under minimal linguistic changes, revealing their struggles and proposing interpretability and fine-tuning solutions.
Contribution
It introduces benchmarks for assessing LLMs' ability to encode and update conversational world models and proposes interpretability and regularization methods to improve their robustness.
Findings
LLMs struggle to accurately track entities under linguistic alterations.
Transformer layers can be identified as useful or harmful for maintaining conversation details.
Layer-regularization fine-tuning improves LLM robustness in conversational tasks.
Abstract
Real-world conversations are rich with pragmatic elements, such as entity mentions, references, and implicatures. Understanding such nuances is a requirement for successful natural communication, and often requires building a local world model which encodes such elements and captures the dynamics of their evolving states. However, it is not well-understood whether language models (LMs) construct or maintain a robust implicit representation of conversations. In this work, we evaluate the ability of LMs to encode and update their internal world model in dyadic conversations and test their malleability under linguistic alterations. To facilitate this, we apply seven minimal linguistic alterations to conversations sourced from popular datasets and construct two benchmarks comprising yes-no questions. We evaluate a wide range of open and closed source LMs and observe that they struggle to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Machine Learning in Healthcare
