Representational and Behavioral Stability of Truth in Large Language Models

Samantha Dies; Courtney Maynard; Germans Savcisens; Tina Eliassi-Rad

arXiv:2511.19166·cs.CL·January 21, 2026

Representational and Behavioral Stability of Truth in Large Language Models

Samantha Dies, Courtney Maynard, Germans Savcisens, Tina Eliassi-Rad

PDF

Open Access 2 Datasets

TL;DR

This paper introduces P-StaT, a framework for evaluating the stability of truth judgments in large language models under semantic perturbations, revealing that familiarity influences belief stability.

Contribution

The paper presents P-StaT, a novel evaluation method for assessing how semantic framing affects LLMs' belief stability, highlighting the role of epistemic familiarity.

Findings

01

Synthetic content causes more belief retractions than fictional content.

02

Fictional content is more stable and representationally distinct.

03

Epistemic familiarity correlates with belief stability.

Abstract

Large language models (LLMs) are increasingly used as information sources, yet small changes in semantic framing can destabilize their truth judgments. We propose P-StaT (Perturbation Stability of Truth), an evaluation framework for testing belief stability under controlled semantic perturbations in representational and behavioral settings via probing and zero-shot prompting. Across sixteen open-source LLMs and three domains, we compare perturbations involving epistemically familiar Neither statements drawn from well-known fictional contexts (Fictional) to those involving unfamiliar Neither statements not seen in training data (Synthetic). We find a consistent stability hierarchy: Synthetic content aligns closely with factual representations and induces the largest retractions of previously held beliefs, producing up to $32.7%$ retractions in representational evaluations and up to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling · Artificial Intelligence in Healthcare and Education