When Roles Fail: Epistemic Constraints on Advocate Role Fidelity in LLM-Based Political Statement Analysis
Juergen Dietrich

TL;DR
This paper empirically investigates the fidelity of roles assigned to LLMs in political discourse analysis, revealing failure modes and the impact of model choice and language on role maintenance.
Contribution
It introduces an epistemic stance classifier and systematically measures role fidelity, uncovering mechanisms of role failure and effects of model and language choices.
Findings
Role drift occurs due to epistemic constraints and model choice.
Mistral Large outperforms Claude Sonnet in role fidelity by 28 percentage points.
Fact-check provider choice affects role fidelity differently across languages.
Abstract
Democratic discourse analysis systems increasingly rely on multi-agent LLM pipelines in which distinct evaluator models are assigned adversarial roles to generate structured, multi-perspective assessments of political statements. A core assumption is that models will reliably maintain their assigned roles. This paper provides the first systematic empirical test of that assumption using the TRUST pipeline. We develop an epistemic stance classifier that identifies advocate roles from reasoning text without relying on surface vocabulary, and measure role fidelity across 60 political statements (30 English, 30 German) using four metrics: Role Drift Index (RDI), Expected Drift Distance (EDD), Directional Drift Index (DDI), and Entropy-based Role Stability (ERS). We identify two failure modes - the Epistemic Floor Effect (fact-check results create an absolute lower bound below which the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
