Three Regimes of Context-Parametric Conflict: A Predictive Framework and Empirical Validation
Pruthvinath Jeripity Venkata

TL;DR
This paper introduces a three-regime framework to explain how large language models handle conflicting information, validated through extensive empirical testing across multiple models and experimental conditions.
Contribution
It formalizes a novel three-regime framework distinguishing different processing situations and empirically validates it across diverse models and experimental setups.
Findings
Models exhibit a certainty gradient consistent with the Regime 2 prediction.
Task framing can flip context-following behavior from near-100% to very low levels.
Parametric strength and uniqueness are orthogonal dimensions influencing model behavior.
Abstract
The literature on how large language models handle conflict between their training knowledge and a contradicting document presents a persistent empirical contradiction: some studies find models stubbornly retain their trained answers, ignoring provided documents nearly half the time, while others find models readily defer to the document, following context approximately 96% of the time. We argue these contradictions dissolve once one recognises that prior experiments have studied three qualitatively distinct processing situations without distinguishing them. We propose a three-regime framework: Regime 1 (single-source updating, dominant predictor: evidence coherence), Regime 2 (competitive integration, dominant predictor: parametric certainty), and Regime 3 (task-appropriate selection, dominant predictor: task knowledge requirement). We formalise a distinction between parametric…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
