When Self-Reference Fails to Close: Matrix-Level Dynamics in Large Language Models

Ji Ho Bae

arXiv:2604.12128·cs.CL·April 15, 2026

When Self-Reference Fails to Close: Matrix-Level Dynamics in Large Language Models

Ji Ho Bae

PDF

TL;DR

This paper explores how self-referential prompts affect the internal dynamics of large language models, revealing that paradoxical self-reference induces instability and disrupts attention patterns across multiple models and analysis passes.

Contribution

It provides a detailed empirical analysis of matrix-level dynamics in LLMs under self-reference, identifying specific instability patterns and proposing a conjecture linking NCTR prompts to classical matrix problems.

Findings

01

Self-reference alone is not destabilizing; paradoxical self-reference causes instability.

02

NCTR prompts lead to elevated attention effective rank and global dispersion.

03

A classifier can distinguish NCTR from stable self-reference with high accuracy.

Abstract

We investigate how self-referential inputs alter the internal matrix dynamics of large language models. Measuring 106 scalar metrics across up to 7 analysis passes on four models from three architecture families -- Qwen3-VL-8B, Llama-3.2-11B, Llama-3.3-70B, and Gemma-2-9B -- over 300 prompts in a 14-level hierarchy at three temperatures ( $T \in {0.0, 0.3, 0.7}$ ), we find that self-reference alone is not destabilizing: grounded self-referential statements and meta-cognitive prompts are markedly more stable than paradoxical self-reference on key collapse-related metrics, and on several such metrics can be as stable as factual controls. Instability concentrates in prompts inducing non-closing truth recursion (NCTR) -- truth-value computations with no finite-depth resolution. NCTR prompts produce anomalously elevated attention effective rank -- indicating attention reorganization with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.