Beyond the Attention Stability Boundary: Agentic Self-Synthesizing Reasoning Protocols
Dahlia Shehata, Ming Li

TL;DR
This paper identifies a systemic failure mode in autoregressive transformers called the Attention Latch, and proposes Self-Synthesizing Reasoning Protocols (SSRP) to improve goal-directed reasoning in LLM agents, demonstrating significant empirical gains.
Contribution
The paper introduces SSRP, a novel metacognitive framework that separates architectural planning from procedural execution, effectively addressing the Attention Latch failure mode in LLM agents.
Findings
SSRP achieves a 715X resilience lift over vanilla ReAct baselines.
Empirical location of the Attention Stability Boundary where baseline performance collapses.
Statistically significant improvements across multiple LLM models.
Abstract
As LLM agents transition to autonomous digital coworkers, maintaining deterministic goal-directedness in non-linear multi-turn conversations emerged as an architectural bottleneck. We identify and formalize a systemic failure mode termed the Attention Latch in decoder-only autoregressive Transformers. This phenomenon, a behavioral manifestation of Information Over-squashing, occurs when the cumulative probabilistic weight of historical context overrides mid-task updates, causing agents to remain anchored to obsolete constraints despite explicit contradictory instructions. We propose Self-Synthesizing Reasoning Protocols (SSRP), a metacognitive framework that implements a discrete separation between high-level architectural planning (Architect) and turn-by-turn procedural execution (Executive). We evaluate SSRP across 9K trajectories using the MultiWOZ 2.2 dataset and the Aggregate Pivot…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
