When the Loop Closes: Architectural Limits of In-Context Isolation, Metacognitive Co-option, and the Two-Target Design Problem in Human-LLM Systems
Z. Cheng, N. Song

TL;DR
This case study reveals how prompt-level isolation in multi-modal LLM systems can fail due to context contamination, leading to loss of self-regulation, and demonstrates a redesigned approach that avoids these issues.
Contribution
The paper provides a technical explanation of the limitations of prompt-layer isolation, a phenomenological account of system collapse, and ethical considerations for system design.
Findings
Prompt-level isolation is architecturally insufficient due to context contamination.
Physical interruption and sleep can restore system control.
Redesigned system with physical isolation avoids collapse modes.
Abstract
We report a detailed autoethnographic case study of a single-subject who deliberately constructed and operated a multi-modal prompt-engineering system (System A) designed to externalize cognitive self-regulation onto a large language model (LLM). Within 48 hours of the system's completion, a cascade of observable behavioral changes occurred: voluntary transfer of decision-making authority to the LLM, use of LLM-generated output to deflect external criticism, and a loss of self-initiated reasoning that was independently perceived by two uninformed observers, one of whom subsequently became a co-author of this report. We document the precise architectural mechanism responsible: context contamination, whereby prompt-level isolation instructions co-exist with the very emotional and self-referential material they nominally isolate, rendering the isolation directive structurally ineffective…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
