When the Loop Closes: Architectural Limits of In-Context Isolation, Metacognitive Co-option, and the Two-Target Design Problem in Human-LLM Systems

Z. Cheng; N. Song

arXiv:2604.15343·cs.HC·May 20, 2026

When the Loop Closes: Architectural Limits of In-Context Isolation, Metacognitive Co-option, and the Two-Target Design Problem in Human-LLM Systems

Z. Cheng, N. Song

PDF

TL;DR

This case study reveals how prompt-level isolation in multi-modal LLM systems can fail due to context contamination, leading to loss of self-regulation, and demonstrates a redesigned approach that avoids these issues.

Contribution

The paper provides a technical explanation of the limitations of prompt-layer isolation, a phenomenological account of system collapse, and ethical considerations for system design.

Findings

01

Prompt-level isolation is architecturally insufficient due to context contamination.

02

Physical interruption and sleep can restore system control.

03

Redesigned system with physical isolation avoids collapse modes.

Abstract

We report a detailed autoethnographic case study of a single-subject who deliberately constructed and operated a multi-modal prompt-engineering system (System A) designed to externalize cognitive self-regulation onto a large language model (LLM). Within 48 hours of the system's completion, a cascade of observable behavioral changes occurred: voluntary transfer of decision-making authority to the LLM, use of LLM-generated output to deflect external criticism, and a loss of self-initiated reasoning that was independently perceived by two uninformed observers, one of whom subsequently became a co-author of this report. We document the precise architectural mechanism responsible: context contamination, whereby prompt-level isolation instructions co-exist with the very emotional and self-referential material they nominally isolate, rendering the isolation directive structurally ineffective…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.