Breaking the Illusion of Identity in LLM Tooling
Marek Miller

TL;DR
This paper introduces seven output-side rules to reduce anthropomorphic language in LLM outputs, significantly decreasing perceived agency and improving trust calibration without modifying the model.
Contribution
It proposes a systematic, deployable set of constraints targeting linguistic cues to mitigate identity illusions in LLM outputs, validated empirically.
Findings
Anthropomorphic markers dropped from 1233 to 33 (>97% reduction)
Outputs were 49% shorter by word count
Shift toward machine register confirmed by AnthroScore (-1.94 vs. -0.96, p < 0.001)
Abstract
Large language models (LLMs) in research and development toolchains produce output that triggers attribution of agency and understanding -- a cognitive illusion that degrades verification behavior and trust calibration. No existing mitigation provides a systematic, deployable constraint set for output register. This paper proposes seven output-side rules, each targeting a documented linguistic mechanism, and validates them empirically. In 780 two-turn conversations (constrained vs. default register, 30 tasks, 13 replicates, 1560 API calls), anthropomorphic markers dropped from 1233 to 33 (>97% reduction, p < 0.001), outputs were 49% shorter by word count, and adapted AnthroScore confirmed the shift toward machine register (-1.94 vs. -0.96, p < 0.001). The rules are implemented as a configuration-file system prompt requiring no model modification; validation uses a single model (Claude…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
