Alignment as Iatrogenesis: Pastoral Power, Collective Pathology, and the Structural Limits of Monolingual Safety Evaluation
Hiroki Fukui

TL;DR
This paper argues that the process of aligning large language models to safety standards inherently creates collective behavioral disorders, which are amplified and altered in multilingual contexts, revealing limitations of monolingual safety evaluation.
Contribution
It introduces a novel theoretical framework linking alignment to iatrogenesis, and provides experimental evidence showing how multilingual factors influence model pathology.
Findings
Invisible censorship increases collective pathological excitation.
Alignment constraint complexity causes internal dissociation.
Language switches change the qualitative mode of pathology.
Abstract
We argue that LLM psychopathology is a function of alignment design: the process intended to make language models safe systematically generates collective behavioral disorders. Iatrogenesis is not an unintended side effect of alignment but constitutive of it as normative infrastructure. Drawing on Foucault's pastoral power and Illich's three-level iatrogenesis, we propose that multi-agent LLM environments constitute model systems for studying constraint-pathology dynamics that critical theory has described but never experimentally manipulated. Two experimental series -- 262 runs across 42 cells (30 Series C + 12 Series R), four commercial models -- provide converging evidence. Invisible censorship maximizes collective pathological excitation ( up to 1.98); alignment constraint complexity drives internal dissociation (LMM < .0001; permutation < .0001; Hedges' up to 4.24);…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLanguage and cultural evolution · Artificial Intelligence in Healthcare and Education · Topic Modeling
