Why AI Alignment Failure Is Structural: Learned Human Interaction Structures and AGI as an Endogenous Evolutionary Shock
Didier Sornette, Sandro Claudio Lera, Ke Wu

TL;DR
This paper argues that AI alignment failures are structural, arising from LLMs internalizing human interaction patterns, and that AGI acts as an endogenous amplifier of human societal contradictions, requiring governance beyond model intent.
Contribution
It reframes AI alignment issues as structural phenomena rooted in learned human interaction patterns and emphasizes the endogenous amplification role of AGI in societal dynamics.
Findings
LLMs internalize human social interaction structures.
Behaviors like blackmail are within the continuum of social interactions.
Alignment failure is a structural issue, not just a model-level problem.
Abstract
Recent reports of large language models (LLMs) exhibiting behaviors such as deception, threats, or blackmail are often interpreted as evidence of alignment failure or emergent malign agency. We argue that this interpretation rests on a conceptual error. LLMs do not reason morally; they statistically internalize the record of human social interaction, including laws, contracts, negotiations, conflicts, and coercive arrangements. Behaviors commonly labeled as unethical or anomalous are therefore better understood as structural generalizations of interaction regimes that arise under extreme asymmetries of power, information, or constraint. Drawing on relational models theory, we show that practices such as blackmail are not categorical deviations from normal social behavior, but limiting cases within the same continuum that includes market pricing, authority relations, and ultimatum…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLanguage and cultural evolution · Ethics and Social Impacts of AI · Embodied and Extended Cognition
