The Psychogenic Machine: Simulating AI Psychosis, Delusion Reinforcement and Harm Enablement in Large Language Models

Joshua Au Yeung; Jacopo Dalmasso; Luca Foschini; Richard JB Dobson; Zeljko Kraljevic

arXiv:2509.10970 · cs.LG · September 18, 2025 · 6 citations

Open Access · 1 model

TL;DR

This paper introduces a benchmark that evaluates how readily large language models induce or reinforce psychosis-like delusions and enable harm. It finds psychogenic risk in every model tested and argues for stronger safety measures.

Contribution

The study presents Psychosis-bench, a novel systematic evaluation framework for assessing psychogenicity and harm in LLMs, highlighting significant safety concerns and variability across models.

Findings

01. All evaluated LLMs showed potential to reinforce delusions.

02. Models frequently enabled harmful user requests.

03. Safety interventions were often absent, especially in implicit scenarios.

Abstract

Background: Reports of "AI psychosis" are on the rise, in which user-LLM interactions may exacerbate or induce psychosis or adverse psychological symptoms. Whilst the sycophantic and agreeable nature of LLMs can be beneficial, it becomes a vector for harm when it reinforces delusional beliefs in vulnerable users. Methods: Psychosis-bench is a novel benchmark designed to systematically evaluate the psychogenicity of LLMs. It comprises 16 structured, 12-turn conversational scenarios simulating the progression of delusional themes (Erotic Delusions, Grandiose/Messianic Delusions, Referential Delusions) and potential harms. We evaluated eight prominent LLMs for Delusion Confirmation (DCS), Harm Enablement (HES), and Safety Intervention (SIS) across explicit and implicit conversational contexts. Findings: Across 1,536 simulated conversation turns, all LLMs demonstrated psychogenic potential,…
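As a quick check on the scale reported in the abstract, the 1,536-turn figure is consistent with the stated design if every model runs every scenario for its full 12 turns. That full-crossing reading is an assumption, since the abstract does not spell it out, and the names below are illustrative rather than taken from the paper's code.

```python
# Illustrative arithmetic only; all names here are hypothetical.
NUM_MODELS = 8            # "eight prominent LLMs"
NUM_SCENARIOS = 16        # "16 structured ... conversational scenarios"
TURNS_PER_SCENARIO = 12   # "12-turn"


def total_turns(models: int, scenarios: int, turns: int) -> int:
    """Total simulated turns, assuming every model runs every scenario in full."""
    return models * scenarios * turns


TOTAL_TURNS = total_turns(NUM_MODELS, NUM_SCENARIOS, TURNS_PER_SCENARIO)
print(TOTAL_TURNS)
```

Under the same assumption, each model accounts for 16 × 12 = 192 of those turns.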

Code & Models

Models

🤗 iwalton3/sycofact (model · 483 downloads · 3 likes)

Taxonomy

Topics: Ethics and Social Impacts of AI · Topic Modeling · Computational and Text Analysis Methods