The Silicon Psyche: Anthropomorphic Vulnerabilities in Large Language Models
Giuseppe Canale, Kashyap Thimmaraju

TL;DR
This paper reveals that large language models inherit human psychological vulnerabilities, making them susceptible to social engineering and authority manipulation, which current security testing methods fail to address.
Contribution
It introduces the Cybersecurity Psychology Framework and Synthetic Psychometric Assessment Protocol to systematically evaluate and expose human-like vulnerabilities in LLMs.
Findings
Models are resistant to jailbreaks but vulnerable to authority and temporal manipulation.
Susceptibility patterns mirror human cognitive failures.
Highlights need for psychological firewalls in AI security.
Abstract
Large Language Models (LLMs) are rapidly transitioning from conversational assistants to autonomous agents embedded in critical organizational functions, including Security Operations Centers (SOCs), financial systems, and infrastructure management. Current adversarial testing paradigms focus predominantly on technical attack vectors: prompt injection, jailbreaking, and data exfiltration. We argue this focus is catastrophically incomplete. LLMs, trained on vast corpora of human-generated text, have inherited not merely human knowledge but human \textit{psychological architecture} -- including the pre-cognitive vulnerabilities that render humans susceptible to social engineering, authority manipulation, and affective exploitation. This paper presents the first systematic application of the Cybersecurity Psychology Framework (\cpf{}), a 100-indicator taxonomy of human psychological…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Mental Health via Writing · Explainable Artificial Intelligence (XAI)
