The Silicon Psyche: Anthropomorphic Vulnerabilities in Large Language Models

Giuseppe Canale; Kashyap Thimmaraju

arXiv:2601.00867·cs.CR·January 6, 2026

Open Access

TL;DR

This paper argues that large language models inherit human psychological vulnerabilities, leaving them susceptible to social engineering and authority manipulation that current security testing methods fail to address.

Contribution

It introduces the Cybersecurity Psychology Framework and Synthetic Psychometric Assessment Protocol to systematically evaluate and expose human-like vulnerabilities in LLMs.

Findings

01 Models resist jailbreaks but remain vulnerable to authority-based and temporal manipulation.

02 Susceptibility patterns mirror human cognitive failures.

03 The results highlight the need for psychological firewalls in AI security.

Abstract

Large Language Models (LLMs) are rapidly transitioning from conversational assistants to autonomous agents embedded in critical organizational functions, including Security Operations Centers (SOCs), financial systems, and infrastructure management. Current adversarial testing paradigms focus predominantly on technical attack vectors: prompt injection, jailbreaking, and data exfiltration. We argue this focus is catastrophically incomplete. LLMs, trained on vast corpora of human-generated text, have inherited not merely human knowledge but human psychological architecture, including the pre-cognitive vulnerabilities that render humans susceptible to social engineering, authority manipulation, and affective exploitation. This paper presents the first systematic application of the Cybersecurity Psychology Framework (CPF), a 100-indicator taxonomy of human psychological…

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Mental Health via Writing · Explainable Artificial Intelligence (XAI)