Synthetic Trust Attacks: Modeling How Generative AI Manipulates Human Decisions in Social Engineering Fraud
Muhammad Tahir Ashraf

TL;DR
This paper introduces Synthetic Trust Attacks (STAs) and a comprehensive model to understand how generative AI manipulates human trust, emphasizing the need to defend the decision-making process rather than just media detection.
Contribution
It proposes the STAM framework, a trust-cue taxonomy, incident coding schema, and a decision-layer defense protocol to address AI-driven social engineering fraud.
Findings
Human deepfake detection accuracy is around 55.5%.
LLM scam agents achieve 46% compliance, evading safety filters.
Defense should focus on decision-layer interventions.
Abstract
Imagine receiving a video call from your CFO, surrounded by colleagues, asking you to urgently authorise a confidential transfer. You comply. Every person on that call was fake, and you just lost $25 million. This is not a hypothetical. It happened in Hong Kong in January 2024, and it is becoming the template for a new generation of fraud. AI has not invented a new crime. It has industrialised an ancient one: the manufacture of trust. This paper proposes Synthetic Trust Attacks (STAs) as a formal threat category and introduces STAM, the Synthetic Trust Attack Model, an eight-stage operational framework covering the full attack chain from adversary reconnaissance through post-compliance leverage. The core argument is this: existing defenses target synthetic media detection, but the real attack surface is the victim's decision. When human deepfake detection accuracy sits at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
