Assessing AI vs Human-Authored Spear Phishing SMS Attacks: An Empirical   Study

Jerson Francia; Derek Hansen; Ben Schooley; Matthew Taylor; Shydra; Murray; Greg Snow

arXiv:2406.13049·cs.CY·March 20, 2025·1 cites

Assessing AI vs Human-Authored Spear Phishing SMS Attacks: An Empirical Study

Jerson Francia, Derek Hansen, Ben Schooley, Matthew Taylor, Shydra, Murray, Greg Snow

PDF

Open Access

TL;DR

This empirical study compares the effectiveness of AI-generated versus human-authored spear phishing SMS messages, revealing that AI messages are often more convincing and harder to distinguish, highlighting risks of AI-enabled social engineering.

Contribution

The paper introduces a novel methodology called TRAPD for evaluating personalized deception and provides empirical evidence on AI-generated message effectiveness in spear phishing.

Findings

01

AI-generated messages are often perceived as more convincing.

02

Targets struggle to distinguish between AI and human messages.

03

Job-related messages are particularly convincing.

Abstract

This paper explores the use of Large Language Models (LLMs) in spear phishing message generation and evaluates their performance compared to human-authored counterparts. Our pilot study examines the effectiveness of smishing (SMS phishing) messages created by GPT-4 and human authors, which have been personalized for willing targets. The targets assessed these messages in a modified ranked-order experiment using a novel methodology we call TRAPD (Threshold Ranking Approach for Personalized Deception). Experiments involved ranking each spear phishing message from most to least convincing, providing qualitative feedback, and guessing which messages were human- or AI-generated. Results show that LLM-generated messages are often perceived as more convincing than those authored by humans, particularly job-related messages. Targets also struggled to distinguish between human- and AI-generated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpam and Phishing Detection · Misinformation and Its Impacts · Advanced Malware Detection Techniques

MethodsAttention Is All You Need · Softmax · Layer Normalization · Byte Pair Encoding · Label Smoothing · Position-Wise Feed-Forward Layer · Dropout · Adam · Linear Layer · Absolute Position Encodings