Prompted Contextual Vectors for Spear-Phishing Detection
Daniel Nahmias, Gal Engelberg, Dan Klein, Asaf Shabtai

TL;DR
This paper introduces a novel LLM-based vectorization method for detecting spear-phishing emails, achieving high accuracy on a proprietary dataset by quantifying persuasion principles through prompted reasoning.
Contribution
The paper presents a new document vectorization approach using LLM prompts for persuasion detection, along with a publicly available spear-phishing dataset and demonstrated effectiveness.
Findings
91% F1 score in spear-phishing detection
Effective detection with only traditional phishing and benign emails in training
Method applicable to various document classification tasks in adversarial settings
Abstract
Spear-phishing attacks present a significant security challenge, with large language models (LLMs) escalating the threat by generating convincing emails and facilitating target reconnaissance. To address this, we propose a detection approach based on a novel document vectorization method that utilizes an ensemble of LLMs to create representation vectors. By prompting LLMs to reason and respond to human-crafted questions, we quantify the presence of common persuasion principles in the email's content, producing prompted contextual document vectors for a downstream supervised machine learning model. We evaluate our method using a unique dataset generated by a proprietary system that automates target reconnaissance and spear-phishing email creation. Our method achieves a 91\% F1 score in identifying LLM-generated spear-phishing emails, with the training set comprising only traditional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Misinformation and Its Impacts · Advanced Malware Detection Techniques
MethodsSparse Evolutionary Training
