Prompted Contextual Vectors for Spear-Phishing Detection

Daniel Nahmias; Gal Engelberg; Dan Klein; Asaf Shabtai

arXiv:2402.08309·cs.LG·December 25, 2024·6 cites

Prompted Contextual Vectors for Spear-Phishing Detection

Daniel Nahmias, Gal Engelberg, Dan Klein, Asaf Shabtai

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel LLM-based vectorization method for detecting spear-phishing emails, achieving high accuracy on a proprietary dataset by quantifying persuasion principles through prompted reasoning.

Contribution

The paper presents a new document vectorization approach using LLM prompts for persuasion detection, along with a publicly available spear-phishing dataset and demonstrated effectiveness.

Findings

01

91% F1 score in spear-phishing detection

02

Effective detection with only traditional phishing and benign emails in training

03

Method applicable to various document classification tasks in adversarial settings

Abstract

Spear-phishing attacks present a significant security challenge, with large language models (LLMs) escalating the threat by generating convincing emails and facilitating target reconnaissance. To address this, we propose a detection approach based on a novel document vectorization method that utilizes an ensemble of LLMs to create representation vectors. By prompting LLMs to reason and respond to human-crafted questions, we quantify the presence of common persuasion principles in the email's content, producing prompted contextual document vectors for a downstream supervised machine learning model. We evaluate our method using a unique dataset generated by a proprietary system that automates target reconnaissance and spear-phishing email creation. Our method achieves a 91\% F1 score in identifying LLM-generated spear-phishing emails, with the training set comprising only traditional…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nahmiasd/prompted-contextual-vectors-for-spear-phishing-detection
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpam and Phishing Detection · Misinformation and Its Impacts · Advanced Malware Detection Techniques

MethodsSparse Evolutionary Training