Phishing Email Detection Using Inputs From Artificial Intelligence

Mith\"un Paul; Genevieve Bartlett; Jelena Mirkovic; Marjorie Freedman

arXiv:2405.12494·cs.CR·May 22, 2024·3 cites

Phishing Email Detection Using Inputs From Artificial Intelligence

Mith\"un Paul, Genevieve Bartlett, Jelena Mirkovic, Marjorie Freedman

PDF

Open Access

TL;DR

This paper explores using artificial intelligence and natural language processing to detect phishing emails by analyzing signals similar to those taught in security training, aiming to improve automatic detection and training methods.

Contribution

It introduces a new dataset with annotated signals for phishing detection and develops baseline models, bridging human training signals with machine learning approaches.

Findings

01

Baseline models perform comparably to human annotators on signal detection.

02

Analysis provides insights for improving training curricula for both humans and machines.

03

The dataset enables further research in AI-driven phishing detection.

Abstract

Enterprise security is increasingly being threatened by social engineering attacks, such as phishing, which deceive employees into giving access to enterprise data. To protect both the users themselves and enterprise data, more and more organizations provide cyber security training that seeks to teach employees/customers to identify and report suspicious content. By its very nature, such training seeks to focus on signals that are likely to persist across a wide range of attacks. Further, it expects the user to apply the learnings from these training on e-mail messages that were not filtered by existing, automatic enterprise security (e.g., spam filters and commercial phishing detection software). However, relying on such training now shifts the detection of phishing from an automatic process to a human driven one which is fallible especially when a user errs due to distraction,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpam and Phishing Detection