Investigating the Effectiveness of Bayesian Spam Filters in Detecting LLM-modified Spam Mails
Malte Josten, Torben Weis

TL;DR
This paper evaluates the robustness of Bayesian spam filters, specifically SpamAssassin, against LLM-modified spam emails, revealing a high misclassification rate and highlighting the need for improved cybersecurity defenses.
Contribution
It introduces a pipeline to test SpamAssassin's effectiveness against LLM-modified spam, demonstrating significant vulnerabilities not previously documented.
Findings
SpamAssassin misclassified up to 73.7% of LLM-modified spam emails.
Dictionary-replacement attacks had a success rate of only 0.4%.
LLM-modified spam poses a substantial threat to existing filters.
Abstract
Spam and phishing remain critical threats in cybersecurity, responsible for nearly 90% of security incidents. As these attacks grow in sophistication, the need for robust defensive mechanisms intensifies. Bayesian spam filters, like the widely adopted open-source SpamAssassin, are essential tools in this fight. However, the emergence of large language models (LLMs) such as ChatGPT presents new challenges. These models are not only powerful and accessible, but also inexpensive to use, raising concerns about their misuse in crafting sophisticated spam emails that evade traditional spam filters. This work aims to evaluate the robustness and effectiveness of SpamAssassin against LLM-modified email content. We developed a pipeline to test this vulnerability. Our pipeline modifies spam emails using GPT-3.5 Turbo and assesses SpamAssassin's ability to classify these modified emails correctly.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Internet Traffic Analysis and Secure E-voting · Network Security and Intrusion Detection
