Investigating the Effectiveness of Bayesian Spam Filters in Detecting LLM-modified Spam Mails

Malte Josten; Torben Weis

arXiv:2408.14293·cs.CR·May 27, 2025

Investigating the Effectiveness of Bayesian Spam Filters in Detecting LLM-modified Spam Mails

Malte Josten, Torben Weis

PDF

Open Access 1 Repo

TL;DR

This paper evaluates the robustness of Bayesian spam filters, specifically SpamAssassin, against LLM-modified spam emails, revealing a high misclassification rate and highlighting the need for improved cybersecurity defenses.

Contribution

It introduces a pipeline to test SpamAssassin's effectiveness against LLM-modified spam, demonstrating significant vulnerabilities not previously documented.

Findings

01

SpamAssassin misclassified up to 73.7% of LLM-modified spam emails.

02

Dictionary-replacement attacks had a success rate of only 0.4%.

03

LLM-modified spam poses a substantial threat to existing filters.

Abstract

Spam and phishing remain critical threats in cybersecurity, responsible for nearly 90% of security incidents. As these attacks grow in sophistication, the need for robust defensive mechanisms intensifies. Bayesian spam filters, like the widely adopted open-source SpamAssassin, are essential tools in this fight. However, the emergence of large language models (LLMs) such as ChatGPT presents new challenges. These models are not only powerful and accessible, but also inexpensive to use, raising concerns about their misuse in crafting sophisticated spam emails that evade traditional spam filters. This work aims to evaluate the robustness and effectiveness of SpamAssassin against LLM-modified email content. We developed a pipeline to test this vulnerability. Our pipeline modifies spam emails using GPT-3.5 Turbo and assesses SpamAssassin's ability to classify these modified emails correctly.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

MalteJosten/llm-spam-test
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpam and Phishing Detection · Internet Traffic Analysis and Secure E-voting · Network Security and Intrusion Detection