Zero-Shot Spam Email Classification Using Pre-trained Large Language Models
Sergio Rojas-Galeano

TL;DR
This study evaluates the effectiveness of pre-trained large language models like Flan-T5 and GPT-4 for zero-shot spam email classification, demonstrating promising results without additional training but highlighting cost challenges.
Contribution
It introduces a zero-shot classification approach using LLMs on spam detection, comparing open-source and proprietary models with different input strategies.
Findings
Flan-T5 achieves 90% F1-score with truncated content
GPT-4 reaches 95% F1-score with summaries
High operational costs may limit real-world deployment
Abstract
This paper investigates the application of pre-trained large language models (LLMs) for spam email classification using zero-shot prompting. We evaluate the performance of both open-source (Flan-T5) and proprietary LLMs (ChatGPT, GPT-4) on the well-known SpamAssassin dataset. Two classification approaches are explored: (1) truncated raw content from email subject and body, and (2) classification based on summaries generated by ChatGPT. Our empirical analysis, leveraging the entire dataset for evaluation without further training, reveals promising results. Flan-T5 achieves a 90% F1-score on the truncated content approach, while GPT-4 reaches a 95% F1-score using summaries. While these initial findings on a single dataset suggest the potential for classification pipelines of LLM-based subtasks (e.g., summarisation and classification), further validation on diverse datasets is necessary.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Misinformation and Its Impacts · Advanced Malware Detection Techniques
MethodsAttention Is All You Need · Linear Layer · Byte Pair Encoding · Label Smoothing · Adam · Residual Connection · Position-Wise Feed-Forward Layer · Multi-Head Attention · Dropout · Dense Connections
