Zero-Shot Spam Email Classification Using Pre-trained Large Language Models

Sergio Rojas-Galeano

arXiv:2405.15936 · cs.CL · October 22, 2024 · 2 cites

Open Access

TL;DR

This study evaluates pre-trained large language models such as Flan-T5 and GPT-4 for zero-shot spam email classification. The results are promising without any additional training, although operational cost remains a challenge.

Contribution

The paper applies zero-shot LLM classification to spam detection, comparing open-source and proprietary models under two input strategies: truncated raw email content and ChatGPT-generated summaries.

Findings

01  Flan-T5 achieves 90% F1-score with truncated content

02  GPT-4 reaches 95% F1-score with summaries

03  High operational costs may limit real-world deployment
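
For reference, the F1-score quoted in these findings is the harmonic mean of precision and recall. The following is a minimal Python sketch of that computation. It assumes spam is the positive class and that labels are the strings "spam" and "ham"; neither the label encoding nor the averaging scheme is stated on this page, so both are illustrative assumptions.

```python
def f1_score(y_true, y_pred, positive="spam"):
    """Binary F1 for one positive class (assumed here to be 'spam')."""
    pairs = list(zip(y_true, y_pred))
    # Count true positives, false positives and false negatives.
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No correct positive predictions: F1 is defined as 0 here.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)
```

Because F1 ignores true negatives, a classifier that labels every email as ham scores 0 under this definition regardless of how many ham emails it gets right.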

Abstract

This paper investigates the application of pre-trained large language models (LLMs) for spam email classification using zero-shot prompting. We evaluate the performance of both open-source (Flan-T5) and proprietary LLMs (ChatGPT, GPT-4) on the well-known SpamAssassin dataset. Two classification approaches are explored: (1) truncated raw content from email subject and body, and (2) classification based on summaries generated by ChatGPT. Our empirical analysis, leveraging the entire dataset for evaluation without further training, reveals promising results. Flan-T5 achieves a 90% F1-score on the truncated content approach, while GPT-4 reaches a 95% F1-score using summaries. While these initial findings on a single dataset suggest the potential for classification pipelines of LLM-based subtasks (e.g., summarisation and classification), further validation on diverse datasets is necessary.…
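
The two approaches described in the abstract can be sketched roughly as follows. This is an illustrative Python outline, not the paper's implementation: the prompt wording, the `MAX_CHARS` truncation budget, the function names, and the fallback to "ham" for unrecognised answers are all hypothetical. The model is passed in as a plain callable so the sketch does not depend on any particular LLM API.

```python
from typing import Callable, Optional

MAX_CHARS = 2000  # hypothetical truncation budget; not taken from the paper


def build_prompt(subject: str, body: str, max_chars: int = MAX_CHARS) -> str:
    """Build a zero-shot prompt from truncated subject and body text."""
    content = f"Subject: {subject}\n\n{body}"[:max_chars]
    return (
        "Classify the following email as 'spam' or 'ham'. "
        "Answer with a single word.\n\n"
        f"{content}\n\nAnswer:"
    )


def parse_label(answer: str) -> str:
    """Map a free-text model answer to a label (defaults to 'ham': an assumption)."""
    return "spam" if answer.strip().lower().startswith("spam") else "ham"


def classify(
    subject: str,
    body: str,
    llm: Callable[[str], str],
    summarise: Optional[Callable[[str], str]] = None,
) -> str:
    """Approach 1: truncated raw content. Approach 2: pass a summariser first."""
    if summarise is not None:
        body = summarise(body)  # e.g. a call to a summarisation model
    return parse_label(llm(build_prompt(subject, body)))
```

In use, `llm` would wrap a call to whichever model is under evaluation, and `summarise` would wrap the summarisation step; leaving `summarise` as `None` corresponds to the truncated-content approach.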

Taxonomy

Topics: Spam and Phishing Detection · Misinformation and Its Impacts · Advanced Malware Detection Techniques

Methods: Attention Is All You Need · Linear Layer · Byte Pair Encoding · Label Smoothing · Adam · Residual Connection · Position-Wise Feed-Forward Layer · Multi-Head Attention · Dropout · Dense Connections