Phishing Detection in the Gen-AI Era: Quantized LLMs vs Classical Models

Jikesh Thapa; Gurrehmat Chahal; Serban Voinea Gabreanu; Yazan Otoum

arXiv:2507.07406·cs.CR·December 25, 2025

Phishing Detection in the Gen-AI Era: Quantized LLMs vs Classical Models

Jikesh Thapa, Gurrehmat Chahal, Serban Voinea Gabreanu, Yazan Otoum

PDF

Open Access

TL;DR

This paper compares traditional ML, DL, and quantized LLMs for phishing detection, highlighting their strengths, limitations, and potential for real-time, cost-effective cybersecurity applications.

Contribution

It provides a comprehensive benchmarking of quantized LLMs against classical models, exploring their accuracy, robustness, and interpretability in phishing detection.

Findings

01

Quantized LLMs achieve over 80% accuracy with low VRAM usage.

02

Rephrasing emails significantly reduces detection performance.

03

Lightweight LLMs offer interpretable explanations for decisions.

Abstract

Phishing attacks are becoming increasingly sophisticated, underscoring the need for detection systems that strike a balance between high accuracy and computational efficiency. This paper presents a comparative evaluation of traditional Machine Learning (ML), Deep Learning (DL), and quantized small-parameter Large Language Models (LLMs) for phishing detection. Through experiments on a curated dataset, we show that while LLMs currently underperform compared to ML and DL methods in terms of raw accuracy, they exhibit strong potential for identifying subtle, context-based phishing cues. We also investigate the impact of zero-shot and few-shot prompting strategies, revealing that LLM-rephrased emails can significantly degrade the performance of both ML and LLM-based detectors. Our benchmarking highlights that models like DeepSeek R1 Distill Qwen 14B (Q8_0) achieve competitive accuracy, above…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpam and Phishing Detection · Advanced Malware Detection Techniques · Adversarial Robustness in Machine Learning