Phishing Detection in the Gen-AI Era: Quantized LLMs vs Classical Models
Jikesh Thapa, Gurrehmat Chahal, Serban Voinea Gabreanu, Yazan Otoum

TL;DR
This paper compares traditional ML, DL, and quantized LLMs for phishing detection, highlighting their strengths, limitations, and potential for real-time, cost-effective cybersecurity applications.
Contribution
It provides a comprehensive benchmarking of quantized LLMs against classical models, exploring their accuracy, robustness, and interpretability in phishing detection.
Findings
Quantized LLMs achieve over 80% accuracy with low VRAM usage.
Rephrasing emails significantly reduces detection performance.
Lightweight LLMs offer interpretable explanations for decisions.
Abstract
Phishing attacks are becoming increasingly sophisticated, underscoring the need for detection systems that strike a balance between high accuracy and computational efficiency. This paper presents a comparative evaluation of traditional Machine Learning (ML), Deep Learning (DL), and quantized small-parameter Large Language Models (LLMs) for phishing detection. Through experiments on a curated dataset, we show that while LLMs currently underperform compared to ML and DL methods in terms of raw accuracy, they exhibit strong potential for identifying subtle, context-based phishing cues. We also investigate the impact of zero-shot and few-shot prompting strategies, revealing that LLM-rephrased emails can significantly degrade the performance of both ML and LLM-based detectors. Our benchmarking highlights that models like DeepSeek R1 Distill Qwen 14B (Q8_0) achieve competitive accuracy, above…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Advanced Malware Detection Techniques · Adversarial Robustness in Machine Learning
