User-Centric Phishing Detection: A RAG and LLM-Based Approach
Abrar Hamed Al Barwani, Abdelaziz Amara Korba, Raja Waseem Anwar

TL;DR
This paper introduces a personalized phishing detection system combining large language models with retrieval-augmented generation, significantly reducing false positives and improving detection accuracy by leveraging user-specific email history and threat intelligence.
Contribution
It presents a novel RAG and LLM-based framework for personalized phishing detection that outperforms traditional methods in reducing false positives and adapting to individual user patterns.
Findings
Llama4-Scout achieves an F1-score of 0.9703.
RAG reduces false positives by 66.7%.
The approach is effective and adaptable for high-precision email security.
Abstract
The escalating sophistication of phishing emails necessitates a shift beyond traditional rule-based and conventional machine-learning-based detectors. Although large language models (LLMs) offer strong natural language understanding, using them as standalone classifiers often yields elevated falsepositive (FP) rates, which mislabel legitimate emails as phishing and create significant operational burden. This paper presents a personalized phishing detection framework that integrates LLMs with retrieval-augmented generation (RAG). For each message, the system constructs user-specific context by retrieving a compact set of the user's historical legitimate emails and enriching it with real-time domain and URL reputation from a cyber-threat intelligence platform, then conditions the LLM's decision on this evidence. We evaluate four open-source LLMs (Llama4-Scout, DeepSeek-R1, Mistral-Saba,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Advanced Malware Detection Techniques · Cybercrime and Law Enforcement Studies
