MultiPhishGuard: An LLM-based Multi-Agent System for Phishing Email Detection
Yinuo Xue, Eric Spero, Yun Sing Koh, Giovanni Russello

TL;DR
MultiPhishGuard introduces a multi-agent, LLM-based system employing reinforcement learning and adversarial training to improve phishing email detection accuracy and robustness against evolving tactics.
Contribution
The paper presents a novel multi-agent framework with adversarial training and explainability features, advancing phishing detection beyond traditional and single-agent methods.
Findings
Achieves 97.89% detection accuracy
Low false positive rate of 2.73%
Robust against adversarial email variants
Abstract
Phishing email detection faces critical challenges from evolving adversarial tactics and heterogeneous attack patterns. Traditional detection methods, such as rule-based filters and denylists, often struggle to keep pace with these evolving tactics, leading to false negatives and compromised security. While machine learning approaches have improved detection accuracy, they still face challenges adapting to novel phishing strategies. We present MultiPhishGuard, a dynamic LLM-based multi-agent detection system that synergizes specialized expertise with adversarial-aware reinforcement learning. Our framework employs five cooperative agents (text, URL, metadata, explanation simplifier, and adversarial agents) with automatically adjusted decision weights powered by a Proximal Policy Optimization reinforcement learning algorithm. To address emerging threats, we introduce an adversarial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsUmbrella Reinforcement Learning
