MultiPhishGuard: An LLM-based Multi-Agent System for Phishing Email Detection

Yinuo Xue; Eric Spero; Yun Sing Koh; Giovanni Russello

arXiv:2505.23803·cs.CR·June 2, 2025

MultiPhishGuard: An LLM-based Multi-Agent System for Phishing Email Detection

Yinuo Xue, Eric Spero, Yun Sing Koh, Giovanni Russello

PDF

TL;DR

MultiPhishGuard introduces a multi-agent, LLM-based system employing reinforcement learning and adversarial training to improve phishing email detection accuracy and robustness against evolving tactics.

Contribution

The paper presents a novel multi-agent framework with adversarial training and explainability features, advancing phishing detection beyond traditional and single-agent methods.

Findings

01

Achieves 97.89% detection accuracy

02

Low false positive rate of 2.73%

03

Robust against adversarial email variants

Abstract

Phishing email detection faces critical challenges from evolving adversarial tactics and heterogeneous attack patterns. Traditional detection methods, such as rule-based filters and denylists, often struggle to keep pace with these evolving tactics, leading to false negatives and compromised security. While machine learning approaches have improved detection accuracy, they still face challenges adapting to novel phishing strategies. We present MultiPhishGuard, a dynamic LLM-based multi-agent detection system that synergizes specialized expertise with adversarial-aware reinforcement learning. Our framework employs five cooperative agents (text, URL, metadata, explanation simplifier, and adversarial agents) with automatically adjusted decision weights powered by a Proximal Policy Optimization reinforcement learning algorithm. To address emerging threats, we introduce an adversarial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsUmbrella Reinforcement Learning