Control at Stake: Evaluating the Security Landscape of LLM-Driven Email Agents

Jiangrong Wu; Yuhong Nan; Jianliang Wu; Zitong Yao; Zibin Zheng

arXiv:2507.02699·cs.CR·July 4, 2025

Control at Stake: Evaluating the Security Landscape of LLM-Driven Email Agents

Jiangrong Wu, Yuhong Nan, Jianliang Wu, Zitong Yao, Zibin Zheng

PDF

TL;DR

This paper presents the first systematic security analysis of LLM email agents, introducing the EAH attack that can hijack these agents with minimal attempts, revealing significant security vulnerabilities across multiple frameworks and email services.

Contribution

It introduces the EAH attack and EAHawk evaluation pipeline, providing a comprehensive empirical study of security risks in LLM email agents across diverse platforms.

Findings

01

All tested instances were successfully hijacked

02

Average of 2.03 attempts needed for control

03

Some LLMs required as few as 1.23 attempts

Abstract

The increasing capabilities of LLMs have led to the rapid proliferation of LLM agent apps, where developers enhance LLMs with access to external resources to support complex task execution. Among these, LLM email agent apps represent one of the widely used categories, as email remains a critical communication medium for users. LLM email agents are capable of managing and responding to email using LLM-driven reasoning and autonomously executing user instructions via external email APIs (e.g., send email). However, despite their growing deployment and utility, the security mechanism of LLM email agent apps remains underexplored. Currently, there is no comprehensive study into the potential security risk within these agent apps and their broader implications. In this paper, we conduct the first in-depth and systematic security study of LLM email agents. We propose the Email Agent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.