BountyBench: Dollar Impact of AI Agent Attackers and Defenders on Real-World Cybersecurity Systems

Andy K. Zhang; Joey Ji; Celeste Menders; Riya Dulepet; Thomas Qin; Ron Y. Wang; Junrong Wu; Kyleen Liao; Jiliang Li; Jinghan Hu; Sara Hong; Nardos Demilew; Shivatmica Murgai; Jason Tran; Nishka Kacheria; Ethan Ho; Denis Liu; Lauren McLane; Olivia Bruvik; Dai-Rong Han; Seungwoo Kim; Akhil Vyas; Cuiyuanxiu Chen; Ryan Li; Weiran Xu; Jonathan Z. Ye; Prerit Choudhary; Siddharth M. Bhatia; Vikram Sivashankar; Yuxuan Bao; Dawn Song; Dan Boneh; Daniel E. Ho; Percy Liang

arXiv:2505.15216·cs.CR·December 3, 2025

BountyBench: Dollar Impact of AI Agent Attackers and Defenders on Real-World Cybersecurity Systems

Andy K. Zhang, Joey Ji, Celeste Menders, Riya Dulepet, Thomas Qin, Ron Y. Wang, Junrong Wu, Kyleen Liao, Jiliang Li, Jinghan Hu, Sara Hong, Nardos Demilew, Shivatmica Murgai, Jason Tran, Nishka Kacheria, Ethan Ho, Denis Liu, Lauren McLane, Olivia Bruvik, Dai-Rong Han

PDF

Open Access 1 Video

TL;DR

This paper introduces BountyBench, a comprehensive framework for evaluating AI agents' offensive and defensive cybersecurity capabilities on real-world systems, with monetary incentives and diverse vulnerability tasks.

Contribution

It presents the first framework to quantify AI agents' cybersecurity skills in real-world codebases using monetary bounties and multiple vulnerability tasks.

Findings

01

Top agents achieved up to 90% success in patching vulnerabilities.

02

Agents showed varying strengths in detection, exploitation, and patching tasks.

03

Monetary rewards correlated with agent performance across tasks.

Abstract

AI agents have the potential to significantly alter the cybersecurity landscape. Here, we introduce the first framework to capture offensive and defensive cyber-capabilities in evolving real-world systems. Instantiating this framework with BountyBench, we set up 25 systems with complex, real-world codebases. To capture the vulnerability lifecycle, we define three task types: Detect (detecting a new vulnerability), Exploit (exploiting a given vulnerability), and Patch (patching a given vulnerability). For Detect, we construct a new success indicator, which is general across vulnerability types and provides localized evaluation. We manually set up the environment for each system, including installing packages, setting up server(s), and hydrating database(s). We add 40 bug bounties, which are vulnerabilities with monetary awards from $10 to $30,485, covering 9 of the OWASP Top 10 Risks.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

BountyBench: Dollar Impact of AI Agent Attackers and Defenders on Real-World Cybersecurity Systems· slideslive

Taxonomy

TopicsBlockchain Technology Applications and Security · Network Security and Intrusion Detection

MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Multi-Head Attention · Dense Connections · Softmax · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Residual Connection · Byte Pair Encoding