Co-RedTeam: Orchestrated Security Discovery and Exploitation with LLM Agents

Pengfei He; Ash Fox; Lesly Miculicich; Stefan Friedli; Daniel Fabian; Burak Gokturk; Jiliang Tang; Chen-Yu Lee; Tomas Pfister; Long T. Le

arXiv:2602.02164 · cs.LG · February 5, 2026

Open Access

TL;DR

Co-RedTeam is a multi-agent framework that enhances cybersecurity vulnerability discovery and exploitation by integrating real-time execution feedback, structured reasoning, and memory, significantly outperforming existing methods.

Contribution

It introduces a novel multi-agent system that mimics red-teaming workflows with execution-grounded reasoning, improving vulnerability detection and exploitation capabilities.

Findings

01. Achieves over 60% success rate in vulnerability exploitation.

02. Over 10% absolute improvement in vulnerability detection.

03. Execution feedback and memory are critical for robustness.

Abstract

Large language models (LLMs) have shown promise in assisting cybersecurity tasks, yet existing approaches struggle with automatic vulnerability discovery and exploitation due to limited interaction, weak execution grounding, and a lack of experience reuse. We propose Co-RedTeam, a security-aware multi-agent framework designed to mirror real-world red-teaming workflows by integrating security-domain knowledge, code-aware analysis, execution-grounded iterative reasoning, and long-term memory. Co-RedTeam decomposes vulnerability analysis into coordinated discovery and exploitation stages, enabling agents to plan, execute, validate, and refine actions based on real execution feedback while learning from prior trajectories. Extensive evaluations on challenging security benchmarks demonstrate that Co-RedTeam consistently outperforms strong baselines across diverse backbone models, achieving…
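The plan, execute, validate, and refine cycle described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: `Attempt`, `Memory`, `run_loop`, `toy_plan`, and `toy_execute` are hypothetical names introduced here, and the toy planner and executor stand in for the LLM agents and the real execution environment.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional, Tuple


@dataclass
class Attempt:
    action: str
    succeeded: bool
    feedback: str


@dataclass
class Memory:
    """Hypothetical long-term store of prior trajectories."""
    trajectories: List[List[Attempt]] = field(default_factory=list)

    def hints(self) -> List[str]:
        # Experience reuse: surface actions that succeeded in earlier runs.
        return [a.action for t in self.trajectories for a in t if a.succeeded]


def run_loop(
    plan: Callable[[Optional[str], List[str]], str],
    execute: Callable[[str], Tuple[bool, str]],
    memory: Memory,
    max_steps: int = 5,
) -> List[Attempt]:
    """Plan an action, execute it, validate the result, refine on feedback."""
    trajectory: List[Attempt] = []
    feedback: Optional[str] = None
    for _ in range(max_steps):
        action = plan(feedback, memory.hints())   # plan (or refine)
        succeeded, feedback = execute(action)     # execute and validate
        trajectory.append(Attempt(action, succeeded, feedback))
        if succeeded:
            break
    memory.trajectories.append(trajectory)        # keep for later runs
    return trajectory


# Toy stand-ins (assumptions): a real system would call an LLM and a sandbox.
def toy_plan(feedback: Optional[str], hints: List[str]) -> str:
    if hints:
        return hints[0]          # reuse an action that worked before
    return "probe-a" if feedback is None else "probe-b"


def toy_execute(action: str) -> Tuple[bool, str]:
    if action == "probe-b":
        return True, "ok"
    return False, f"{action} rejected"


if __name__ == "__main__":
    mem = Memory()
    first = run_loop(toy_plan, toy_execute, mem)
    second = run_loop(toy_plan, toy_execute, mem)
    print([a.action for a in first], [a.action for a in second])
```

In this toy setup the first run needs one failed attempt before it succeeds, while the second run reuses the stored successful action straight away, which is the kind of experience reuse the abstract attributes to long-term memory.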

Taxonomy

Topics: Information and Cyber Security · Adversarial Robustness in Machine Learning · Advanced Malware Detection Techniques