D-CIPHER: Dynamic Collaborative Intelligent Multi-Agent System with Planner and Heterogeneous Executors for Offensive Security

Meet Udeshi; Minghao Shao; Haoran Xi; Nanda Rani; Kimberly Milner; Venkata Sai Charan Putrevu; Brendan Dolan-Gavitt; Sandeep Kumar Shukla; Prashanth Krishnamurthy; Farshad Khorrami; Ramesh Karri; Muhammad Shafique

arXiv:2502.10931·cs.AI·May 13, 2025

D-CIPHER: Dynamic Collaborative Intelligent Multi-Agent System with Planner and Heterogeneous Executors for Offensive Security

Meet Udeshi, Minghao Shao, Haoran Xi, Nanda Rani, Kimberly Milner, Venkata Sai Charan Putrevu, Brendan Dolan-Gavitt, Sandeep Kumar Shukla, Prashanth Krishnamurthy, Farshad Khorrami, Ramesh Karri, Muhammad Shafique

PDF

Open Access 1 Repo

TL;DR

D-CIPHER introduces a multi-agent framework with specialized roles and dynamic feedback for improved autonomous cybersecurity challenge solving, outperforming previous single-agent approaches on multiple benchmarks.

Contribution

The paper presents a novel multi-agent system with a planner and heterogeneous executors, along with an auto-prompter, to enhance LLM-based cybersecurity task-solving capabilities.

Findings

01

Achieves state-of-the-art performance on multiple CTF benchmarks.

02

Solves 65% more ATT&CK techniques than previous methods.

03

Outperforms prior approaches by 2.5% to 8.5% on key benchmarks.

Abstract

Large Language Models (LLMs) have been used in cybersecurity such as autonomous security analysis or penetration testing. Capture the Flag (CTF) challenges serve as benchmarks to assess automated task-planning abilities of LLM agents for cybersecurity. Early attempts to apply LLMs for solving CTF challenges used single-agent systems, where feedback was restricted to a single reasoning-action loop. This approach was inadequate for complex CTF tasks. Inspired by real-world CTF competitions, where teams of experts collaborate, we introduce the D-CIPHER LLM multi-agent framework for collaborative CTF solving. D-CIPHER integrates agents with distinct roles with dynamic feedback loops to enhance reasoning on complex tasks. It introduces the Planner-Executor agent system, consisting of a Planner agent for overall problem-solving along with multiple heterogeneous Executor agents for individual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nyu-llm-ctf/nyuctf_agents
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNetwork Security and Intrusion Detection · Multi-Agent Systems and Negotiation · Advanced Malware Detection Techniques