Learning Communication Between Heterogeneous Agents in Multi-Agent Reinforcement Learning for Autonomous Cyber Defence

Alex Popa; Adrian Taylor; Ranwa Al Mallah

arXiv:2603.20279·cs.CR·March 24, 2026

Learning Communication Between Heterogeneous Agents in Multi-Agent Reinforcement Learning for Autonomous Cyber Defence

Alex Popa, Adrian Taylor, Ranwa Al Mallah

PDF

Open Access

TL;DR

This paper investigates how heterogeneous multi-agent reinforcement learning agents can communicate effectively to enhance cyber defense, demonstrating improved convergence speed and accuracy over existing algorithms in a simulated environment.

Contribution

It introduces the use of CommFormer for heterogeneous agents in cyber defense, showing significant performance improvements in a simulated network environment.

Findings

01

Agents with heterogeneous capabilities outperform others in convergence speed.

02

CommFormer agents reduce standard error by up to 38%.

03

Heterogeneous agents can be trained effectively for cyber security tasks.

Abstract

Reinforcement learning techniques are being explored as solutions to the threat of cyber attacks on enterprise networks. Recent research in the field of AI in cyber security has investigated the ability of homogeneous multi-agent reinforcement learning agents, capable of inter-agent communication, to respond to cyberattacks. This paper advances the study of learned communication in multi-agent systems by examining heterogeneous agent capabilities within a simulated network environment. To this end, we leverage CommFormer, a publicly available state-of-the-art communication algorithm, to train and evaluate agents within the Cyber Operations Research Gym (CybORG). Our results show that CommFormer agents with heterogeneous capabilities can outperform other algorithms deployed in the CybORG environment, by converging to an optimal policy up to four times faster while improving standard…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware-Defined Networks and 5G · Reinforcement Learning in Robotics · Network Security and Intrusion Detection