Learning to Communicate in Multi-Agent Reinforcement Learning for Autonomous Cyber Defence

Faizan Contractor; Li Li; Ranwa Al Mallah

arXiv:2507.14658·cs.MA·July 22, 2025

Learning to Communicate in Multi-Agent Reinforcement Learning for Autonomous Cyber Defence

Faizan Contractor, Li Li, Ranwa Al Mallah

PDF

TL;DR

This paper introduces a multi-agent reinforcement learning framework where autonomous cyber defense agents learn to communicate and coordinate effectively against cyber threats, improving decision-making in complex, partially observable environments.

Contribution

It presents a novel game-based training approach using Differentiable Inter Agent Learning for cyber defense, enabling agents to learn both tactical policies and minimal communication messages.

Findings

01

Agents develop human-like incident response tactics

02

Communication improves coordination and threat mitigation

03

Minimal communication messages are learned alongside defense policies

Abstract

Popular methods in cooperative Multi-Agent Reinforcement Learning with partially observable environments typically allow agents to act independently during execution, which may limit the coordinated effect of the trained policies. However, by sharing information such as known or suspected ongoing threats, effective communication can lead to improved decision-making in the cyber battle space. We propose a game design where defender agents learn to communicate and defend against imminent cyber threats by playing training games in the Cyber Operations Research Gym, using the Differentiable Inter Agent Learning algorithm adapted to the cyber operational environment. The tactical policies learned by these autonomous agents are akin to those of human experts during incident responses to avert cyber threats. In addition, the agents simultaneously learn minimal cost communication messages while…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.