Building Better Environments for Autonomous Cyber Defence

Chris Hicks; Elizabeth Bates; Shae McFadden; Isaac Symes Thompson; Myles Foley; Ed Chapman; Nickolas Espinosa Dice; Ankita Samaddar; Joshua Sylvester; Himanshu Neema; Nicholas Butts; Nate Foster; Ahmad Ridley; Zoe M; Paul Jones

arXiv:2604.08805·cs.CR·April 13, 2026

Building Better Environments for Autonomous Cyber Defence

Chris Hicks, Elizabeth Bates, Shae McFadden, Isaac Symes Thompson, Myles Foley, Ed Chapman, Nickolas Espinosa Dice, Ankita Samaddar, Joshua Sylvester, Himanshu Neema, Nicholas Butts, Nate Foster, Ahmad Ridley, Zoe M, Paul Jones

PDF

TL;DR

This paper presents a framework and guidelines for designing effective reinforcement learning environments for autonomous cyber defense, based on expert insights from a multidisciplinary workshop.

Contribution

It introduces a decomposition framework for RL cyber environments and offers best practice guidelines for environment development and agent evaluation.

Findings

01

A comprehensive framework for RL environment design in cyber defense.

02

Best practices for building and evaluating RL agents in network security scenarios.

03

Insights from multidisciplinary experts on common hazards and domain knowledge.

Abstract

In November 2025, the authors ran a workshop on the topic of what makes a good reinforcement learning (RL) environment for autonomous cyber defence (ACD). This paper details the knowledge shared by participants both during the workshop and shortly afterwards by contributing herein. The workshop participants come from academia, industry, and government, and have extensive hands-on experience designing and working with RL and cyber environments. While there is now a sizeable body of literature describing work in RL for ACD, there is nevertheless a great deal of tradecraft, domain knowledge, and common hazards which are not detailed comprehensively in a single resource. With a specific focus on building better environments to train and evaluate autonomous RL agents in network defence scenarios, including government and critical infrastructure networks, the contributions of this work are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.