TL;DR
This paper introduces an entity-based reinforcement learning approach using Transformers to improve the generalisation of autonomous cyber defence agents across diverse and changing network topologies, outperforming traditional MLP methods.
Contribution
The paper proposes a novel entity-based reinforcement learning framework with Transformer policies that enhances generalisation in autonomous cyber defence across varying network configurations.
Findings
Transformer-based policies outperform MLPs on diverse network topologies
The approach enables zero-shot generalisation to unseen network sizes
Significant improvement in policy robustness across dynamic environments
Abstract
A significant challenge for autonomous cyber defence is ensuring a defensive agent's ability to generalise across diverse network topologies and configurations. This capability is necessary for agents to remain effective when deployed in dynamically changing environments, such as an enterprise network where devices may frequently join and leave. Standard approaches to deep reinforcement learning, where policies are parameterised using a fixed-input multi-layer perceptron (MLP) expect fixed-size observation and action spaces. In autonomous cyber defence, this makes it hard to develop agents that generalise to environments with network topologies different from those trained on, as the number of nodes affects the natural size of the observation and action spaces. To overcome this limitation, we reframe the problem of autonomous network defence using entity-based reinforcement learning,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
