An Empirical Game-Theoretic Analysis of Autonomous Cyber-Defence Agents
Gregory Palmer, Luke Swaby, Daniel J.B. Harrold, Matthew Stewart, Alex, Hiles, Chris Willis, Ian Miles, Sara Farmer

TL;DR
This paper employs an empirical game-theoretic approach to analyze deep reinforcement learning-based autonomous cyber-defense agents, introducing reward shaping and multiple response oracles to improve evaluation and robustness.
Contribution
It introduces a potential-based reward shaping method and extends the double oracle algorithm to multiple response oracles for comprehensive evaluation of ACD-DRL approaches.
Findings
Reward shaping accelerates the double oracle process.
Extended framework enables holistic evaluation of multiple ACD approaches.
Empirical analysis demonstrates improved robustness of ACD agents.
Abstract
The recent rise in increasingly sophisticated cyber-attacks raises the need for robust and resilient autonomous cyber-defence (ACD) agents. Given the variety of cyber-attack tactics, techniques and procedures (TTPs) employed, learning approaches that can return generalisable policies are desirable. Meanwhile, the assurance of ACD agents remains an open challenge. We address both challenges via an empirical game-theoretic analysis of deep reinforcement learning (DRL) approaches for ACD using the principled double oracle (DO) algorithm. This algorithm relies on adversaries iteratively learning (approximate) best responses against each others' policies; a computationally expensive endeavour for autonomous cyber operations agents. In this work we introduce and evaluate a theoretically-sound, potential-based reward shaping approach to expedite this process. In addition, given the increasing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOpinion Dynamics and Social Influence · Network Security and Intrusion Detection
