Adversarial Reinforcement Learning under Partial Observability in Autonomous Computer Network Defence
Yi Han, David Hubczenko, Paul Montague, Olivier De Vel, Tamas Abraham,, Benjamin I.P. Rubinstein, Christopher Leckie, Tansu Alpcan, Sarah Erfani

TL;DR
This paper investigates the vulnerability of reinforcement learning agents in autonomous cyber defense to causative adversarial attacks under partial observability and proposes an inversion defense method to mitigate such attacks.
Contribution
It introduces the first study of causative attacks on RL in cyber defense under partial observability and proposes an effective inversion defense strategy.
Findings
Causative attacks can poison RL agents even with partial environment knowledge.
The inversion defense reduces attack impact without harming normal training.
Defense maintains effectiveness in black-box attack scenarios.
Abstract
Recent studies have demonstrated that reinforcement learning (RL) agents are susceptible to adversarial manipulation, similar to vulnerabilities previously demonstrated in the supervised learning setting. While most existing work studies the problem in the context of computer vision or console games, this paper focuses on reinforcement learning in autonomous cyber defence under partial observability. We demonstrate that under the black-box setting, where the attacker has no direct access to the target RL model, causative attacks---attacks that target the training process---can poison RL agents even if the attacker only has partial observability of the environment. In addition, we propose an inversion defence method that aims to apply the opposite perturbation to that which an attacker might use to generate their adversarial samples. Our experimental results illustrate that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Smart Grid Security and Resilience
