Deep hierarchical reinforcement agents for automated penetration testing
Khuong Tran (1), Ashlesha Akella (1), Maxwell Standen (2), Junae Kim, (2), David Bowman (2), Toby Richer (2), Chin-Teng Lin (1) ((1) Institution, One, (2) Institution Two)

TL;DR
This paper introduces HA-DRL, a hierarchical deep reinforcement learning architecture that improves the efficiency and stability of autonomous penetration testing by effectively managing large action spaces.
Contribution
The paper presents a novel hierarchical deep reinforcement learning architecture with algebraic action decomposition for automated penetration testing.
Findings
HA-DRL finds optimal attack policies faster.
HA-DRL demonstrates more stable learning compared to conventional deep Q-learning.
Effective handling of large discrete action spaces in cybersecurity simulations.
Abstract
Penetration testing the organised attack of a computer system in order to test existing defences has been used extensively to evaluate network security. This is a time consuming process and requires in-depth knowledge for the establishment of a strategy that resembles a real cyber-attack. This paper presents a novel deep reinforcement learning architecture with hierarchically structured agents called HA-DRL, which employs an algebraic action decomposition strategy to address the large discrete action space of an autonomous penetration testing simulator where the number of actions is exponentially increased with the complexity of the designed cybersecurity network. The proposed architecture is shown to find the optimal attacking policy faster and more stably than a conventional deep Q-learning agent which is commonly used as a method to apply artificial intelligence in automatic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Adversarial Robustness in Machine Learning · Software Testing and Debugging Techniques
MethodsQ-Learning
