Deep hierarchical reinforcement agents for automated penetration testing

Khuong Tran (1); Ashlesha Akella (1); Maxwell Standen (2); Junae Kim; (2); David Bowman (2); Toby Richer (2); Chin-Teng Lin (1) ((1) Institution; One; (2) Institution Two)

arXiv:2109.06449·cs.AI·September 15, 2021·23 cites

Deep hierarchical reinforcement agents for automated penetration testing

Khuong Tran (1), Ashlesha Akella (1), Maxwell Standen (2), Junae Kim, (2), David Bowman (2), Toby Richer (2), Chin-Teng Lin (1) ((1) Institution, One, (2) Institution Two)

PDF

Open Access

TL;DR

This paper introduces HA-DRL, a hierarchical deep reinforcement learning architecture that improves the efficiency and stability of autonomous penetration testing by effectively managing large action spaces.

Contribution

The paper presents a novel hierarchical deep reinforcement learning architecture with algebraic action decomposition for automated penetration testing.

Findings

01

HA-DRL finds optimal attack policies faster.

02

HA-DRL demonstrates more stable learning compared to conventional deep Q-learning.

03

Effective handling of large discrete action spaces in cybersecurity simulations.

Abstract

Penetration testing the organised attack of a computer system in order to test existing defences has been used extensively to evaluate network security. This is a time consuming process and requires in-depth knowledge for the establishment of a strategy that resembles a real cyber-attack. This paper presents a novel deep reinforcement learning architecture with hierarchically structured agents called HA-DRL, which employs an algebraic action decomposition strategy to address the large discrete action space of an autonomous penetration testing simulator where the number of actions is exponentially increased with the complexity of the designed cybersecurity network. The proposed architecture is shown to find the optimal attacking policy faster and more stably than a conventional deep Q-learning agent which is commonly used as a method to apply artificial intelligence in automatic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Malware Detection Techniques · Adversarial Robustness in Machine Learning · Software Testing and Debugging Techniques

MethodsQ-Learning