Evaluation of Reinforcement Learning for Autonomous Penetration Testing   using A3C, Q-learning and DQN

Norman Becker; Daniel Reti; Evridiki V. Ntagiou; Marcus Wallum; Hans; D. Schotten

arXiv:2407.15656·cs.CR·July 23, 2024·3 cites

Evaluation of Reinforcement Learning for Autonomous Penetration Testing using A3C, Q-learning and DQN

Norman Becker, Daniel Reti, Evridiki V. Ntagiou, Marcus Wallum, Hans, D. Schotten

PDF

Open Access

TL;DR

This paper evaluates reinforcement learning algorithms, specifically A3C, Q-learning, and DQN, for automating penetration testing tasks within a simulated environment, demonstrating A3C's superior performance and generalization capabilities.

Contribution

It introduces the application of RL algorithms to penetration testing scenarios and shows A3C's effectiveness in solving security challenges with fewer actions than traditional methods.

Findings

01

A3C successfully solved all tested scenarios.

02

A3C required fewer actions than baseline automated testing.

03

Hyperparameter tuning improved RL agent performance.

Abstract

Penetration testing is the process of searching for security weaknesses by simulating an attack. It is usually performed by experienced professionals, where scanning and attack tools are applied. By automating the execution of such tools, the need for human interaction and decision-making could be reduced. In this work, a Network Attack Simulator (NASim) was used as an environment to train reinforcement learning agents to solve three predefined security scenarios. These scenarios cover techniques of exploitation, post-exploitation and wiretapping. A large hyperparameter grid search was performed to find the best hyperparameter combinations. The algorithms Q-learning, DQN and A3C were used, whereby A3C was able to solve all scenarios and achieve generalization. In addition, A3C could solve these scenarios with fewer actions than the baseline automated penetration testing. Although the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Software Testing and Debugging Techniques · Advanced Malware Detection Techniques

MethodsDense Connections · Q-Learning · Deep Q-Network · Entropy Regularization · Convolution · Softmax · A3C