Autonomous Penetration Testing using Reinforcement Learning
Jonathon Schwartz, Hanna Kurniawati

TL;DR
This paper explores using model-free reinforcement learning to automate penetration testing, demonstrating its potential to identify attack paths without relying on environment models, though scalability remains a challenge.
Contribution
It introduces a novel application of model-free RL for automated pentesting and develops a fast simulator to train and evaluate RL agents in this context.
Findings
RL agents found optimal attack paths in simulated environments
Tabular and neural network RL approaches worked for small networks
Scalability issues limit current methods to small network sizes
Abstract
Penetration testing (pentesting) involves performing a controlled attack on a computer system in order to assess it's security. Although an effective method for testing security, pentesting requires highly skilled practitioners and currently there is a growing shortage of skilled cyber security professionals. One avenue for alleviating this problem is automate the pentesting process using artificial intelligence techniques. Current approaches to automated pentesting have relied on model-based planning, however the cyber security landscape is rapidly changing making maintaining up-to-date models of exploits a challenge. This project investigated the application of model-free Reinforcement Learning (RL) to automated pentesting. Model-free RL has the key advantage over model-based planning of not requiring a model of the environment, instead learning the best policy through interaction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Adversarial Robustness in Machine Learning · Information and Cyber Security
MethodsQ-Learning
