Knowledge-Informed Auto-Penetration Testing Based on Reinforcement Learning with Reward Machine

Yuanliang Li; Hanzheng Dai; Jun Yan

arXiv:2405.15908 · cs.AI · May 28, 2024

TL;DR

This paper introduces DRLRM-PT, a reinforcement learning framework for automated penetration testing that incorporates domain knowledge via reward machines, improving training efficiency and effectiveness in lateral movement scenarios.

Contribution

The paper proposes a novel knowledge-informed AutoPT framework using reward machines to encode domain knowledge, enhancing RL training efficiency and interpretability in penetration testing.

Findings

01. DRLRM-PT outperforms knowledge-agnostic agents in training efficiency.

02. Detailed domain knowledge in RMs leads to better penetration testing performance.

03. The framework effectively guides RL in lateral movement scenarios using POMDPs.

Abstract

Automated penetration testing (AutoPT) based on reinforcement learning (RL) has proven its ability to improve the efficiency of vulnerability identification in information systems. However, RL-based PT encounters several challenges, including poor sampling efficiency, intricate reward specification, and limited interpretability. To address these issues, we propose a knowledge-informed AutoPT framework called DRLRM-PT, which leverages reward machines (RMs) to encode domain knowledge as guidelines for training a PT policy. In our study, we specifically focus on lateral movement as a PT case study and formulate it as a partially observable Markov decision process (POMDP) guided by RMs. We design two RMs based on the MITRE ATT&CK knowledge base for lateral movement. To solve the POMDP and optimize the PT policy, we employ the deep Q-learning algorithm with RM (DQRM). The experimental…
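
To make the idea of a reward machine concrete, the following is a minimal sketch of an RM as a finite-state machine that maps high-level events to rewards. It is not one of the two RMs designed in the paper: the state names (u0, u1, u2, u_goal), the event labels (found_credential, connected_new_node, gained_privilege), and the reward values are all hypothetical, chosen only to illustrate how staged domain knowledge could be encoded.

```python
from typing import Dict, Iterable, Set, Tuple

# (rm_state, event) -> (next_rm_state, reward)
Transitions = Dict[Tuple[str, str], Tuple[str, float]]


class RewardMachine:
    """A finite-state machine that emits a reward on each event-driven transition."""

    def __init__(self, transitions: Transitions, initial_state: str,
                 terminal_states: Set[str]) -> None:
        self.transitions = transitions
        self.initial_state = initial_state
        self.terminal_states = terminal_states
        self.state = initial_state

    def reset(self) -> str:
        self.state = self.initial_state
        return self.state

    def step(self, event: str) -> Tuple[str, float]:
        # Events with no matching transition leave the RM state unchanged
        # and yield zero reward.
        next_state, reward = self.transitions.get(
            (self.state, event), (self.state, 0.0)
        )
        self.state = next_state
        return next_state, reward

    def done(self) -> bool:
        return self.state in self.terminal_states


def build_example_rm() -> RewardMachine:
    # Hypothetical three-stage task; NOT taken from the paper.
    transitions: Transitions = {
        ("u0", "found_credential"): ("u1", 1.0),
        ("u1", "connected_new_node"): ("u2", 1.0),
        ("u2", "gained_privilege"): ("u_goal", 10.0),
    }
    return RewardMachine(transitions, "u0", {"u_goal"})


def run_events(events: Iterable[str]) -> Tuple[str, float]:
    """Feed a sequence of events to a fresh RM; return final state and total reward."""
    rm = build_example_rm()
    total = 0.0
    for event in events:
        _, reward = rm.step(event)
        total += reward
    return rm.state, total
```

In an RM-guided RL setup, the agent would typically condition its policy on both the environment observation and the current RM state, so that the same observation can be valued differently depending on which subtask is active. How DQRM does this in detail is described in the paper itself, not in this sketch.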

Taxonomy

Topics: Network Security and Intrusion Detection · Software Testing and Debugging Techniques · Advanced Malware Detection Techniques

Methods: Balanced Selection · Focus · Q-Learning