Evolving Reinforcement Learning Environment to Minimize Learner's   Achievable Reward: An Application on Hardening Active Directory Systems

Diksha Goel; Aneta Neumann; Frank Neumann; Hung Nguyen; Mingyu Guo

arXiv:2304.03998·cs.NE·April 11, 2023·1 cites

Evolving Reinforcement Learning Environment to Minimize Learner's Achievable Reward: An Application on Hardening Active Directory Systems

Diksha Goel, Aneta Neumann, Frank Neumann, Hung Nguyen, Mingyu Guo

PDF

Open Access

TL;DR

This paper introduces an evolutionary reinforcement learning approach to optimize environment configurations in a Stackelberg game, specifically applied to securing Active Directory systems by minimizing attacker rewards.

Contribution

It proposes combining evolutionary diversity optimization with reinforcement learning to generate diverse, effective environment configurations for defense, improving scalability and defensive strategies.

Findings

01

Outperforms existing methods in generating defensive configurations.

02

Enhances training efficiency through environment diversity.

03

Successfully applied to Active Directory security scenarios.

Abstract

We study a Stackelberg game between one attacker and one defender in a configurable environment. The defender picks a specific environment configuration. The attacker observes the configuration and attacks via Reinforcement Learning (RL trained against the observed environment). The defender's goal is to find the environment with minimum achievable reward for the attacker. We apply Evolutionary Diversity Optimization (EDO) to generate diverse population of environments for training. Environments with clearly high rewards are killed off and replaced by new offsprings to avoid wasting training time. Diversity not only improves training quality but also fits well with our RL scenario: RL agents tend to improve gradually, so a slightly worse environment earlier on may become better later. We demonstrate the effectiveness of our approach by focusing on a specific application, Active…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInformation and Cyber Security · Network Security and Intrusion Detection · Peer-to-Peer Network Technologies