Learning Security Strategies through Game Play and Optimal Stopping
Kim Hammar, Rolf Stadler

TL;DR
This paper introduces a reinforcement learning approach to automated intrusion prevention by modeling attacker-defender interactions as an optimal stopping game, leading to effective defense strategies with threshold properties.
Contribution
It formulates intrusion prevention as an optimal stopping game and develops T-FP, a self-play algorithm that learns Nash equilibria, outperforming existing methods.
Findings
T-FP effectively learns Nash equilibria in security games.
Optimal strategies exhibit threshold properties.
The approach produces practical defense strategies for IT infrastructure.
Abstract
We study automated intrusion prevention using reinforcement learning. Following a novel approach, we formulate the interaction between an attacker and a defender as an optimal stopping game and let attack and defense strategies evolve through reinforcement learning and self-play. The game-theoretic perspective allows us to find defender strategies that are effective against dynamic attackers. The optimal stopping formulation gives us insight into the structure of optimal strategies, which we show to have threshold properties. To obtain the optimal defender strategies, we introduce T-FP, a fictitious self-play algorithm that learns Nash equilibria through stochastic approximation. We show that T-FP outperforms a state-of-the-art algorithm for our use case. Our overall method for learning and evaluating strategies includes two systems: a simulation system where defender strategies are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNetwork Security and Intrusion Detection · Information and Cyber Security · Smart Grid Security and Resilience
