Multi-agent Reinforcement Learning in Bayesian Stackelberg Markov Games   for Adaptive Moving Target Defense

Sailik Sengupta; Subbarao Kambhampati

arXiv:2007.10457·cs.GT·July 22, 2020·22 cites

Multi-agent Reinforcement Learning in Bayesian Stackelberg Markov Games for Adaptive Moving Target Defense

Sailik Sengupta, Subbarao Kambhampati

PDF

Open Access

TL;DR

This paper introduces a game-theoretic framework and a learning algorithm for adaptive moving target defense in cybersecurity, effectively modeling uncertainty about attackers and improving defense strategies in sequential, incomplete-information scenarios.

Contribution

The paper proposes Bayesian Stackelberg Markov Games and a Bayesian Q-learning approach to optimize defense strategies under uncertainty, addressing limitations of prior models.

Findings

01

The approach converges to a Strong Stackelberg Equilibrium.

02

It enhances web-application security through improved MTD strategies.

03

The method works without prior reward or transition knowledge.

Abstract

The field of cybersecurity has mostly been a cat-and-mouse game with the discovery of new attacks leading the way. To take away an attacker's advantage of reconnaissance, researchers have proposed proactive defense methods such as Moving Target Defense (MTD). To find good movement strategies, researchers have modeled MTD as leader-follower games between the defender and a cyber-adversary. We argue that existing models are inadequate in sequential settings when there is incomplete information about a rational adversary and yield sub-optimal movement strategies. Further, while there exists an array of work on learning defense policies in sequential settings for cyber-security, they are either unpopular due to scalability issues arising out of incomplete information or tend to ignore the strategic nature of the adversary simplifying the scenario to use single-agent reinforcement learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNetwork Security and Intrusion Detection · Information and Cyber Security · Infrastructure Resilience and Vulnerability Analysis

MethodsStochastic Steady-state Embedding · Q-Learning