NSGZero: Efficiently Learning Non-Exploitable Policy in Large-Scale   Network Security Games with Neural Monte Carlo Tree Search

Wanqi Xue; Bo An; Chai Kiat Yeo

arXiv:2201.07224·cs.CR·January 20, 2022

NSGZero: Efficiently Learning Non-Exploitable Policy in Large-Scale Network Security Games with Neural Monte Carlo Tree Search

Wanqi Xue, Bo An, Chai Kiat Yeo

PDF

Open Access 1 Video

TL;DR

NSGZero introduces a neural Monte Carlo Tree Search approach for large-scale network security games, significantly improving data efficiency and scalability in learning non-exploitable policies.

Contribution

It presents a novel deep learning method with neural MCTS and decentralized control for efficient, scalable policy learning in large network security games.

Findings

01

Outperforms state-of-the-art algorithms in data efficiency

02

Scales effectively to scenarios with many resources

03

Achieves better policies in large-scale NSGs

Abstract

How resources are deployed to secure critical targets in networks can be modelled by Network Security Games (NSGs). While recent advances in deep learning (DL) provide a powerful approach to dealing with large-scale NSGs, DL methods such as NSG-NFSP suffer from the problem of data inefficiency. Furthermore, due to centralized control, they cannot scale to scenarios with a large number of resources. In this paper, we propose a novel DL-based method, NSGZero, to learn a non-exploitable policy in NSGs. NSGZero improves data efficiency by performing planning with neural Monte Carlo Tree Search (MCTS). Our main contributions are threefold. First, we design deep neural networks (DNNs) to perform neural MCTS in NSGs. Second, we enable neural MCTS with decentralized control, making NSGZero applicable to NSGs with many resources. Third, we provide an efficient learning paradigm, to achieve joint…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

NSGZero: Efficiently Learning Non-Exploitable Policy in Large-Scale Network Security Games with Neural Monte Carlo Tree Search· underline

Taxonomy

TopicsNetwork Security and Intrusion Detection · Access Control and Trust · Information and Cyber Security