Adaptive Learning in Two-Player Stackelberg Games with Application to   Network Security

Guosong Yang; Radha Poovendran; Jo\~ao P. Hespanha

arXiv:2101.03253·cs.GT·January 12, 2021

Adaptive Learning in Two-Player Stackelberg Games with Application to Network Security

Guosong Yang, Radha Poovendran, Jo\~ao P. Hespanha

PDF

Open Access

TL;DR

This paper introduces an adaptive learning method for two-player Stackelberg games with incomplete information, enabling the leader to estimate unknown parameters and optimize strategies, with proven convergence guarantees and applications to network security.

Contribution

It develops a novel adaptive control-based algorithm for parameter estimation and strategy optimization in Stackelberg games with incomplete info, ensuring finite-time convergence.

Findings

01

Leader's cost prediction becomes indistinguishable from actual cost in finite time.

02

Parameter estimation error can be bounded by an arbitrarily small threshold.

03

Algorithm demonstrates effective convergence in network security simulations.

Abstract

We study a two-player Stackelberg game with incomplete information such that the follower's strategy belongs to a known family of parameterized functions with an unknown parameter vector. We design an adaptive learning approach to simultaneously estimate the unknown parameter and minimize the leader's cost, based on adaptive control techniques and hysteresis switching. Our approach guarantees that the leader's cost predicted using the parameter estimate becomes indistinguishable from its actual cost in finite time, up to a preselected, arbitrarily small error threshold. Also, the first-order necessary condition for optimality holds asymptotically for the predicted cost. Additionally, if a persistent excitation condition holds, then the parameter estimation error becomes bounded by a preselected, arbitrarily small threshold in finite time as well. For the case where there is a mismatch…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMathematical and Theoretical Epidemiology and Ecology Models · Distributed Control Multi-Agent Systems · Markov Chains and Monte Carlo Methods