Adaptive Learning in Two-Player Stackelberg Games with Application to Network Security
Guosong Yang, Radha Poovendran, Jo\~ao P. Hespanha

TL;DR
This paper introduces an adaptive learning method for two-player Stackelberg games with incomplete information, enabling the leader to estimate unknown parameters and optimize strategies, with proven convergence guarantees and applications to network security.
Contribution
It develops a novel adaptive control-based algorithm for parameter estimation and strategy optimization in Stackelberg games with incomplete info, ensuring finite-time convergence.
Findings
Leader's cost prediction becomes indistinguishable from actual cost in finite time.
Parameter estimation error can be bounded by an arbitrarily small threshold.
Algorithm demonstrates effective convergence in network security simulations.
Abstract
We study a two-player Stackelberg game with incomplete information such that the follower's strategy belongs to a known family of parameterized functions with an unknown parameter vector. We design an adaptive learning approach to simultaneously estimate the unknown parameter and minimize the leader's cost, based on adaptive control techniques and hysteresis switching. Our approach guarantees that the leader's cost predicted using the parameter estimate becomes indistinguishable from its actual cost in finite time, up to a preselected, arbitrarily small error threshold. Also, the first-order necessary condition for optimality holds asymptotically for the predicted cost. Additionally, if a persistent excitation condition holds, then the parameter estimation error becomes bounded by a preselected, arbitrarily small threshold in finite time as well. For the case where there is a mismatch…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMathematical and Theoretical Epidemiology and Ecology Models · Distributed Control Multi-Agent Systems · Markov Chains and Monte Carlo Methods
