Learning in Stackelberg Games with Non-myopic Agents
Nika Haghtalab, Thodoris Lykouris, Sloan Nietert, Alexander Wei

TL;DR
This paper introduces a framework for learning in Stackelberg games with non-myopic agents, addressing strategic manipulation and improving query complexity in security games.
Contribution
It provides a reduction to robust bandit optimization for non-myopic agents and develops algorithms that are robust to misspecifications, improving learning efficiency.
Findings
Reduced query complexity in security games from O(n^3) to near O(n).
Developed robust algorithms for various Stackelberg game settings.
Characterized the impact of misspecifications in near-best responses.
Abstract
We study Stackelberg games where a principal repeatedly interacts with a non-myopic long-lived agent, without knowing the agent's payoff function. Although learning in Stackelberg games is well-understood when the agent is myopic, dealing with non-myopic agents poses additional complications. In particular, non-myopic agents may strategize and select actions that are inferior in the present in order to mislead the principal's learning algorithm and obtain better outcomes in the future. We provide a general framework that reduces learning in presence of non-myopic agents to robust bandit optimization in the presence of myopic agents. Through the design and analysis of minimally reactive bandit algorithms, our reduction trades off the statistical efficiency of the principal's learning algorithm against its effectiveness in inducing near-best-responses. We apply this framework to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Auction Theory and Applications · Game Theory and Applications
