TL;DR
This paper investigates how a follower can strategically deceive a learning algorithm in Stackelberg games to manipulate the leader's strategy, extending previous finite payoff results to more general settings.
Contribution
It introduces methods for followers to compute (near-)optimal payoffs in general, non-finite payoff scenarios, enhancing understanding of strategic deception in Stackelberg learning.
Findings
Followers can compute near-optimal payoffs in general scenarios.
Deception strategies extend beyond finite payoff spaces.
The results apply to various learning interaction models.
Abstract
Recent results in the ML community have revealed that learning algorithms used to compute the optimal strategy for the leader to commit to in a Stackelberg game, are susceptible to manipulation by the follower. Such a learning algorithm operates by querying the best responses or the payoffs of the follower, who consequently can deceive the algorithm by responding as if his payoffs were much different than what they actually are. For this strategic behavior to be successful, the main challenge faced by the follower is to pinpoint the payoffs that would make the learning algorithm compute a commitment so that best responding to it maximizes the follower's utility, according to his true payoffs. While this problem has been considered before, the related literature only focused on the simplified scenario in which the payoff space is finite, thus leaving the general version of the problem…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
