Stackelberg POMDP: A Reinforcement Learning Approach for Economic Design

Gianluca Brero; Alon Eden; Darshan Chakrabarti; Matthias Gerstgrasser; Amy Greenwald; Vincent Li; David C. Parkes

arXiv:2210.03852 · cs.GT · July 22, 2024 · 1 citation

TL;DR

This paper presents a reinforcement learning framework for economic design modeled as a Stackelberg game, formulating the leader's problem as a POMDP and demonstrating its effectiveness through complex scenarios and convergence analysis.

Contribution

It introduces the Stackelberg POMDP framework, connecting POMDP solutions with Stackelberg game strategies and applying it to economic design with strategic followers.

Findings

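01. The optimal leader strategy in the Stackelberg game corresponds to the optimal policy of the Stackelberg POMDP under a limited set of policies.

02. Ablation studies demonstrate the effectiveness of the training framework.

03. Convergence to a Bayesian coarse-correlated equilibrium is proven.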

Abstract

We introduce a reinforcement learning framework for economic design where the interaction between the environment designer and the participants is modeled as a Stackelberg game. In this game, the designer (leader) sets up the rules of the economic system, while the participants (followers) respond strategically. We integrate algorithms for determining followers' response strategies into the leader's learning environment, providing a formulation of the leader's learning problem as a POMDP that we call the Stackelberg POMDP. We prove that the optimal leader's strategy in the Stackelberg game is the optimal policy in our Stackelberg POMDP under a limited set of possible policies, establishing a connection between solving POMDPs and Stackelberg games. We solve our POMDP under a limited set of policy options via the centralized training with decentralized execution framework. For the…
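
As a rough illustration of the leader-follower structure described in the abstract, the toy Python sketch below lets a leader commit to a posted price while each follower best-responds by buying only if its value covers that price. The posted-price setting, the function names, and the numbers are illustrative assumptions, not taken from the paper, and the sketch searches exhaustively over leader actions rather than using the Stackelberg POMDP or reinforcement learning.

```python
from typing import Sequence


def follower_best_response(price: float, value: float) -> bool:
    """Hypothetical follower: buys exactly when its value covers the price."""
    return value >= price


def leader_payoff(price: float, values: Sequence[float]) -> float:
    """Average revenue when every follower type best-responds to `price`."""
    sales = sum(1 for v in values if follower_best_response(price, v))
    return price * sales / len(values)


def best_leader_price(prices: Sequence[float], values: Sequence[float]) -> float:
    """Leader commits first, anticipating the followers' best responses."""
    return max(prices, key=lambda p: leader_payoff(p, values))


# Assumed toy data: three equally likely follower values and a price grid.
values = [1.0, 2.0, 3.0]
prices = [1.0, 2.0, 3.0]

best = best_leader_price(prices, values)
print(best)  # 2.0 (revenue 4/3, versus 1.0 at prices 1.0 and 3.0)
```

The point of the sketch is only that the followers' response rule sits inside the leader's objective: the leader evaluates each commitment by simulating how followers react to it.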

Code & Models

Repositories

glcbrero/stackelbergpomdp (Official)

Taxonomy

Topics: Auction Theory and Applications · Experimental Behavioral Economics Studies · Decision-Making and Behavioral Economics