Online Learning for Dynamic Vickrey-Clarke-Groves Mechanism in Unknown Environments
Vincent Leon, S. Rasoul Etesami

TL;DR
This paper develops an online reinforcement learning approach for implementing a dynamic VCG mechanism in unknown, evolving auction environments, ensuring desirable economic properties with performance guarantees.
Contribution
It extends the VCG mechanism to a dynamic, unknown environment and proposes a reinforcement learning algorithm to learn and implement this mechanism online.
Findings
The proposed algorithm achieves near-optimal performance with regret guarantees.
The dynamic VCG mechanism maintains efficiency, truthfulness, and individual rationality approximately.
Experimental results demonstrate effective learning in unknown auction environments.
Abstract
We consider the problem of online dynamic mechanism design for sequential auctions in unknown environments, where the underlying market and, thus, the bidders' values vary over time as interactions between the seller and the bidders progress. We model the sequential auctions as an infinite-horizon average-reward Markov decision process (MDP). In each round, the seller determines an allocation and sets a payment for each bidder, while each bidder receives a private reward and submits a sealed bid to the seller. The state, which represents the underlying market, evolves according to an unknown transition kernel and the seller's allocation policy without episodic resets. We first extend the Vickrey-Clarke-Groves (VCG) mechanism to sequential auctions, thereby obtaining a dynamic counterpart that preserves the desired properties: efficiency, truthfulness, and individual rationality. We then…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuction Theory and Applications · Advanced Bandit Algorithms Research · Blockchain Technology Applications and Security
MethodsFocus
