Learning is Revelation in Disguise: Improved Regret and Equivalence Results for Dynamic Pricing
Shiliang Zuo

TL;DR
This paper improves regret bounds for dynamic pricing with menu mechanisms and reveals an equivalence between adaptive learning and mechanism design, showing that learning is essentially a form of revelation.
Contribution
It introduces improved regret bounds for menu mechanisms and establishes a fundamental equivalence between adaptive learning algorithms and direct revelation mechanisms.
Findings
Menu mechanisms achieve $O(T_eta \, \log T_eta)$ regret, surpassing previous bounds.
Adaptive learning and revelation mechanisms have identical optimal regret, unifying two paradigms.
Learning can be viewed as a form of revelation in mechanism design.
Abstract
We study dynamic pricing where a seller repeatedly interacts with a strategic, non-myopic buyer who has a fixed private valuation and discounts future utility. Prior work focused exclusively on posted-price mechanisms, which only extract binary accept/reject signals. For our first result, we show that menu mechanisms-offering allocation-payment contracts are able to achieve regret, where is the buyer's effective discounted time horizon, improving all prior bounds. Our second contribution is more conceptual in nature. The problem of dynamic pricing sits at the intersection of two paradigms: adaptive learning in computer science / machine learning and revelation-principle-based mechanism design in economics-yet their relationship has remained unclear. We establish a fundamental equivalence: indirect learning mechanisms and direct revelation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
