Contextual Online Pricing with (Biased) Offline Data
Yixuan Zhang, Ruihao Zhu, Qiaomin Xie

TL;DR
This paper develops optimal online pricing algorithms that leverage biased offline data, providing tight regret bounds and extending to stochastic linear bandits, addressing a key challenge in data-driven pricing strategies.
Contribution
It introduces instance-dependent regret bounds for contextual pricing with biased offline data and proposes algorithms that achieve these bounds, including a robust variant for unknown bias.
Findings
Optimal regret bounds for scalar price elasticity case.
Extension of bounds to general price elasticity.
Robust algorithm for unknown bias scenarios.
Abstract
We study contextual online pricing with biased offline data. For the scalar price elasticity case, we identify the instance-dependent quantity that measures how far the offline data lies from the (unknown) online optimum. We show that the time length , bias bound , size and dispersion of the offline data, and jointly determine the statistical complexity. An Optimism-in-the-Face-of-Uncertainty (OFU) policy achieves a minimax-optimal, instance-dependent regret bound . For general price elasticity, we establish a worst-case, minimax-optimal rate and provide a generalized OFU algorithm that attains it. When the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
