Online Pricing with Offline Data: Phase Transition and Inverse Square Law
Jinzhi Bu, David Simchi-Levi, Yunzong Xu

TL;DR
This paper explores how pre-existing offline data influences online dynamic pricing, revealing phase transitions and an inverse-square law in regret bounds, and proposes an optimal learning algorithm.
Contribution
It characterizes the joint effects of offline data size, location, and dispersion on regret, introducing phase transition phenomena and the inverse-square law in this context.
Findings
Optimal regret depends on offline data size, location, and dispersion.
Phase transitions occur in regret rates based on offline data characteristics.
An algorithm achieving near-optimal regret is proposed.
Abstract
This paper investigates the impact of pre-existing offline data on online learning, in the context of dynamic pricing. We study a single-product dynamic pricing problem over a selling horizon of periods. The demand in each period is determined by the price of the product according to a linear demand model with unknown parameters. We assume that before the start of the selling horizon, the seller already has some pre-existing offline data. The offline data set contains samples, each of which is an input-output pair consisting of a historical price and an associated demand observation. The seller wants to utilize both the pre-existing offline data and the sequential online data to minimize the regret of the online learning process. We characterize the joint effect of the size, location and dispersion of the offline data on the optimal regret of the online learning process.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Optimization and Search Problems · Machine Learning and Algorithms
