On Dynamic Pricing with Covariates
Hanzhao Wang, Kalyan Talluri, Xiaocheng Li

TL;DR
This paper develops algorithms for dynamic pricing with covariates that achieve near-optimal regret bounds without assuming i.i.d. covariates, extending the theory to more general and realistic settings.
Contribution
It introduces UCB and Thompson sampling algorithms for dynamic pricing with covariates that do not require i.i.d. assumptions and match lower bounds, improving upon prior work.
Findings
Achieve $O(d ext{ } \sqrt{T} ext{ }\log T)$ regret bounds without i.i.d. assumptions.
Regret bounds are independent of the covariance matrix's eigenvalues.
Demonstrated effectiveness through numerical experiments.
Abstract
We consider dynamic pricing with covariates under a generalized linear demand model: a seller can dynamically adjust the price of a product over a horizon of time periods, and at each time period , the demand of the product is jointly determined by the price and an observable covariate vector through a generalized linear model with unknown co-efficients. Most of the existing literature assumes the covariate vectors 's are independently and identically distributed (i.i.d.); the few papers that relax this assumption either sacrifice model generality or yield sub-optimal regret bounds. In this paper, we show that UCB and Thompson sampling-based pricing algorithms can achieve an regret upper bound without assuming any statistical structure on the covariates . Our upper bound on the regret matches the lower bound up to logarithmic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Supply Chain and Inventory Management · Decision-Making and Behavioral Economics
