Contextual Online Pricing with (Biased) Offline Data

Yixuan Zhang; Ruihao Zhu; Qiaomin Xie

arXiv:2507.02762·cs.LG·July 4, 2025

Contextual Online Pricing with (Biased) Offline Data

Yixuan Zhang, Ruihao Zhu, Qiaomin Xie

PDF

TL;DR

This paper develops optimal online pricing algorithms that leverage biased offline data, providing tight regret bounds and extending to stochastic linear bandits, addressing a key challenge in data-driven pricing strategies.

Contribution

It introduces instance-dependent regret bounds for contextual pricing with biased offline data and proposes algorithms that achieve these bounds, including a robust variant for unknown bias.

Findings

01

Optimal regret bounds for scalar price elasticity case.

02

Extension of bounds to general price elasticity.

03

Robust algorithm for unknown bias scenarios.

Abstract

We study contextual online pricing with biased offline data. For the scalar price elasticity case, we identify the instance-dependent quantity $δ^{2}$ that measures how far the offline data lies from the (unknown) online optimum. We show that the time length $T$ , bias bound $V$ , size $N$ and dispersion $λ_{m i n} (\hat{Σ})$ of the offline data, and $δ^{2}$ jointly determine the statistical complexity. An Optimism-in-the-Face-of-Uncertainty (OFU) policy achieves a minimax-optimal, instance-dependent regret bound $\tilde{O} (d T \land (V^{2} T + \frac{d T}{λ _{m i n} ( Σ ^ ) + ( N \land T ) δ ^{2}}))$ . For general price elasticity, we establish a worst-case, minimax-optimal rate $\tilde{O} (d T \land (V^{2} T + \frac{d T}{λ _{m i n} ( Σ ^ )}))$ and provide a generalized OFU algorithm that attains it. When the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.