Optimal Semiparametric Dynamic Pricing with Feature Diversity
Jinhang Chai, Yaqi Duan, Jianqing Fan, Kaizheng Wang

TL;DR
This paper introduces a stagewise greedy algorithm for semiparametric dynamic pricing that leverages feature diversity and endogenous samples to achieve optimal regret rates, improving upon prior methods.
Contribution
The authors develop a novel iterative pricing algorithm that refines demand estimation using local polynomial regression, achieving optimal regret bounds in semiparametric pricing models.
Findings
Regret scales as T^{max{1/2, 3/(2β+1)}} for linear utility classes.
The method attains the parametric √T rate when β ≥ 5/2.
Numerical experiments confirm the theoretical advantages and practical effectiveness.
Abstract
We study contextual dynamic pricing under a semiparametric demand model in which the purchase probability is , where captures mean utility as a function of product features and buyer covariates, and is an unknown market-noise distribution. Existing methods either incur suboptimal regret or rely on restrictive structural assumptions. We propose a stagewise greedy pricing algorithm that iteratively refines the estimate of via local polynomial regression while pricing greedily with current estimates. By exploiting feature diversity, the algorithm reuses endogenous samples collected during exploitation for nonparametric estimation, avoiding costly global random exploration used in prior work. We establish a general regret bound that applies to any estimator of the utility function, and derive explicit rates for linear, nonparametric…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
