Logarithmic Regret in Feature-based Dynamic Pricing
Jianyu Xu, Yu-Xiang Wang (Computer Science Department, UC Santa, Barbara)

TL;DR
This paper introduces two algorithms for feature-based dynamic pricing that achieve optimal logarithmic regret bounds in both stochastic and adversarial settings, significantly improving upon previous results.
Contribution
It provides the first algorithms with optimal $O(d\,\log T)$ regret bounds for feature-based dynamic pricing in both stochastic and adversarial models.
Findings
Achieved $O(d\log T)$ regret bounds for stochastic and adversarial settings.
Improved previous regret bounds from polynomial to logarithmic scale.
Proved an $\\Omega(\sqrt{T})$ lower bound showing the benefit of demand curve knowledge.
Abstract
Feature-based dynamic pricing is an increasingly popular model of setting prices for highly differentiated products with applications in digital marketing, online sales, real estate and so on. The problem was formally studied as an online learning problem [Javanmard & Nazerzadeh, 2019] where a seller needs to propose prices on the fly for a sequence of products based on their features while having a small regret relative to the best -- "omniscient" -- pricing strategy she could have come up with in hindsight. We revisit this problem and provide two algorithms (EMLP and ONSP) for stochastic and adversarial feature settings, respectively, and prove the optimal regret bounds for both. In comparison, the best existing results are and respectively, with being the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Auction Theory and Applications
