Dynamic Pricing and Learning with Long-term Reference Effects
Shipra Agrawal, Wei Tang

TL;DR
This paper studies a dynamic pricing model with long-term reference effects, proposing a near-optimal markdown policy and an efficient online learning algorithm with optimal regret bounds for unknown demand parameters.
Contribution
It introduces a novel reference price mechanism based on average past prices and develops an optimal learning algorithm for unknown demand models in dynamic pricing.
Findings
Markdown policy is near-optimal under the proposed reference mechanism.
Efficient algorithm achieves $ ilde{O}( ootO{T})$ regret for linear demand models.
Detailed characterization of the near-optimal policy for linear demand.
Abstract
We consider a dynamic pricing problem where customer response to the current price is impacted by the customer price expectation, aka reference price. We study a simple and novel reference price mechanism where reference price is the average of the past prices offered by the seller. As opposed to the more commonly studied exponential smoothing mechanism, in our reference price mechanism the prices offered by seller have a longer term effect on the future customer expectations. We show that under this mechanism, a markdown policy is near-optimal irrespective of the parameters of the model. This matches the common intuition that a seller may be better off by starting with a higher price and then decreasing it, as the customers feel like they are getting bargains on items that are ordinarily more expensive. For linear demand models, we also provide a detailed characterization of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsConsumer Market Behavior and Pricing
