Meta Dynamic Pricing: Transfer Learning Across Experiments

Hamsa Bastani; David Simchi-Levi; Ruihao Zhu

arXiv:1902.10918·cs.LG·January 7, 2021·24 cites

Meta Dynamic Pricing: Transfer Learning Across Experiments

Hamsa Bastani, David Simchi-Levi, Ruihao Zhu

PDF

Open Access

TL;DR

This paper introduces a meta learning approach for dynamic pricing across multiple related products, efficiently learning shared demand structures to improve pricing strategies and reduce regret in experiment-rich environments.

Contribution

It proposes a novel meta dynamic pricing algorithm that learns a shared prior online, balancing exploration and exploitation, with a new prior alignment technique for analyzing regret with mis-specified priors.

Findings

01

Meta regret grows sublinearly with number of products N.

02

Algorithm significantly speeds up learning compared to prior-independent methods.

03

Demonstrated effectiveness on synthetic and real auto loan data.

Abstract

We study the problem of learning shared structure \emph{across} a sequence of dynamic pricing experiments for related products. We consider a practical formulation where the unknown demand parameters for each product come from an unknown distribution (prior) that is shared across products. We then propose a meta dynamic pricing algorithm that learns this prior online while solving a sequence of Thompson sampling pricing experiments (each with horizon $T$ ) for $N$ different products. Our algorithm addresses two challenges: (i) balancing the need to learn the prior (\emph{meta-exploration}) with the need to leverage the estimated prior to achieve good performance (\emph{meta-exploitation}), and (ii) accounting for uncertainty in the estimated prior by appropriately "widening" the estimated prior as a function of its estimation error. We introduce a novel prior alignment technique to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Auction Theory and Applications · Machine Learning and Algorithms