Optimal Exploration of New Products under Assortment Decisions
Jackie Baek, Atanas Dinev, Thodoris Lykouris

TL;DR
This paper analyzes optimal strategies for online learning and assortment decisions for new products under capacity constraints, balancing exploration costs and social learning effects.
Contribution
It characterizes optimal assortment strategies for exploring new products, revealing when to bundle with incumbents and how to manage multiple new products simultaneously.
Findings
Pairing new products with top incumbents is always optimal.
The optimal number of new products to explore simultaneously depends on their potential, not individual purchase probabilities.
Standard bandit algorithms like UCB and Thompson Sampling are ineffective in this setting.
Abstract
We study online learning for new products on a platform that makes capacity-constrained assortment decisions on which products to offer. For a newly listed product, its quality is initially unknown, and quality information propagates through social learning: when a customer purchases a new product and leaves a review, its quality is revealed to both the platform and future customers. Since reviews require purchases, the platform must feature new products in the assortment ("explore") to generate reviews to learn about new products. Such exploration is costly because customer demand for new products is lower than for incumbent products. We characterize the optimal assortments for exploration to minimize regret, addressing two questions. (1) Should the platform offer a new product alone or alongside incumbent products? The former maximizes the purchase probability of the new product but…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
