An Optimal Policy for Dynamic Assortment Planning Under Uncapacitated Multinomial Logit Models
Xi Chen, Yining Wang, Yuan Zhou

TL;DR
This paper introduces an optimal, item-independent policy for dynamic assortment planning under uncapacitated MNL models, achieving minimal regret without assumptions on utilities or revenue separability.
Contribution
It develops a novel trisection-based policy with adaptive confidence bounds that attains the optimal $O( oot T)$ regret, independent of the number of products, extending unimodal bandit theory.
Findings
Achieves $O( oot T)$ regret bound, optimal for the problem.
Regret bound is independent of the number of products $N$.
Policy requires no assumptions on utility estimates or revenue separability.
Abstract
We study the dynamic assortment planning problem, where for each arriving customer, the seller offers an assortment of substitutable products and customer makes the purchase among offered products according to an uncapacitated multinomial logit (MNL) model. Since all the utility parameters of MNL are unknown, the seller needs to simultaneously learn customers' choice behavior and make dynamic decisions on assortments based on the current knowledge. The goal of the seller is to maximize the expected revenue, or equivalently, to minimize the expected regret. Although dynamic assortment planning problem has received an increasing attention in revenue management, most existing policies require the estimation of mean utility for each product and the final regret usually involves the number of products . The optimal regret of the dynamic assortment planning problem under the most basic and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Supply Chain and Inventory Management · Optimization and Search Problems
