Dynamic Assortment Optimization with Changing Contextual Information
Xi Chen, Yining Wang, Yuan Zhou

TL;DR
This paper introduces a dynamic assortment optimization method that leverages changing product features and contextual information, employing a UCB-based policy with proven regret bounds and practical algorithms.
Contribution
It develops a novel UCB-based policy for non-stationary, feature-dependent choice models with theoretical regret guarantees and efficient heuristics.
Findings
Regret bound of O(d ext{T}) for the proposed policy.
Lower bound of mbda(d ext{T}/K) showing near-optimality.
Numerical studies demonstrate the policy's effectiveness.
Abstract
In this paper, we study the dynamic assortment optimization problem under a finite selling season of length . At each time period, the seller offers an arriving customer an assortment of substitutable products under a cardinality constraint, and the customer makes the purchase among offered products according to a discrete choice model. Most existing work associates each product with a real-valued fixed mean utility and assumes a multinomial logit choice (MNL) model. In many practical applications, feature/contexutal information of products is readily available. In this paper, we incorporate the feature information by assuming a linear relationship between the mean utility and the feature. In addition, we allow the feature information of products to change over time so that the underlying choice model can also be non-stationary. To solve the dynamic assortment optimization under this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Supply Chain and Inventory Management · Optimization and Search Problems
