Learning Multinomial Logits in $O(n \log n)$ time
Flavio Chierichetti, Mirko Giacchini, Ravi Kumar, Silvio Lattanzi, Alessandro Panconesi, Erasmo Tani, Andrew Tomkins

TL;DR
This paper presents efficient algorithms for learning the weights of a Multinomial Logit model from query access, achieving near-optimal query complexity with both adaptive and non-adaptive methods, and establishes matching lower bounds.
Contribution
It introduces the first algorithms with provable guarantees for learning MNL weights efficiently, including adaptive and non-adaptive approaches, with tight bounds on query complexity.
Findings
Adaptive algorithm uses $O(n/\
Non-adaptive algorithm uses $O(n^2/\
Lower bounds match the upper bounds up to logarithmic factors.
Abstract
A Multinomial Logit (MNL) model is composed of a finite universe of items , each assigned a positive weight. A query specifies an admissible subset -- called a slate -- and the model chooses one item from that slate with probability proportional to its weight. This query model is also known as the Plackett-Luce model or conditional sampling oracle in the literature. Although MNLs have been studied extensively, a basic computational question remains open: given query access to slates, how efficiently can we learn weights so that, for every slate, the induced choice distribution is within total variation distance of the ground truth? This question is central to MNL learning and has direct implications for modern recommender system interfaces. We provide two algorithms for this task, one with adaptive queries and one with non-adaptive queries. Each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Data Quality and Management · Information Retrieval and Search Behavior
