Efficient and Accurate Top-$K$ Recovery from Choice Data
Duc Nguyen

TL;DR
This paper introduces the choice-based Borda count algorithm for efficient and accurate top-$K$ item recovery from choice data, demonstrating optimal sample complexity and practical speed advantages over traditional methods.
Contribution
The paper proposes a novel choice-based Borda count algorithm that achieves optimal sample complexity for top-$K$ recovery under broad models and is faster and simpler than maximum likelihood methods.
Findings
The algorithm matches the top-$K$ estimates of MLE in the limit.
It is several orders of magnitude faster than existing ranking algorithms.
Experiments show competitive accuracy with improved efficiency.
Abstract
The intersection of learning to rank and choice modeling is an active area of research with applications in e-commerce, information retrieval and the social sciences. In some applications such as recommendation systems, the statistician is primarily interested in recovering the set of the top ranked items from a large pool of items as efficiently as possible using passively collected discrete choice data, i.e., the user picks one item from a set of multiple items. Motivated by this practical consideration, we propose the choice-based Borda count algorithm as a fast and accurate ranking algorithm for top -recovery i.e., correctly identifying all of the top items. We show that the choice-based Borda count algorithm has optimal sample complexity for top- recovery under a broad class of random utility models. We prove that in the limit, the choice-based Borda count algorithm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGame Theory and Voting Systems · Bayesian Modeling and Causal Inference · Economic and Environmental Valuation
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
