Active Algorithms For Preference Learning Problems with Multiple Populations
Aniruddha Bhargava, Ravi Ganti, Robert Nowak

TL;DR
This paper introduces active algorithms for preference learning across multiple populations, enabling adaptive pairwise comparisons with theoretical guarantees and experimental validation.
Contribution
It presents novel adaptive algorithms with provable sample complexity for preference learning in heterogeneous populations, including new Nyström-like methods.
Findings
Algorithms are computationally efficient.
Sample complexity guarantees are established for noiseless and noisy cases.
Experimental results demonstrate effectiveness.
Abstract
In this paper we model the problem of learning preferences of a population as an active learning problem. We propose an algorithm can adaptively choose pairs of items to show to users coming from a heterogeneous population, and use the obtained reward to decide which pair of items to show next. We provide computationally efficient algorithms with provable sample complexity guarantees for this problem in both the noiseless and noisy cases. In the process of establishing sample complexity guarantees for our algorithms, we establish new results using a Nystr{\"o}m-like method which can be of independent interest. We supplement our theoretical results with experimental comparisons.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Metaheuristic Optimization Algorithms Research
