Active Algorithms For Preference Learning Problems with Multiple Populations

Aniruddha Bhargava; Ravi Ganti; Robert Nowak

arXiv:1603.04118·stat.ML·June 23, 2016·1 cites

Open Access

TL;DR

This paper introduces active algorithms for preference learning across multiple populations, enabling adaptive pairwise comparisons with theoretical guarantees and experimental validation.

Contribution

It presents novel adaptive algorithms with provable sample complexity for preference learning in heterogeneous populations, including new Nyström-like methods.

Findings

01

Algorithms are computationally efficient.

02

Sample complexity guarantees are established for noiseless and noisy cases.

03

Experimental results demonstrate effectiveness.

Abstract

In this paper, we model the problem of learning the preferences of a population as an active learning problem. We propose an algorithm that can adaptively choose pairs of items to show to users coming from a heterogeneous population, and use the obtained reward to decide which pair of items to show next. We provide computationally efficient algorithms with provable sample complexity guarantees for this problem in both the noiseless and noisy cases. In the process of establishing sample complexity guarantees for our algorithms, we establish new results using a Nyström-like method, which may be of independent interest. We supplement our theoretical results with experimental comparisons.
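For readers unfamiliar with the term, the following is a minimal sketch of the standard Nyström approximation, which reconstructs a low-rank positive semidefinite matrix from a small subset of its columns. It is not the authors' algorithm: the abstract does not specify their "Nyström-like" variant, so the function name `nystrom_approx`, the synthetic matrix, the sample size, and the `rcond` cutoff are all illustrative assumptions.

```python
import numpy as np

def nystrom_approx(K_cols, idx):
    """Standard Nystrom reconstruction (illustrative, not the paper's method).

    K_cols: (n, m) array holding m sampled columns of a PSD matrix K.
    idx:    the m column indices that were sampled.
    Returns the (n, n) approximation C @ pinv(W) @ C.T.
    """
    W = K_cols[idx, :]  # (m, m) block where sampled rows and columns intersect
    # rcond is an assumed cutoff that discards numerically zero singular values
    W_pinv = np.linalg.pinv(W, rcond=1e-10)
    return K_cols @ W_pinv @ K_cols.T

# Synthetic example: a rank-3 PSD matrix recovered from 6 of its 50 columns.
rng = np.random.default_rng(0)
n, r = 50, 3
X = rng.standard_normal((n, r))
K = X @ X.T
idx = rng.choice(n, size=6, replace=False)
K_hat = nystrom_approx(K[:, idx], idx)
rel_err = np.linalg.norm(K - K_hat) / np.linalg.norm(K)
print(f"relative error: {rel_err:.2e}")
```

When the sampled block has the same rank as the full matrix, as in this synthetic case, the reconstruction is exact up to floating-point error. That is why observing only a few columns can suffice for a low-rank structure.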

Taxonomy

Topics: Advanced Bandit Algorithms Research · Machine Learning and Algorithms · Metaheuristic Optimization Algorithms Research