Adaptively Learning the Crowd Kernel
Omer Tamuz, Ce Liu, Serge Belongie, Ohad Shamir, Adam Tauman Kalai

TL;DR
This paper presents an adaptive algorithm that learns a comprehensive similarity matrix from crowdsourced triplet queries, enabling effective embedding of objects into Euclidean space for various domain-specific features.
Contribution
The paper introduces a novel adaptive sampling algorithm that efficiently learns a crowd-based similarity kernel from triplet queries, improving over non-adaptive methods.
Findings
The crowd kernel accurately captures domain-specific features.
Adaptive sampling reduces the number of queries needed.
SVM analysis confirms the kernel's effectiveness.
Abstract
We introduce an algorithm that, given n objects, learns a similarity matrix over all n^2 pairs, from crowdsourced data alone. The algorithm samples responses to adaptively chosen triplet-based relative-similarity queries. Each query has the form "is object 'a' more similar to 'b' or to 'c'?" and is chosen to be maximally informative given the preceding responses. The output is an embedding of the objects into Euclidean space (like MDS); we refer to this as the "crowd kernel." SVMs reveal that the crowd kernel captures prominent and subtle features across a number of domains, such as "is striped" among neckties and "vowel vs. consonant" among letters.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Data Stream Mining Techniques
