Inferring object rankings based on noisy pairwise comparisons from multiple annotators
Rahul Gupta, Shrikanth Narayanan

TL;DR
This paper introduces EM-based algorithms for inferring object rankings from noisy, multi-annotator pairwise comparisons, accounting for annotator quality and object attributes to improve ground truth estimation.
Contribution
The work develops novel EM algorithms that incorporate object attributes and variable annotator reliability to accurately infer rankings from noisy crowd-sourced data.
Findings
Algorithms perform well on synthetic and real datasets.
Annotator quality significantly impacts ranking accuracy.
Variable flip probability improves inference in complex cases.
Abstract
Ranking a set of objects involves establishing an order allowing for comparisons between any pair of objects in the set. Oftentimes, due to the unavailability of a ground truth of ranked orders, researchers resort to obtaining judgments from multiple annotators followed by inferring the ground truth based on the collective knowledge of the crowd. However, the aggregation is often ad-hoc and involves imposing stringent assumptions in inferring the ground truth (e.g. majority vote). In this work, we propose Expectation-Maximization (EM) based algorithms that rely on the judgments from multiple annotators and the object attributes for inferring the latent ground truth. The algorithm learns the relation between the latent ground truth and object attributes as well as annotator specific probabilities of flipping, a metric to assess annotator quality. We further extend the EM algorithm to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Auction Theory and Applications · Privacy-Preserving Technologies in Data
