Crowdsourcing subjective annotations using pairwise comparisons reduces bias and error compared to the majority-vote method
Hasti Narimanzadeh, Arash Badie-Modiri, Iuliia Smirnova, Ted Hsuan Yun, Chen

TL;DR
This paper presents a new crowdsourcing method using pairwise comparisons and Elo scoring that reduces bias and error in subjective annotations compared to traditional majority voting, supported by theoretical and simulation results.
Contribution
It introduces a novel pipeline combining pairwise comparisons with Elo scoring and provides a theoretical framework and agent-based model to demonstrate its advantages over majority voting.
Findings
Comparison approach yields higher $f_1$ scores under subjectivity.
It reduces bias inflation common in majority voting.
Number of comparisons scales as $O(N \log N)$ with items.
Abstract
How to better reduce measurement variability and bias introduced by subjectivity in crowdsourced labelling remains an open question. We introduce a theoretical framework for understanding how random error and measurement bias enter into crowdsourced annotations of subjective constructs. We then propose a pipeline that combines pairwise comparison labelling with Elo scoring, and demonstrate that it outperforms the ubiquitous majority-voting method in reducing both types of measurement error. To assess the performance of the labelling approaches, we constructed an agent-based model of crowdsourced labelling that lets us introduce different types of subjectivity into the tasks. We find that under most conditions with task subjectivity, the comparison approach produced higher scores. Further, the comparison approach is less susceptible to inflating bias, which majority voting tends to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Auction Theory and Applications · Privacy-Preserving Technologies in Data
