Error Rate Bounds and Iterative Weighted Majority Voting for Crowdsourcing
Hongwei Li, Bin Yu

TL;DR
This paper derives finite-sample error bounds for various aggregation rules in crowdsourcing, introduces an efficient iterative weighted majority voting method, and demonstrates its effectiveness and computational efficiency through experiments.
Contribution
It provides finite-sample exponential error bounds for general aggregation rules under the Dawid-Skene model and proposes a simple, fast iterative weighted majority voting method that approximates the oracle MAP rule.
Findings
IWMV performs comparably to state-of-the-art methods.
IWMV is significantly faster, about 100 times more efficient.
Theoretical guarantees are established for the one-step version of IWMV.
Abstract
Crowdsourcing has become an effective and popular tool for human-powered computation to label large datasets. Since the workers can be unreliable, it is common in crowdsourcing to assign multiple workers to one task, and to aggregate the labels in order to obtain results of high quality. In this paper, we provide finite-sample exponential bounds on the error rate (in probability and in expectation) of general aggregation rules under the Dawid-Skene crowdsourcing model. The bounds are derived for multi-class labeling, and can be used to analyze many aggregation methods, including majority voting, weighted majority voting and the oracle Maximum A Posteriori (MAP) rule. We show that the oracle MAP rule approximately optimizes our upper bound on the mean error rate of weighted majority voting in certain setting. We propose an iterative weighted majority voting (IWMV) method that optimizes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Auction Theory and Applications · Privacy-Preserving Technologies in Data
