A Technical Survey on Statistical Modelling and Design Methods for Crowdsourcing Quality Control
Yuan Jin, Mark Carman, Ye Zhu, Yong Xiang

TL;DR
This paper provides a comprehensive survey that unifies statistical modeling and mechanism design approaches for crowdsourcing quality control, offering technical insights, taxonomies, and future research directions.
Contribution
It is the first survey to systematically connect and detail the interplay between statistical models and mechanism design in crowdsourcing quality control.
Findings
Unified framework for response quality assessment
Taxonomies of quality control methods
Identified limitations and future research directions
Abstract
Online crowdsourcing provides a scalable and inexpensive means to collect knowledge (e.g. labels) about various types of data items (e.g. text, audio, video). However, it is also known to result in large variance in the quality of recorded responses which often cannot be directly used for training machine learning systems. To resolve this issue, a lot of work has been conducted to control the response quality such that low-quality responses cannot adversely affect the performance of the machine learning systems. Such work is referred to as the quality control for crowdsourcing. Past quality control research can be divided into two major branches: quality control mechanism design and statistical models. The first branch focuses on designing measures, thresholds, interfaces and workflows for payment, gamification, question assignment and other mechanisms that influence workers' behaviour.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Data Stream Mining Techniques · Auction Theory and Applications
