A Worker-Task Specialization Model for Crowdsourcing: Efficient Inference and Fundamental Limits
Doyeon Kim, Jeonghwan Lee, Hye Won Chung

TL;DR
This paper introduces a new worker-task specialization model for crowdsourcing that accounts for varying worker reliability across different task types, providing optimal inference algorithms and demonstrating improved performance over existing methods.
Contribution
The paper proposes a novel $d$-type specialization model for crowdsourcing, characterizes its optimal sample complexity, and develops algorithms that achieve these limits even with unknown worker and task types.
Findings
Algorithms achieve order-wise optimal label inference.
Model outperforms existing algorithms on synthetic and real data.
Handles scalable and heterogeneous worker-task type scenarios.
Abstract
Crowdsourcing system has emerged as an effective platform for labeling data with relatively low cost by using non-expert workers. Inferring correct labels from multiple noisy answers on data, however, has been a challenging problem, since the quality of the answers varies widely across tasks and workers. Many existing works have assumed that there is a fixed ordering of workers in terms of their skill levels, and focused on estimating worker skills to aggregate the answers from workers with different weights. In practice, however, the worker skill changes widely across tasks, especially when the tasks are heterogeneous. In this paper, we consider a new model, called -type specialization model, in which each task and worker has its own (unknown) type and the reliability of each worker can vary in the type of a given task and that of a worker. We allow that the number of types can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Indoor and Outdoor Localization Technologies · Data Stream Mining Techniques
