Reliable Crowdsourcing for Multi-Class Labeling using Coding Theory

Aditya Vempaty; Lav R. Varshney; Pramod K. Varshney

arXiv:1309.3330·cs.IT·June 17, 2015

Reliable Crowdsourcing for Multi-Class Labeling using Coding Theory

Aditya Vempaty, Lav R. Varshney, Pramod K. Varshney

PDF

TL;DR

This paper introduces a coding-theory-based approach to improve the reliability of multi-class labeling in crowdsourcing systems, effectively handling unreliable workers and enhancing classification accuracy.

Contribution

It applies error-control coding and decoding algorithms to crowdsourcing, providing a novel framework for reliable multi-class labeling despite worker unreliability.

Findings

01

Coding-based schemes outperform majority voting.

02

Pairing workers and diversifying questions improve accuracy.

03

Effective across different crowd models.

Abstract

Crowdsourcing systems often have crowd workers that perform unreliable work on the task they are assigned. In this paper, we propose the use of error-control codes and decoding algorithms to design crowdsourcing systems for reliable classification despite unreliable crowd workers. Coding-theory based techniques also allow us to pose easy-to-answer binary questions to the crowd workers. We consider three different crowdsourcing models: systems with independent crowd workers, systems with peer-dependent reward schemes, and systems where workers have common sources of information. For each of these models, we analyze classification performance with the proposed coding-based scheme. We develop an ordering principle for the quality of crowds and describe how system performance changes with the quality of the crowd. We also show that pairing among workers and diversification of the questions…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.