Evaluating the Crowd with Confidence

Manas Joglekar; Hector Garcia-Molina; Aditya Parameswaran

arXiv:1411.6562·cs.DB·November 25, 2014

TL;DR

This paper introduces methods to generate confidence intervals for worker error rates in crowdsourcing, improving quality control by enabling better evaluation and management of worker performance.

Contribution

The paper presents techniques for computing confidence intervals on worker error rate estimates, evaluates them on several real-world datasets, and applies them to two tasks: evicting poorly performing workers and bounding the accuracy of answers.

Findings

1. Correct confidence intervals are generated on real-world datasets.

2. Techniques effectively identify poorly performing workers.

3. Confidence intervals improve evaluation of answer accuracy.

Abstract

Worker quality control is a crucial aspect of crowdsourcing systems, typically occupying a large fraction of the time and money invested in crowdsourcing. In this work, we devise techniques to generate confidence intervals for worker error rate estimates, thereby enabling a better evaluation of worker quality. We show that our techniques generate correct confidence intervals on a range of real-world datasets, and demonstrate wide applicability by using them to evict poorly performing workers and to provide confidence intervals on the accuracy of the answers.
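To make the idea of a confidence interval on a worker's error rate concrete, here is a minimal Python sketch. It is not the paper's technique: it assumes the simplest possible setting, in which a worker's answers can be checked against known correct answers, and it uses the standard Wilson score interval for a binomial proportion. The function names `wilson_interval` and `should_evict`, and the threshold `max_error_rate`, are hypothetical and chosen only for illustration.

```python
import math
from statistics import NormalDist


def wilson_interval(errors, n, confidence=0.95):
    """Wilson score interval for an error rate, given `errors` wrong answers out of `n`.

    Assumption for illustration only: each answer can be checked against a
    known correct answer, so errors follow a binomial distribution.
    """
    if n <= 0:
        raise ValueError("n must be positive")
    if not 0 <= errors <= n:
        raise ValueError("errors must be between 0 and n")
    # Two-sided critical value of the standard normal distribution.
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    p_hat = errors / n
    denom = 1 + z * z / n
    center = (p_hat + z * z / (2 * n)) / denom
    half = z * math.sqrt(p_hat * (1 - p_hat) / n + z * z / (4 * n * n)) / denom
    return max(0.0, center - half), min(1.0, center + half)


def should_evict(errors, n, max_error_rate, confidence=0.95):
    """Hypothetical eviction rule: evict only when even the lower bound of the
    interval exceeds the tolerated error rate."""
    lower, _ = wilson_interval(errors, n, confidence)
    return lower > max_error_rate


if __name__ == "__main__":
    low, high = wilson_interval(10, 100)
    print(f"95% interval for the error rate: [{low:.3f}, {high:.3f}]")
    print("evict at 30 errors out of 100:", should_evict(30, 100, max_error_rate=0.2))
```

Basing the eviction decision on the lower bound rather than on the point estimate means a worker with only a handful of answers is not removed because of a few early mistakes; the interval has to narrow before the rule can fire.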

Taxonomy

Topics: Mobile Crowdsensing and Crowdsourcing · Anomaly Detection Techniques and Applications · Data Stream Mining Techniques