Efficient Human Computation
Ran Gilad-Bachrach, Aharon Bar-Hillel, Liat Ein-Dor

TL;DR
This paper explores methods for obtaining globally consistent labels from multiple uncoordinated human teachers, analyzing the efficiency of various algorithms and establishing bounds based on the ratio of data instances to classes.
Contribution
It introduces a framework for achieving consistent labels without teacher coordination and analyzes the efficiency of algorithms depending on data-class ratios.
Findings
Efficiency depends on the ratio of data instances per teacher to the number of classes.
Proposes algorithms for distributed labeling with efficiency analysis.
Provides an upper bound on label efficiency for uncoordinated teachers.
Abstract
Collecting large labeled data sets is a laborious and expensive task, whose scaling up requires division of the labeling workload between many teachers. When the number of classes is large, miscorrespondences between the labels given by the different teachers are likely to occur, which, in the extreme case, may reach total inconsistency. In this paper we describe how globally consistent labels can be obtained, despite the absence of teacher coordination, and discuss the possible efficiency of this process in terms of human labor. We define a notion of label efficiency, measuring the ratio between the number of globally consistent labels obtained and the number of labels provided by distributed teachers. We show that the efficiency depends critically on the ratio alpha between the number of data instances seen by a single teacher, and the number of classes. We suggest several algorithms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Optimization and Search Problems
