Decision Tree Design for Classification in Crowdsourcing Systems

Baocheng Geng; Qunwei Li; Pramod K. Varshney

arXiv:1805.00559·cs.LG·May 3, 2018

Decision Tree Design for Classification in Crowdsourcing Systems

Baocheng Geng, Qunwei Li, Pramod K. Varshney

PDF

Open Access

TL;DR

This paper introduces a new sequential decision tree approach for crowdsourcing classification that accounts for worker unreliability, aiming to minimize misclassification probability through novel algorithms and worker assignment strategies.

Contribution

It proposes two algorithms for decision tree design in crowdsourcing, considering worker errors and optimizing the trade-off between cost and accuracy.

Findings

01

Algorithms effectively reduce misclassification probability.

02

Worker assignment strategies improve cost-performance balance.

03

Numerical results validate the proposed methods.

Abstract

In this paper, we present a novel sequential paradigm for classification in crowdsourcing systems. Considering that workers are unreliable and they perform the tests with errors, we study the construction of decision trees so as to minimize the probability of mis-classification. By exploiting the connection between probability of mis-classification and entropy at each level of the decision tree, we propose two algorithms for decision tree design. Furthermore, the worker assignment problem is studied when workers can be assigned to different tests of the decision tree to provide a trade-off between classification cost and resulting error performance. Numerical results are presented for illustration.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMobile Crowdsensing and Crowdsourcing · Data Stream Mining Techniques · Auction Theory and Applications