Optimal Inference in Crowdsourced Classification via Belief Propagation

Jungseul Ok; Sewoong Oh; Jinwoo Shin; Yung Yi

arXiv:1602.03619·cs.LG·January 13, 2017·1 cites

Optimal Inference in Crowdsourced Classification via Belief Propagation

Jungseul Ok, Sewoong Oh, Jinwoo Shin, Yung Yi

PDF

Open Access

TL;DR

This paper demonstrates that Belief Propagation achieves the fundamental limit of label inference accuracy in crowdsourced classification under the Dawid-Skene model, outperforming existing algorithms.

Contribution

It introduces a new lower bound on the fundamental limit and proves BP's optimality, bridging the gap between theory and practical performance.

Findings

01

BP matches the fundamental limit of inference accuracy.

02

BP outperforms state-of-the-art algorithms in experiments.

03

Theoretical analysis confirms BP's optimality across regimes.

Abstract

Crowdsourcing systems are popular for solving large-scale labelling tasks with low-paid workers. We study the problem of recovering the true labels from the possibly erroneous crowdsourced labels under the popular Dawid-Skene model. To address this inference problem, several algorithms have recently been proposed, but the best known guarantee is still significantly larger than the fundamental limit. We close this gap by introducing a tighter lower bound on the fundamental limit and proving that Belief Propagation (BP) exactly matches this lower bound. The guaranteed optimality of BP is the strongest in the sense that it is information-theoretically impossible for any other algorithm to correctly label a larger fraction of the tasks. Experimental results suggest that BP is close to optimal for all regimes considered and improves upon competing state-of-the-art algorithms.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMobile Crowdsensing and Crowdsourcing · Auction Theory and Applications · Privacy-Preserving Technologies in Data