Bayesian Crowdsourcing with Constraints

Panagiotis A. Traganitis; Georgios B. Giannakis

arXiv:2012.11048 · cs.LG · July 19, 2021

TL;DR

This paper introduces Bayesian algorithms for semi-supervised crowdsourcing classification, leveraging label and instance-level constraints to improve label aggregation accuracy, validated through analytical and empirical evaluations.

Contribution

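The paper develops Bayesian variational inference algorithms for semi-supervised crowdsourced classification under two kinds of side information, label constraints and pairwise instance-level constraints, and shows that they improve label aggregation accuracy over unsupervised approaches.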

Findings

01

The semi-supervised Bayesian algorithms aggregate labels more accurately than unsupervised crowdsourcing.

02

Both label constraints and instance-level constraints quantifiably improve label aggregation quality.

03

The improvement is validated analytically and empirically on several crowdsourcing datasets.

Abstract

Crowdsourcing has emerged as a powerful paradigm for efficiently labeling large datasets and performing various learning tasks, by leveraging crowds of human annotators. When additional information is available about the data, semi-supervised crowdsourcing approaches that enhance the aggregation of labels from human annotators are well motivated. This work deals with semi-supervised crowdsourced classification, under two regimes of semi-supervision: a) label constraints, that provide ground-truth labels for a subset of data; and b) potentially easier to obtain instance-level constraints, that indicate relationships between pairs of data. Bayesian algorithms based on variational inference are developed for each regime, and their quantifiably improved performance, compared to unsupervised crowdsourcing, is analytically and empirically validated on several crowdsourcing datasets.
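To make the two regimes of semi-supervision concrete, here is a minimal sketch of how the inputs might be represented. It is not the paper's variational Bayes algorithm: it uses plain majority voting as a stand-in aggregator, and the names `majority_vote`, `violated_constraints`, `label_constraints`, and `must_link` are assumptions made for illustration only. Label constraints are modeled as a dictionary from item index to known label, and instance-level constraints as pairs of item indices that should (or should not) share a label.

```python
import numpy as np

def majority_vote(responses, num_classes, label_constraints=None):
    """Aggregate annotator labels by majority vote (illustrative baseline).

    responses: (num_annotators, num_items) int array, -1 marks a missing label.
    label_constraints: optional dict {item_index: ground_truth_label}.
    """
    num_items = responses.shape[1]
    counts = np.zeros((num_items, num_classes), dtype=int)
    for k in range(num_classes):
        # Count how many annotators assigned class k to each item.
        counts[:, k] = (responses == k).sum(axis=0)
    estimates = counts.argmax(axis=1)  # ties resolve to the lowest class index
    if label_constraints:
        # Label constraints: overwrite estimates where ground truth is known.
        for item, label in label_constraints.items():
            estimates[item] = label
    return estimates

def violated_constraints(estimates, must_link=(), cannot_link=()):
    """Return the instance-level constraints that the estimates break."""
    violated = [(i, j) for i, j in must_link if estimates[i] != estimates[j]]
    violated += [(i, j) for i, j in cannot_link if estimates[i] == estimates[j]]
    return violated

# Three annotators, four items, two classes (toy data, not from the paper).
responses = np.array([
    [0, 1, 1, -1],
    [0, 0, 1, 1],
    [1, 0, -1, 0],
])

unsupervised = majority_vote(responses, num_classes=2)
constrained = majority_vote(responses, num_classes=2, label_constraints={3: 1})

print(unsupervised.tolist())
print(constrained.tolist())
print(violated_constraints(unsupervised, must_link=[(2, 3)]))
print(violated_constraints(constrained, must_link=[(2, 3)]))
```

In this toy example, item 3 receives a tied vote, so the unsupervised estimate breaks the must-link constraint between items 2 and 3. Supplying the ground-truth label for item 3 resolves it. The algorithms in the paper incorporate such constraints inside Bayesian variational inference rather than by post hoc overwriting as done here.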

Taxonomy

Methods: Variational Inference