Deep learning from crowds

Filipe Rodrigues; Francisco Pereira

arXiv:1709.01779·stat.ML·December 27, 2017

Deep learning from crowds

Filipe Rodrigues, Francisco Pereira

PDF

3 Repos

TL;DR

This paper introduces a novel end-to-end deep learning approach that effectively learns from noisy, crowdsourced labels by modeling annotator reliability, achieving state-of-the-art results across multiple tasks.

Contribution

It proposes a new crowd layer for deep networks and an EM algorithm to jointly learn network parameters and annotator reliabilities from noisy labels.

Findings

01

Achieves state-of-the-art results on crowdsourced datasets

02

Effectively models annotator reliability and biases

03

Works across classification, regression, and sequence labeling

Abstract

Over the last few years, deep learning has revolutionized the field of machine learning by dramatically improving the state-of-the-art in various domains. However, as the size of supervised artificial neural networks grows, typically so does the need for larger labeled datasets. Recently, crowdsourcing has established itself as an efficient and cost-effective solution for labeling large sets of data in a scalable manner, but it often requires aggregating labels from multiple noisy contributors with different levels of expertise. In this paper, we address the problem of learning deep neural networks from crowds. We begin by describing an EM algorithm for jointly learning the parameters of the network and the reliabilities of the annotators. Then, a novel general-purpose crowd layer is proposed, which allows us to train deep neural networks end-to-end, directly from the noisy labels of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.