Leveraging Crowdsourcing Data For Deep Active Learning - An Application:   Learning Intents in Alexa

Jie Yang; Thomas Drake; Andreas Damianou; Yoelle Maarek

arXiv:1803.04223·cs.LG·March 13, 2018

Leveraging Crowdsourcing Data For Deep Active Learning - An Application: Learning Intents in Alexa

Jie Yang, Thomas Drake, Andreas Damianou, Yoelle Maarek

PDF

TL;DR

This paper introduces a Bayesian framework that improves deep active learning from crowdsourced annotations by modeling annotator expertise and reducing annotation needs, demonstrated on Alexa intent classification.

Contribution

It extends Bayesian deep learning to incorporate targeted crowdsourcing, learning annotator expertise, and optimizing annotation efficiency for deep models.

Findings

01

Accurately learns annotator expertise and true labels

02

Reduces annotation requirements compared to existing methods

03

Effective in intent classification for Alexa

Abstract

This paper presents a generic Bayesian framework that enables any deep learning model to actively learn from targeted crowds. Our framework inherits from recent advances in Bayesian deep learning, and extends existing work by considering the targeted crowdsourcing approach, where multiple annotators with unknown expertise contribute an uncontrolled amount (often limited) of annotations. Our framework leverages the low-rank structure in annotations to learn individual annotator expertise, which then helps to infer the true labels from noisy and sparse annotations. It provides a unified Bayesian model to simultaneously infer the true labels and train the deep learning model in order to reach an optimal learning efficacy. Finally, our framework exploits the uncertainty of the deep learning model during prediction as well as the annotators' estimated expertise to minimize the number of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.