Using Error Decay Prediction to Overcome Practical Issues of Deep Active   Learning for Named Entity Recognition

Haw-Shiuan Chang; Shankar Vembu; Sunil Mohan; Rheeya Uppaal; Andrew; McCallum

arXiv:1911.07335·cs.CL·July 22, 2020

Using Error Decay Prediction to Overcome Practical Issues of Deep Active Learning for Named Entity Recognition

Haw-Shiuan Chang, Shankar Vembu, Sunil Mohan, Rheeya Uppaal, Andrew, McCallum

PDF

TL;DR

This paper introduces a transparent batch active sampling framework for deep active learning in NER, addressing practical issues like black-box model use, noise robustness, and transparency, and demonstrating improved performance and insights.

Contribution

It proposes a novel error decay prediction-based sampling method that enhances robustness and transparency in deep active learning for NER tasks.

Findings

01

Outperforms diversification-based methods for black-box NER models

02

Increases robustness to labeling noise when combined with uncertainty sampling

03

Provides insights into when different active sampling strategies are effective

Abstract

Existing deep active learning algorithms achieve impressive sampling efficiency on natural language processing tasks. However, they exhibit several weaknesses in practice, including (a) inability to use uncertainty sampling with black-box models, (b) lack of robustness to labeling noise, and (c) lack of transparency. In response, we propose a transparent batch active sampling framework by estimating the error decay curves of multiple feature-defined subsets of the data. Experiments on four named entity recognition (NER) tasks demonstrate that the proposed methods significantly outperform diversification-based methods for black-box NER taggers, and can make the sampling process more robust to labeling noise when combined with uncertainty-based methods. Furthermore, the analysis of experimental results sheds light on the weaknesses of different active sampling strategies, and when…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.