Reducing Confusion in Active Learning for Part-Of-Speech Tagging

Aditi Chaudhary; Antonios Anastasopoulos; Zaid Sheikh; Graham Neubig

arXiv:2011.00767·cs.CL·November 24, 2020

Reducing Confusion in Active Learning for Part-Of-Speech Tagging

Aditi Chaudhary, Antonios Anastasopoulos, Zaid Sheikh, Graham Neubig

PDF

TL;DR

This paper introduces a novel active learning strategy for POS tagging that focuses on reducing confusion between tag pairs, outperforming existing heuristics across multiple languages.

Contribution

The paper proposes a new active learning approach targeting confusion reduction between tag pairs, improving sample selection for low-resource POS tagging.

Findings

01

Proposed strategy outperforms existing AL heuristics significantly.

02

Proper model calibration via cross-view training enhances AL effectiveness.

03

Selected examples better match oracle data distribution.

Abstract

Active learning (AL) uses a data selection algorithm to select useful training samples to minimize annotation cost. This is now an essential tool for building low-resource syntactic analyzers such as part-of-speech (POS) taggers. Existing AL heuristics are generally designed on the principle of selecting uncertain yet representative training instances, where annotating these instances may reduce a large number of errors. However, in an empirical study across six typologically diverse languages (German, Swedish, Galician, North Sami, Persian, and Ukrainian), we found the surprising result that even in an oracle scenario where we know the true uncertainty of predictions, these current heuristics are far from optimal. Based on this analysis, we pose the problem of AL as selecting instances which maximally reduce the confusion between particular pairs of output tags. Extensive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.