Adapting Coreference Resolution Models through Active Learning

Michelle Yuan; Patrick Xia; Chandler May; Benjamin Van Durme; Jordan; Boyd-Graber

arXiv:2104.07611·cs.CL·March 30, 2022

Adapting Coreference Resolution Models through Active Learning

Michelle Yuan, Patrick Xia, Chandler May, Benjamin Van Durme, Jordan, Boyd-Graber

PDF

Open Access 1 Repo

TL;DR

This paper investigates active learning strategies for neural coreference resolution, focusing on uncertainty sampling and annotation efficiency, to improve model transferability across domains.

Contribution

It introduces a detailed analysis of active learning methods for coreference resolution, highlighting effective span annotation strategies within documents.

Findings

01

Span annotation within the same document is more effective.

02

Uncertainty sampling improves annotation efficiency.

03

Error analysis guides better active learning strategies.

Abstract

Neural coreference resolution models trained on one dataset may not transfer to new, low-resource domains. Active learning mitigates this problem by sampling a small subset of data for annotators to label. While active learning is well-defined for classification tasks, its application to coreference resolution is neither well-defined nor fully understood. This paper explores how to actively label coreference, examining sources of model uncertainty and document reading costs. We compare uncertainty sampling strategies and their advantages through thorough error analysis. In both synthetic and human experiments, labeling spans within the same document is more effective than annotating spans across documents. The findings contribute to a more realistic development of coreference resolution models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

forest-snow/incremental-coref
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Neural Networks and Applications · Machine Learning in Healthcare