Finetuning BERT on Partially Annotated NER Corpora

Viktor Scherbakov; Vladimir Mayorov

arXiv:2211.14360·cs.CL·November 29, 2022

Finetuning BERT on Partially Annotated NER Corpora

Viktor Scherbakov, Vladimir Mayorov

PDF

Open Access 1 Repo

TL;DR

This paper proposes a method for finetuning BERT on partially annotated NER datasets using self-supervision and label preprocessing, significantly improving performance when labels are sparse.

Contribution

It introduces a novel approach that enables effective finetuning of BERT on partially labelled datasets, outperforming previous LSTM-based methods.

Findings

01

Outperforms previous LSTM-based label preprocessing baseline.

02

Achieves baseline performance with only 10% of entities labelled.

03

Demonstrates effective finetuning of RoBERTa on sparse annotations.

Abstract

Most Named Entity Recognition (NER) models operate under the assumption that training datasets are fully labelled. While it is valid for established datasets like CoNLL 2003 and OntoNotes, sometimes it is not feasible to obtain the complete dataset annotation. These situations may occur, for instance, after selective annotation of entities for cost reduction. This work presents an approach to finetuning BERT on such partially labelled datasets using self-supervision and label preprocessing. Our approach outperforms the previous LSTM-based label preprocessing baseline, significantly improving the performance on poorly labelled datasets. We demonstrate that following our approach while finetuning RoBERTa on CoNLL 2003 dataset with only 10% of total entities labelled is enough to reach the performance of the baseline trained on the same dataset with 50% of the entities labelled.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

viktorooreps/guided-bond
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Data Quality and Management

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Weight Decay · Residual Connection · Dense Connections · Layer Normalization · WordPiece · Linear Warmup With Linear Decay