NER-BERT: A Pre-trained Model for Low-Resource Entity Tagging

Zihan Liu; Feijun Jiang; Yuxiang Hu; Chen Shi; Pascale Fung

arXiv:2112.00405·cs.CL·December 2, 2021·31 cites

NER-BERT: A Pre-trained Model for Low-Resource Entity Tagging

Zihan Liu, Feijun Jiang, Yuxiang Hu, Chen Shi, Pascale Fung

PDF

Open Access

TL;DR

This paper introduces NER-BERT, a pre-trained language model specifically designed for low-resource named entity recognition, demonstrating significant improvements over existing models across multiple domains.

Contribution

The paper constructs a high-quality, large-scale NER dataset and pre-trains NER-BERT, addressing the gap in low-resource NER performance and task-specific pre-training.

Findings

01

NER-BERT outperforms BERT and other baselines in low-resource settings

02

The model shows strong performance across nine diverse domains

03

Entity representations visualization confirms effectiveness

Abstract

Named entity recognition (NER) models generally perform poorly when large training datasets are unavailable for low-resource domains. Recently, pre-training a large-scale language model has become a promising direction for coping with the data scarcity issue. However, the underlying discrepancies between the language modeling and NER task could limit the models' performance, and pre-training for the NER task has rarely been studied since the collected NER datasets are generally small or large but with low quality. In this paper, we construct a massive NER corpus with a relatively high quality, and we pre-train a NER-BERT model based on the created dataset. Experimental results show that our pre-trained model can significantly outperform BERT as well as other strong baselines in low-resource scenarios across nine diverse domains. Moreover, a visualization of entity representations…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Data Quality and Management · Natural Language Processing Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Dropout · Linear Warmup With Linear Decay · WordPiece · Layer Normalization · Weight Decay · Dense Connections · Multi-Head Attention