CrossNER: Evaluating Cross-Domain Named Entity Recognition

Zihan Liu; Yan Xu; Tiezheng Yu; Wenliang Dai; Ziwei Ji; Samuel; Cahyawijaya; Andrea Madotto; Pascale Fung

arXiv:2012.04373·cs.CL·December 15, 2020·22 cites

CrossNER: Evaluating Cross-Domain Named Entity Recognition

Zihan Liu, Yan Xu, Tiezheng Yu, Wenliang Dai, Ziwei Ji, Samuel, Cahyawijaya, Andrea Madotto, Pascale Fung

PDF

Open Access 5 Repos 4 Datasets 1 Video

TL;DR

This paper introduces CrossNER, a comprehensive cross-domain NER dataset with domain-specific entity types, and evaluates domain-adaptive pre-training strategies to improve cross-domain NER performance, highlighting both progress and ongoing challenges.

Contribution

The paper presents a new fully-labeled cross-domain NER dataset with domain-specific entities and explores effective domain-adaptive pre-training strategies for NER.

Findings

01

Domain-specific entity focus improves NER adaptation.

02

Domain-adaptive pre-training enhances cross-domain NER performance.

03

Challenges remain in fully addressing cross-domain NER difficulties.

Abstract

Cross-domain named entity recognition (NER) models are able to cope with the scarcity issue of NER samples in target domains. However, most of the existing NER benchmarks lack domain-specialized entity types or do not focus on a certain domain, leading to a less effective cross-domain evaluation. To address these obstacles, we introduce a cross-domain NER dataset (CrossNER), a fully-labeled collection of NER data spanning over five diverse domains with specialized entity categories for different domains. Additionally, we also provide a domain-related corpus since using it to continue pre-training language models (domain-adaptive pre-training) is effective for the domain adaptation. We then conduct comprehensive experiments to explore the effectiveness of leveraging different levels of the domain corpus and pre-training strategies to do domain-adaptive pre-training for the cross-domain…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Datasets

Videos

CrossNER: Evaluating Cross-Domain Named Entity Recognition· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Domain Adaptation and Few-Shot Learning