Mulco: Recognizing Chinese Nested Named Entities Through Multiple Scopes

Jiuding Yang; Jinwen Luo; Weidong Guo; Jerry Chen; Di Niu; Yu Xu

arXiv:2211.10854·cs.CL·November 22, 2022

Mulco: Recognizing Chinese Nested Named Entities Through Multiple Scopes

Jiuding Yang, Jinwen Luo, Weidong Guo, Jerry Chen, Di Niu, Yu Xu

PDF

Open Access

TL;DR

This paper introduces Mulco, a novel model for recognizing nested Chinese named entities, supported by a new dataset ChiNesE, and demonstrates its superior performance over existing methods.

Contribution

The paper presents Mulco, a new approach for Chinese nested entity recognition using multiple scopes, and releases ChiNesE, a dedicated dataset for this task.

Findings

01

Mulco outperforms baseline methods on ChiNesE.

02

Mulco achieves state-of-the-art results on ACE2005 Chinese corpus.

03

The dataset ChiNesE contains 20,000 sentences with 117,284 entities, 43.8% nested.

Abstract

Nested Named Entity Recognition (NNER) has been a long-term challenge to researchers as an important sub-area of Named Entity Recognition. NNER is where one entity may be part of a longer entity, and this may happen on multiple levels, as the term nested suggests. These nested structures make traditional sequence labeling methods unable to properly recognize all entities. While recent researches focus on designing better recognition methods for NNER in a variety of languages, the Chinese NNER (CNNER) still lacks attention, where a free-for-access, CNNER-specialized benchmark is absent. In this paper, we aim to solve CNNER problems by providing a Chinese dataset and a learning-based model to tackle the issue. To facilitate the research on this task, we release ChiNesE, a CNNER dataset with 20,000 sentences sampled from online passages of multiple domains, containing 117,284 entities…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies