Nested Named Entity Recognition as Latent Lexicalized Constituency   Parsing

Chao Lou; Songlin Yang; Kewei Tu

arXiv:2203.04665·cs.CL·March 10, 2022·1 cites

Nested Named Entity Recognition as Latent Lexicalized Constituency Parsing

Chao Lou, Songlin Yang, Kewei Tu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel nested NER approach using lexicalized constituency trees with headword annotations, leveraging efficient algorithms and additional training strategies to achieve state-of-the-art results.

Contribution

It proposes a new nested NER model based on lexicalized constituency parsing, incorporating headword information and novel training strategies for improved performance.

Findings

01

Achieves state-of-the-art results on ACE2004, ACE2005, and NNE datasets.

02

Demonstrates competitive performance on GENIA dataset.

03

Maintains fast inference speed.

Abstract

Nested named entity recognition (NER) has been receiving increasing attention. Recently, (Fu et al, 2021) adapt a span-based constituency parser to tackle nested NER. They treat nested entities as partially-observed constituency trees and propose the masked inside algorithm for partial marginalization. However, their method cannot leverage entity heads, which have been shown useful in entity mention detection and entity typing. In this work, we resort to more expressive structures, lexicalized constituency trees in which constituents are annotated by headwords, to model nested entities. We leverage the Eisner-Satta algorithm to perform partial marginalization and inference efficiently. In addition, we propose to use (1) a two-stage strategy (2) a head regularization loss and (3) a head-aware labeling loss in order to enhance the performance. We make a thorough ablation study to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

louchao98/nner_as_parsing
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Data Quality and Management