Gaussian Prior Reinforcement Learning for Nested Named Entity   Recognition

Yawen Yang; Xuming Hu; Fukun Ma; Shu'ang Li; Aiwei Liu; Lijie Wen,; Philip S. Yu

arXiv:2305.07266·cs.CL·May 15, 2023·1 cites

Gaussian Prior Reinforcement Learning for Nested Named Entity Recognition

Yawen Yang, Xuming Hu, Fukun Ma, Shu'ang Li, Aiwei Liu, Lijie Wen,, Philip S. Yu

PDF

Open Access

TL;DR

This paper introduces GPRL, a novel reinforcement learning-based seq2seq model for nested NER that models entity recognition as triplet sequence generation, effectively capturing boundary relations and improving performance.

Contribution

The paper proposes GPRL, a new approach that uses Gaussian priors and reinforcement learning to better model nested entity boundaries and recognition order in NER tasks.

Findings

01

GPRL outperforms previous nested NER models on three datasets.

02

The Gaussian prior effectively models boundary distance distributions.

03

Reinforcement learning helps decouple entity order from gold labels.

Abstract

Named Entity Recognition (NER) is a well and widely studied task in natural language processing. Recently, the nested NER has attracted more attention since its practicality and difficulty. Existing works for nested NER ignore the recognition order and boundary position relation of nested entities. To address these issues, we propose a novel seq2seq model named GPRL, which formulates the nested NER task as an entity triplet sequence generation process. GPRL adopts the reinforcement learning method to generate entity triplets decoupling the entity order in gold labels and expects to learn a reasonable recognition order of entities via trial and error. Based on statistics of boundary distance for nested entities, GPRL designs a Gaussian prior to represent the boundary distance distribution between nested entities and adjust the output probability distribution of nested boundary tokens.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence