End-to-End Entity Detection with Proposer and Regressor
Xueru Wen, Changjiang Zhou, Haotian Tang, Luguang Liang, Yu Jiang,, Hong Qi

TL;DR
This paper introduces an end-to-end entity detection model with proposer and regressor components, leveraging rich semantic queries and novel attention mechanisms to improve nested and flat NER performance, achieving state-of-the-art results.
Contribution
It proposes a new end-to-end entity detection framework with proposer and regressor, incorporating spatially modulated attention and progressive refinement for improved nested NER.
Findings
Achieves a new state-of-the-art F1 score of 80.74 on GENIA.
Achieves a new state-of-the-art F1 score of 72.38 on WeiboNER.
Demonstrates superior performance in both flat and nested NER tasks.
Abstract
Named entity recognition is a traditional task in natural language processing. In particular, nested entity recognition receives extensive attention for the widespread existence of the nesting scenario. The latest research migrates the well-established paradigm of set prediction in object detection to cope with entity nesting. However, the manual creation of query vectors, which fail to adapt to the rich semantic information in the context, limits these approaches. An end-to-end entity detection approach with proposer and regressor is presented in this paper to tackle the issues. First, the proposer utilizes the feature pyramid network to generate high-quality entity proposals. Then, the regressor refines the proposals for generating the final prediction. The model adopts encoder-only architecture and thus obtains the advantages of the richness of query semantics, high precision of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Advanced Graph Neural Networks
