Global Pointer: Novel Efficient Span-based Approach for Named Entity Recognition
Jianlin Su, Ahmed Murtadha, Shengfeng Pan, Jing Hou, Jun Sun, Wanwei, Huang, Bo Wen, Yunfeng Liu

TL;DR
The paper introduces Global Pointer, a novel span-based NER framework that efficiently captures semantic information using a global view and attention mechanisms, outperforming existing methods on benchmark datasets.
Contribution
It proposes a new span-based NER model with a global attention mechanism, a novel loss function, and an efficient training method, improving accuracy and computational efficiency.
Findings
Outperforms existing span-based NER methods on benchmarks.
The new loss function effectively addresses label imbalance.
The model demonstrates improved accuracy and efficiency.
Abstract
Named entity recognition (NER) task aims at identifying entities from a piece of text that belong to predefined semantic types such as person, location, organization, etc. The state-of-the-art solutions for flat entities NER commonly suffer from capturing the fine-grained semantic information in underlying texts. The existing span-based approaches overcome this limitation, but the computation time is still a concern. In this work, we propose a novel span-based NER framework, namely Global Pointer (GP), that leverages the relative positions through a multiplicative attention mechanism. The ultimate goal is to enable a global view that considers the beginning and the end positions to predict the entity. To this end, we design two modules to identify the head and the tail of a given entity to enable the inconsistency between the training and inference processes. Moreover, we introduce a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies
MethodsSoftmax · Multiplicative Attention
