Hero-Gang Neural Model For Named Entity Recognition

Jinpeng Hu; Yaling Shen; Yang Liu; Xiang Wan; Tsung-Hui Chang

arXiv:2205.07177·cs.CL·May 17, 2022

Hero-Gang Neural Model For Named Entity Recognition

Jinpeng Hu, Yaling Shen, Yang Liu, Xiang Wan, Tsung-Hui Chang

PDF

Open Access 1 Repo

TL;DR

This paper introduces the Hero-Gang Neural model for NER, combining Transformer-based global context with local feature extraction to improve entity recognition accuracy.

Contribution

It proposes a novel Hero-Gang neural structure that integrates global and local information for enhanced NER performance.

Findings

01

Outperforms existing models on benchmark datasets

02

Effectively combines global and local features

03

Improves recognition of local position information

Abstract

Named entity recognition (NER) is a fundamental and important task in NLP, aiming at identifying named entities (NEs) from free text. Recently, since the multi-head attention mechanism applied in the Transformer model can effectively capture longer contextual information, Transformer-based models have become the mainstream methods and have achieved significant performance in this task. Unfortunately, although these models can capture effective global context information, they are still limited in the local feature and position information extraction, which is critical in NER. In this paper, to address this limitation, we propose a novel Hero-Gang Neural structure (HGN), including the Hero and Gang module, to leverage both global and local information to promote NER. Specifically, the Hero module is composed of a Transformer-based encoder to maintain the advantage of the self-attention…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jinpeng01/hgn
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies

MethodsAttention Is All You Need · Linear Layer · Dropout · Adam · Byte Pair Encoding · Residual Connection · Label Smoothing · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Layer Normalization