A More Efficient Chinese Named Entity Recognition base on BERT and   Syntactic Analysis

Xiao Fu; Guijun Zhang

arXiv:2101.11423·cs.CL·January 28, 2021

A More Efficient Chinese Named Entity Recognition base on BERT and Syntactic Analysis

Xiao Fu, Guijun Zhang

PDF

Open Access

TL;DR

This paper introduces a more efficient Chinese NER method that leverages syntactic analysis and a compressed BERT model, achieving higher accuracy with significantly reduced computational cost.

Contribution

It presents a novel g-BERT model that compresses BERT for Chinese NER, utilizing syntactic information to improve performance and efficiency.

Findings

01

g-BERT reduces calculation by 60%

02

NER performance improves by 2% F1 score

03

Method effectively utilizes POS, CWS, and parsing results

Abstract

We propose a new Named entity recognition (NER) method to effectively make use of the results of Part-of-speech (POS) tagging, Chinese word segmentation (CWS) and parsing while avoiding NER error caused by POS tagging error. This paper first uses Stanford natural language process (NLP) tool to annotate large-scale untagged data so as to reduce the dependence on the tagged data; then a new NLP model, g-BERT model, is designed to compress Bidirectional Encoder Representations from Transformers (BERT) model in order to reduce calculation quantity; finally, the model is evaluated based on Chinese NER dataset. The experimental results show that the calculation quantity in g-BERT model is reduced by 60% and performance improves by 2% with Test F1 to 96.5 compared with that in BERT model.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Data Quality and Management

MethodsLinear Layer · Refunds@Expedia|||How do I get a full refund from Expedia? · Attention Dropout · WordPiece · Attention Is All You Need · Residual Connection · Dense Connections · Adam · Linear Warmup With Linear Decay · Dropout