Improving NER's Performance with Massive financial corpus

Han Zhang

arXiv:2007.15871·cs.CL·August 3, 2020·1 cites

Improving NER's Performance with Massive financial corpus

Han Zhang

PDF

Open Access 1 Repo

TL;DR

This paper presents a method to improve Named Entity Recognition (NER) performance in financial texts by pre-training lightweight models with knowledge distillation and multi-stage learning, achieving higher recall and faster training.

Contribution

It introduces a novel approach combining pre-training, knowledge distillation, and multi-stage learning to enhance NER in financial domains with limited data.

Findings

01

Recall increased by nearly 20 points

02

Training speed was four times faster than BERT-CRF

03

Effective in low-quality, small-scale datasets

Abstract

Training large deep neural networks needs massive high quality annotation data, but the time and labor costs are too expensive for small business. We start a company-name recognition task with a small scale and low quality training data, then using skills to enhanced model training speed and predicting performance with minimum labor cost. The methods we use involve pre-training a lite language model such as Albert-small or Electra-small in financial corpus, knowledge of distillation and multi-stage learning. The result is that we raised the recall rate by nearly 20 points and get 4 times as fast as BERT-CRF model.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Hanlard/Electra_CRF_NER
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Stock Market Forecasting Methods