Named Entity Recognition with stack residual LSTM and trainable bias   decoding

Quan Tran; Andrew MacKinlay; Antonio Jimeno Yepes

arXiv:1706.07598·cs.CL·July 12, 2017·24 cites

Named Entity Recognition with stack residual LSTM and trainable bias decoding

Quan Tran, Andrew MacKinlay, Antonio Jimeno Yepes

PDF

Open Access

TL;DR

This paper introduces residual connections in stacked RNNs and a bias decoding mechanism to enhance NER performance, achieving state-of-the-art results on CoNLL 2003 for English and Spanish.

Contribution

It proposes residual connections for deep RNNs and a bias decoding method to optimize non-differentiable objectives in NER models.

Findings

01

Improved NER accuracy on CoNLL 2003 dataset

02

Achieved state-of-the-art results for English and Spanish

03

Demonstrated effectiveness of bias decoding in NER

Abstract

Recurrent Neural Network models are the state-of-the-art for Named Entity Recognition (NER). We present two innovations to improve the performance of these models. The first innovation is the introduction of residual connections between the Stacked Recurrent Neural Network model to address the degradation problem of deep neural networks. The second innovation is a bias decoding mechanism that allows the trained system to adapt to non-differentiable and externally computed objectives, such as the entity-based F-measure. Our work improves the state-of-the-art results for both Spanish and English languages on the standard train/development/test split of the CoNLL 2003 Shared Task NER dataset.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech and dialogue systems