Chinese NER Using Lattice LSTM

Yue Zhang; Jie Yang

arXiv:1805.02023·cs.CL·July 6, 2018·34 cites

Chinese NER Using Lattice LSTM

Yue Zhang, Jie Yang

PDF

Open Access 3 Repos

TL;DR

This paper introduces a lattice-structured LSTM model for Chinese Named Entity Recognition that effectively combines character and word information, outperforming traditional methods by reducing segmentation errors and leveraging richer context.

Contribution

The paper presents a novel lattice LSTM model that integrates lexicon-matched words with character sequences for improved Chinese NER performance.

Findings

01

Lattice LSTM outperforms character-based models.

02

Lattice LSTM surpasses word-based models.

03

Model achieves state-of-the-art results on multiple datasets.

Abstract

We investigate a lattice-structured LSTM model for Chinese NER, which encodes a sequence of input characters as well as all potential words that match a lexicon. Compared with character-based methods, our model explicitly leverages word and word sequence information. Compared with word-based methods, lattice LSTM does not suffer from segmentation errors. Gated recurrent cells allow our model to choose the most relevant characters and words from a sentence for better NER results. Experiments on various datasets show that lattice LSTM outperforms both word-based and character-based LSTM baselines, achieving the best results.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory