Comparing CNN and LSTM character-level embeddings in BiLSTM-CRF models for chemical and disease named entity recognition

Zenan Zhai; Dat Quoc Nguyen; Karin Verspoor

arXiv:1808.08450·cs.CL·August 28, 2018

TL;DR

This paper compares CNN-based and LSTM-based character-level word embeddings in BiLSTM-CRF models for chemical and disease NER, finding comparable accuracy but a markedly lower training cost for the CNN-based variant.

Contribution

It provides a comparative analysis of CNN and LSTM character embeddings in NER models, highlighting their performance and computational trade-offs.

Findings

01

Both CNN-based and LSTM-based character-level word embeddings reach comparable state-of-the-art NER performance on the BioCreative V CDR corpus.

02

CNN-based character-level embeddings are computationally cheaper: they increase training time over word-based models by 25%.

03

LSTM-based character-level embeddings more than double the training time required by word-based models.

Abstract

We compare the use of LSTM-based and CNN-based character-level word embeddings in BiLSTM-CRF models to approach chemical and disease named entity recognition (NER) tasks. Empirical results over the BioCreative V CDR corpus show that the use of either type of character-level word embeddings in conjunction with the BiLSTM-CRF models leads to comparable state-of-the-art performance. However, the models using CNN-based character-level word embeddings have a computational performance advantage, increasing training time over word-based models by 25% while the LSTM-based character-level word embeddings more than double the required training time.
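The two training-time figures in the abstract are both stated relative to the word-based baseline, so they can be combined into a rough lower bound on how much slower the LSTM-based variant trains than the CNN-based one. The sketch below is only that arithmetic; the variable and function names are illustrative, and the value 2.0 is a lower bound taken from "more than double", not a measured result from the paper.

```python
# Relative training-time factors, normalised to the word-based baseline (= 1.0).
WORD_BASELINE = 1.0
CNN_FACTOR = 1.25        # "+25%" over word-based models, from the abstract
LSTM_LOWER_BOUND = 2.0   # "more than double": a lower bound only, not a measurement


def relative_cost(factor: float, baseline: float = WORD_BASELINE) -> float:
    """Return a training time expressed as a multiple of the baseline."""
    return factor / baseline


# Lower bound on how much slower the LSTM-based variant trains than the CNN-based one.
lstm_vs_cnn_lower_bound = relative_cost(LSTM_LOWER_BOUND) / relative_cost(CNN_FACTOR)
print(f"LSTM-based / CNN-based training time > {lstm_vs_cnn_lower_bound:.2f}x")
```

Under these assumptions, the LSTM-based models take at least 1.6 times as long to train as the CNN-based models; the true ratio depends on figures reported in the full paper rather than the abstract.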

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies