An Investigation of Recurrent Neural Architectures for Drug Name   Recognition

Raghavendra Chalapathy; Ehsan Zare Borzeshi; Massimo Piccardi

arXiv:1609.07585·cs.CL·September 27, 2016

An Investigation of Recurrent Neural Architectures for Drug Name Recognition

Raghavendra Chalapathy, Ehsan Zare Borzeshi, Massimo Piccardi

PDF

1 Repo

TL;DR

This paper evaluates recurrent neural network architectures for drug name recognition in biomedical texts, demonstrating that bidirectional LSTM-CRF models perform comparably to traditional hand-crafted systems.

Contribution

It investigates the effectiveness of modern recurrent neural architectures for DNR, highlighting the potential of neural models to replace handcrafted feature-based methods.

Findings

01

Bidirectional LSTM-CRF achieves performance close to specialized systems.

02

Recurrent neural architectures can effectively perform DNR from raw text.

03

Neural models reduce reliance on domain-specific feature engineering.

Abstract

Drug name recognition (DNR) is an essential step in the Pharmacovigilance (PV) pipeline. DNR aims to find drug name mentions in unstructured biomedical texts and classify them into predefined categories. State-of-the-art DNR approaches heavily rely on hand crafted features and domain specific resources which are difficult to collect and tune. For this reason, this paper investigates the effectiveness of contemporary recurrent neural architectures - the Elman and Jordan networks and the bidirectional LSTM with CRF decoding - at performing DNR straight from the text. The experimental results achieved on the authoritative SemEval-2013 Task 9.1 benchmarks show that the bidirectional LSTM-CRF ranks closely to highly-dedicated, hand-crafted systems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

raghavchalapathy/dnr
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSigmoid Activation · Tanh Activation · Conditional Random Field · Long Short-Term Memory