Word embeddings and recurrent neural networks based on Long-Short Term   Memory nodes in supervised biomedical word sense disambiguation

Antonio Jimeno Yepes

arXiv:1604.02506·cs.CL·December 20, 2016

Word embeddings and recurrent neural networks based on Long-Short Term Memory nodes in supervised biomedical word sense disambiguation

Antonio Jimeno Yepes

PDF

TL;DR

This paper demonstrates that combining word embeddings with traditional features and LSTM-based neural networks significantly improves supervised biomedical word sense disambiguation accuracy.

Contribution

It introduces the use of word embeddings and LSTM neural networks for biomedical word sense disambiguation, achieving state-of-the-art results.

Findings

01

Word embeddings enhance traditional feature performance.

02

LSTM classifiers outperform other models.

03

Achieved 95.97% macro accuracy on MSH WSD dataset.

Abstract

Word sense disambiguation helps identifying the proper sense of ambiguous words in text. With large terminologies such as the UMLS Metathesaurus ambiguities appear and highly effective disambiguation methods are required. Supervised learning algorithm methods are used as one of the approaches to perform disambiguation. Features extracted from the context of an ambiguous word are used to identify the proper sense of such a word. The type of features have an impact on machine learning methods, thus affect disambiguation performance. In this work, we have evaluated several types of features derived from the context of the ambiguous word and we have explored as well more global features derived from MEDLINE using word embeddings. Results show that word embeddings improve the performance of more traditional features and allow as well using recurrent neural network classifiers based on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSupport Vector Machine