An Encoder-Decoder Model for ICD-10 Coding of Death Certificates

Elena Tutubalina; Zulfat Miftahutdinov

arXiv:1712.01213·cs.CL·May 3, 2018·2 cites

An Encoder-Decoder Model for ICD-10 Coding of Death Certificates

Elena Tutubalina, Zulfat Miftahutdinov

PDF

Open Access

TL;DR

This paper presents an encoder-decoder neural network model for automatically assigning ICD-10 codes to death certificate texts, achieving high accuracy and outperforming previous methods.

Contribution

The work introduces a novel end-to-end neural architecture with knowledge integration for ICD-10 coding from death certificates.

Findings

01

Achieved 85.01% F-measure on benchmark dataset

02

Significant improvement over average participant scores

03

Demonstrated effectiveness of encoder-decoder models in medical coding

Abstract

Information extraction from textual documents such as hospital records and healthrelated user discussions has become a topic of intense interest. The task of medical concept coding is to map a variable length text to medical concepts and corresponding classification codes in some external system or ontology. In this work, we utilize recurrent neural networks to automatically assign ICD-10 codes to fragments of death certificates written in English. We develop end-to-end neural architectures directly tailored to the task, including basic encoder-decoder architecture for statistical translation. In order to incorporate prior knowledge, we concatenate cosine similarities vector among the text and dictionary entry to the encoded state. Being applied to a standard benchmark from CLEF eHealth 2017 challenge, our model achieved F-measure of 85.01% on a full test set with significant…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBiomedical Text Mining and Ontologies · Machine Learning in Healthcare · Topic Modeling