Auxiliary Knowledge-Induced Learning for Automatic Multi-Label Medical Document Classification

Xindi Wang; Robert E. Mercer; Frank Rudzicz

arXiv:2405.19084·cs.CL·May 30, 2024

Open Access

TL;DR

This paper presents a novel multi-label medical document classification method that integrates deep text encoding, auxiliary medical knowledge, and ICD code co-occurrence patterns to improve automatic ICD coding accuracy.

Contribution

It introduces a multi-level deep dilated residual convolution encoder combined with auxiliary knowledge and graph convolutional networks for enhanced ICD code prediction.

Findings

01. Achieves state-of-the-art performance on ICD coding tasks.

02. Effectively leverages auxiliary medical knowledge and code co-occurrence.

03. Improves classification accuracy over existing methods.

Abstract

The International Classification of Diseases (ICD) is an authoritative medical classification system of different diseases and conditions for clinical and management purposes. ICD indexing assigns a subset of ICD codes to a medical record. Since human coding is labour-intensive and error-prone, many studies employ machine learning to automate the coding process. ICD coding is a challenging task, as it needs to assign multiple codes to each medical document from an extremely large hierarchically organized collection. In this paper, we propose a novel approach for ICD indexing that adopts three ideas: (1) we use a multi-level deep dilated residual convolution encoder to aggregate the information from the clinical notes and learn document representations across different lengths of the texts; (2) we formalize the task of ICD classification with auxiliary knowledge of the medical records,…
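As a rough illustration of idea (1), the sketch below stacks dilated 1D convolutions with residual (skip) connections and keeps the output of every level. It is a minimal toy under stated assumptions, not the authors' encoder: the function names (`dilated_conv1d`, `residual_block`, `encode`), the scalar per-token input, the shared kernel, the ReLU activation, and the dilation schedule (1, 2, 4) are all hypothetical choices made here for clarity.

```python
from typing import List, Sequence


def dilated_conv1d(x: Sequence[float], kernel: Sequence[float], dilation: int) -> List[float]:
    """Same-length 1D dilated convolution (cross-correlation) with zero padding.

    Kernel taps are spaced `dilation` positions apart, so a larger dilation
    lets each output position see a wider span of the input.
    """
    center = len(kernel) // 2
    out = []
    for i in range(len(x)):
        acc = 0.0
        for j, w in enumerate(kernel):
            idx = i + (j - center) * dilation
            if 0 <= idx < len(x):  # positions outside the input count as zero
                acc += w * x[idx]
        out.append(acc)
    return out


def residual_block(x: Sequence[float], kernel: Sequence[float], dilation: int) -> List[float]:
    """Dilated convolution, then ReLU, then add the block input back (skip connection)."""
    conv = dilated_conv1d(x, kernel, dilation)
    activated = [max(0.0, v) for v in conv]
    return [a + b for a, b in zip(x, activated)]


def encode(x: Sequence[float], kernel: Sequence[float], dilations: Sequence[int] = (1, 2, 4)) -> List[List[float]]:
    """Stack residual blocks with growing dilation and return every level's output.

    Keeping all levels (an assumption of this sketch) gives representations
    that summarize the text at several context widths.
    """
    levels = []
    h = list(x)
    for d in dilations:
        h = residual_block(h, kernel, d)
        levels.append(h)
    return levels


if __name__ == "__main__":
    tokens = [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
    for level in encode(tokens, kernel=[0.5, 0.5, 0.5]):
        print(level)
```

With a kernel of width 3 and dilations 1, 2, and 4, each position in the last level depends on up to 1 + 2 × (1 + 2 + 4) = 15 input positions, which is how dilation widens context without adding kernel weights.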

Taxonomy

Topics: Text and Document Classification Technologies

Methods: Convolution