TL;DR
This paper introduces a label attention model for automatic ICD coding from clinical notes, effectively handling variable text fragment lengths and code interdependence, while addressing data imbalance through hierarchical joint learning, achieving state-of-the-art results.
Contribution
The paper presents a novel label attention model with hierarchical joint learning to improve ICD coding accuracy, especially for infrequent codes, outperforming previous CNN-based models.
Findings
Achieves new state-of-the-art results on MIMIC datasets.
Effectively handles variable text fragment lengths and code interdependence.
Improves performance on infrequent ICD codes.
Abstract
ICD coding is a process of assigning the International Classification of Disease diagnosis codes to clinical/medical notes documented by health professionals (e.g. clinicians). This process requires significant human resources, and thus is costly and prone to error. To handle the problem, machine learning has been utilized for automatic ICD coding. Previous state-of-the-art models were based on convolutional neural networks, using a single/several fixed window sizes. However, the lengths and interdependence between text fragments related to ICD codes in clinical text vary significantly, leading to the difficulty of deciding what the best window sizes are. In this paper, we propose a new label attention model for automatic ICD coding, which can handle both the various lengths and the interdependence of the ICD code related text fragments. Furthermore, as the majority of ICD codes are not…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
