ICD Coding from Clinical Text Using Multi-Filter Residual Convolutional Neural Network
Fei Li, Hong Yu

TL;DR
This paper introduces a Multi-Filter Residual CNN that effectively captures diverse text patterns for automated ICD coding, outperforming previous models on standard clinical datasets.
Contribution
The paper proposes a novel Multi-Filter Residual CNN architecture that enhances document representation for ICD coding by capturing varied text patterns and enlarging the receptive field.
Findings
Outperforms state-of-the-art models on MIMIC datasets
Effective in capturing diverse text patterns for ICD coding
Achieves superior evaluation metrics across multiple benchmarks
Abstract
Automated ICD coding, which assigns the International Classification of Disease codes to patient visits, has attracted much research attention since it can save time and labor for billing. The previous state-of-the-art model utilized one convolutional layer to build document representations for predicting ICD codes. However, the lengths and grammar of text fragments, which are closely related to ICD coding, vary a lot in different documents. Therefore, a flat and fixed-length convolutional architecture may not be capable of learning good document representations. In this paper, we proposed a Multi-Filter Residual Convolutional Neural Network (MultiResCNN) for ICD coding. The innovations of our model are two-folds: it utilizes a multi-filter convolutional layer to capture various text patterns with different lengths and a residual convolutional layer to enlarge the receptive field. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Biomedical Text Mining and Ontologies · Natural Language Processing Techniques
