Generalized Zero-shot ICD Coding
Congzheng Song, Shanghang Zhang, Najmeh Sadoughi, Pengtao Xie, Eric, Xing

TL;DR
This paper introduces a novel adversarial generative model for generalized zero-shot ICD coding, significantly improving prediction accuracy for unseen codes while maintaining performance on seen codes, using hierarchical and semantic features.
Contribution
It proposes the first adversarial generative approach for generalized zero-shot multi-label text classification, leveraging hierarchical structure and semantic reconstruction.
Findings
Zero-shot F1 score improved from nearly 0 to 20.91%.
AUC score increased by 3% absolute.
Framework also enhances few-shot code performance.
Abstract
The International Classification of Diseases (ICD) is a list of classification codes for the diagnoses. Automatic ICD coding is in high demand as the manual coding can be labor-intensive and error-prone. It is a multi-label text classification task with extremely long-tailed label distribution, making it difficult to perform fine-grained classification on both frequent and zero-shot codes at the same time. In this paper, we propose a latent feature generation framework for generalized zero-shot ICD coding, where we aim to improve the prediction on codes that have no labeled data without compromising the performance on seen codes. Our framework generates pseudo features conditioned on the ICD code descriptions and exploits the ICD code hierarchical structure. To guarantee the semantic consistency between the generated features and real features, we reconstruct the keywords in the input…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Topic Modeling · Natural Language Processing Techniques
