Few-Shot Electronic Health Record Coding through Graph Contrastive Learning
Shanshan Wang, Pengjie Ren, Zhumin Chen, Zhaochun Ren, Huasheng Liang,, Qiang Yan, Evangelos Kanoulas, Maarten de Rijke

TL;DR
This paper introduces CoGraph, a graph contrastive learning framework that enhances EHR coding accuracy for both frequent and rare ICD codes by leveraging a heterogeneous graph structure and few-shot learning techniques.
Contribution
It proposes a novel graph contrastive learning approach, CoGraph, that effectively transfers information between ICD codes in a few-shot setting, improving coding performance on rare and frequent codes.
Findings
Significantly outperforms state-of-the-art methods on MIMIC-III dataset.
Improves classification accuracy and F1 scores for both frequent and rare ICD codes.
Demonstrates the effectiveness of graph contrastive learning in medical coding tasks.
Abstract
Electronic health record (EHR) coding is the task of assigning ICD codes to each EHR. Most previous studies either only focus on the frequent ICD codes or treat rare and frequent ICD codes in the same way. These methods perform well on frequent ICD codes but due to the extremely unbalanced distribution of ICD codes, the performance on rare ones is far from satisfactory. We seek to improve the performance for both frequent and rare ICD codes by using a contrastive graph-based EHR coding framework, CoGraph, which re-casts EHR coding as a few-shot learning task. First, we construct a heterogeneous EHR word-entity (HEWE) graph for each EHR, where the words and entities extracted from an EHR serve as nodes and the relations between them serve as edges. Then, CoGraph learns similarities and dissimilarities between HEWE graphs from different ICD codes so that information can be transferred…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Topic Modeling · Traditional Chinese Medicine Studies
MethodsContrastive Learning
