Bridging Electronic Health Records and Clinical Texts: Contrastive Learning for Enhanced Clinical Tasks
Sara Ketabi, Dhanesh Ramachandram

TL;DR
This paper introduces a contrastive learning framework that aligns structured EHR data with unstructured clinical texts, significantly improving performance on clinical prediction tasks like hospital readmission.
Contribution
It presents a novel deep multimodal contrastive learning approach that enhances EHR models by integrating unstructured clinical notes for better contextual understanding.
Findings
4.1% AUROC improvement in readmission prediction
Effective alignment of EHR and clinical texts
Enhanced downstream clinical task performance
Abstract
Conventional machine learning models, particularly tree-based approaches, have demonstrated promising performance across various clinical prediction tasks using electronic health record (EHR) data. Despite their strengths, these models struggle with tasks that require deeper contextual understanding, such as predicting 30-day hospital readmission. This can be primarily due to the limited semantic information available in structured EHR data. To address this limitation, we propose a deep multimodal contrastive learning (CL) framework that aligns the latent representations of structured EHR data with unstructured discharge summary notes. It works by pulling together paired EHR and text embeddings while pushing apart unpaired ones. Fine-tuning the pretrained EHR encoder extracted from this framework significantly boosts downstream task performance, e.g., a 4.1% AUROC enhancement over…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
