EMERGE: Enhancing Multimodal Electronic Health Records Predictive Modeling with Retrieval-Augmented Generation
Yinghao Zhu, Changyu Ren, Zixiang Wang, Xiaochen Zheng, Shiyun Xie,, Junlan Feng, Xi Zhu, Zhoujun Li, Liantao Ma, Chengwei Pan

TL;DR
EMERGE is a novel framework that enhances multimodal EHR predictive modeling by integrating retrieval-augmented generation with medical knowledge, improving clinical prediction accuracy.
Contribution
It introduces a retrieval-augmented generation approach that extracts and aligns medical entities from multimodal EHR data with knowledge graphs for richer clinical context.
Findings
Outperforms baseline models on MIMIC datasets
Demonstrates robustness to data sparsity
Provides effective patient health summaries
Abstract
The integration of multimodal Electronic Health Records (EHR) data has significantly advanced clinical predictive capabilities. Existing models, which utilize clinical notes and multivariate time-series EHR data, often fall short of incorporating the necessary medical context for accurate clinical tasks, while previous approaches with knowledge graphs (KGs) primarily focus on structured knowledge extraction. In response, we propose EMERGE, a Retrieval-Augmented Generation (RAG) driven framework to enhance multimodal EHR predictive modeling. We extract entities from both time-series data and clinical notes by prompting Large Language Models (LLMs) and align them with professional PrimeKG, ensuring consistency. In addition to triplet relationships, we incorporate entities' definitions and descriptions for richer semantics. The extracted knowledge is then used to generate task-relevant…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare
MethodsFocus
