Hierarchical Pretraining on Multimodal Electronic Health Records

Xiaochen Wang; Junyu Luo; Jiaqi Wang; Ziyi Yin; Suhan Cui; Yuan Zhong; Yaqing Wang; Fenglong Ma

arXiv:2310.07871 · cs.AI · October 23, 2023 · 2 cites

TL;DR

This paper introduces MEDHMP, a hierarchical pretraining framework for multimodal electronic health records, improving generalization across diverse medical tasks by capturing the data's hierarchical structure.

Contribution

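The paper presents a unified pretraining method tailored to hierarchically structured multimodal EHR data. It targets a specific limitation of existing EHR pretrained models: they do not capture the data's hierarchy, which limits how well a single pretrained model generalizes across diverse downstream tasks.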

Findings

01 Outperforms 18 baseline models across 8 downstream tasks

02 Effectively captures hierarchical and multimodal features of EHR data

03 Improves generalization of a single pretrained model across diverse downstream medical tasks

Abstract

Pretraining has proven to be a powerful technique in natural language processing (NLP), exhibiting remarkable success in various NLP downstream tasks. However, in the medical domain, existing pretrained models on electronic health records (EHR) fail to capture the hierarchical nature of EHR data, limiting their generalization capability across diverse downstream tasks using a single pretrained model. To tackle this challenge, this paper introduces a novel, general, and unified pretraining framework called MEDHMP, specifically designed for hierarchically multimodal EHR data. The effectiveness of the proposed MEDHMP is demonstrated through experimental results on eight downstream tasks spanning three levels. Comparisons against eighteen baselines further highlight the efficacy of our approach.

Code & Models

Repositories

xiaochenwang-psu/medhmp
PyTorch · Official

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Speech and dialogue systems