Unsupervised Pre-Training on Patient Population Graphs for Patient-Level Predictions
Chantal Pellegrini, Anees Kazi, Nassir Navab

TL;DR
This paper introduces an unsupervised pre-training approach using graph transformer models on heterogeneous EHR data to improve patient outcome predictions, demonstrating significant performance gains on medical datasets.
Contribution
It develops a novel graph transformer architecture for multi-modal EHR data and proposes masked imputation pre-training methods for better patient-level predictions.
Findings
Pre-training improves AUC by 4.15% on MIMIC-III.
Pre-training improves AUC by 7.64% on TADPOLE.
Graph-based pre-training enhances modeling of population-level medical data.
Abstract
Pre-training has shown success in different areas of machine learning, such as Computer Vision (CV), Natural Language Processing (NLP) and medical imaging. However, it has not been fully explored for clinical data analysis. Even though an immense amount of Electronic Health Record (EHR) data is recorded, data and labels can be scarce if the data is collected in small hospitals or deals with rare diseases. In such scenarios, pre-training on a larger set of EHR data could improve the model performance. In this paper, we apply unsupervised pre-training to heterogeneous, multi-modal EHR data for patient outcome prediction. To model this data, we leverage graph deep learning over population graphs. We first design a network architecture based on graph transformer designed to handle various input feature types occurring in EHR data, like continuous, discrete, and time-series features,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Artificial Intelligence in Healthcare · Artificial Intelligence in Healthcare and Education
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Laplacian EigenMap · Byte Pair Encoding · Residual Connection · Position-Wise Feed-Forward Layer · Dense Connections · Softmax · Label Smoothing
