Unsupervised Pre-Training on Patient Population Graphs for Patient-Level   Predictions

Chantal Pellegrini; Anees Kazi; Nassir Navab

arXiv:2203.12616·cs.LG·June 10, 2022

Unsupervised Pre-Training on Patient Population Graphs for Patient-Level Predictions

Chantal Pellegrini, Anees Kazi, Nassir Navab

PDF

Open Access 2 Repos

TL;DR

This paper introduces an unsupervised pre-training approach using graph transformer models on heterogeneous EHR data to improve patient outcome predictions, demonstrating significant performance gains on medical datasets.

Contribution

It develops a novel graph transformer architecture for multi-modal EHR data and proposes masked imputation pre-training methods for better patient-level predictions.

Findings

01

Pre-training improves AUC by 4.15% on MIMIC-III.

02

Pre-training improves AUC by 7.64% on TADPOLE.

03

Graph-based pre-training enhances modeling of population-level medical data.

Abstract

Pre-training has shown success in different areas of machine learning, such as Computer Vision (CV), Natural Language Processing (NLP) and medical imaging. However, it has not been fully explored for clinical data analysis. Even though an immense amount of Electronic Health Record (EHR) data is recorded, data and labels can be scarce if the data is collected in small hospitals or deals with rare diseases. In such scenarios, pre-training on a larger set of EHR data could improve the model performance. In this paper, we apply unsupervised pre-training to heterogeneous, multi-modal EHR data for patient outcome prediction. To model this data, we leverage graph deep learning over population graphs. We first design a network architecture based on graph transformer designed to handle various input feature types occurring in EHR data, like continuous, discrete, and time-series features,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Artificial Intelligence in Healthcare · Artificial Intelligence in Healthcare and Education

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Laplacian EigenMap · Byte Pair Encoding · Residual Connection · Position-Wise Feed-Forward Layer · Dense Connections · Softmax · Label Smoothing