SMART: Towards Pre-trained Missing-Aware Model for Patient Health Status Prediction
Zhihao Yu, Xu Chu, Yujie Jin, Yasha Wang, Junfeng Zhao

TL;DR
SMART introduces a self-supervised, missing-aware representation learning method that effectively handles missing data in EHRs, improving patient health status prediction accuracy and robustness.
Contribution
It proposes a novel self-supervised pre-training approach with missing-aware attentions to better encode and impute missing EHR data.
Findings
Outperforms state-of-the-art methods on six EHR tasks.
Enhances robustness to missing data.
Improves generalization in health status prediction.
Abstract
Electronic health record (EHR) data has emerged as a valuable resource for analyzing patient health status. However, the prevalence of missing data in EHR poses significant challenges to existing methods, leading to spurious correlations and suboptimal predictions. While various imputation techniques have been developed to address this issue, they often obsess unnecessary details and may introduce additional noise when making clinical predictions. To tackle this problem, we propose SMART, a Self-Supervised Missing-Aware RepresenTation Learning approach for patient health status prediction, which encodes missing information via elaborated attentions and learns to impute missing values through a novel self-supervised pre-training approach that reconstructs missing data representations in the latent space. By adopting missing-aware attentions and focusing on learning higher-order…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare
