TL;DR
This paper introduces TA-DualCV, a novel time-aware method for imputing missing data in EHRs that leverages temporal and feature dependencies to improve early prediction of septic shock.
Contribution
The paper presents TA-DualCV, a new imputation approach that captures both within- and cross-visit dependencies and temporal patterns in EHR data, outperforming existing methods.
Findings
TA-DualCV significantly outperforms state-of-the-art imputation methods.
It improves 24-hour septic shock prediction accuracy.
Effective with up to 90% missing data.
Abstract
Real-world Electronic Health Records (EHRs) are often plagued by a high rate of missing data. In our EHRs, for example, the missing rates can be as high as 90% for some features, with an average missing rate of around 70% across all features. We propose a Time-Aware Dual-Cross-Visit missing value imputation method, named TA-DualCV, which spontaneously leverages multivariate dependencies across features and longitudinal dependencies both within- and cross-visit to maximize the information extracted from limited observable records in EHRs. Specifically, TA-DualCV captures the latent structure of missing patterns across measurements of different features and it also considers the time continuity and capture the latent temporal missing patterns based on both time-steps and irregular time-intervals. TA-DualCV is evaluated using three large real-world EHRs on two types of tasks: an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
