Comparing methods for handling missing data in electronic health records for dynamic risk prediction of central-line associated bloodstream infection
Shan Gao, Elena Albu, Pieter Stijnen, Frank Rademakers, Veerle Cossey, Yves Debaveye, Christel Janssens, Ben Van Calster, Laure Wynants

TL;DR
This study compares various imputation strategies for missing data in electronic health records to improve dynamic risk prediction of bloodstream infections, finding that missing indicators often outperform other methods.
Contribution
It demonstrates that missing indicators can be highly effective for risk prediction in EHR data, especially when combined with other imputation methods.
Findings
Missing indicators achieved the highest AUROC (up to 0.782).
Combining missing indicators with mixed models improved performance at day 4.
Using missing indicators alone or with other methods enhances dynamic risk prediction.
Abstract
Electronic health records (EHR) often contain varying levels of missing data. This study compared different imputation strategies to identify the most suitable approach for predicting central line-associated bloodstream infection (CLABSI) in the presence of competing risks using EHR data. We analyzed 30862 catheter episodes at University Hospitals Leuven (2012-2013) to predict 7-day CLABSI risk using a landmark cause-specific supermodel, accounting for competing risks of hospital discharge and death. Imputation methods included simple methods (median/mode, last observation carried forward), multiple imputation, regression-based and mixed-effects models leveraging longitudinal structure, and random forest imputation to capture interactions and non-linearities. Missing indicators were also assessed alone and in combination with other imputation methods. Model performance was evaluated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSepsis Diagnosis and Treatment · Antimicrobial Resistance in Staphylococcus · Surgical site infection prevention
