Integrated Analysis for Electronic Health Records with Structured and Sporadic Missingness
Jianbin Tan, Yan Zhang, Chuan Hong, T. Tony Cai, Tianxi Cai, Anru R. Zhang

TL;DR
This paper introduces Macomss, a novel imputation method designed for electronic health records with complex missing data patterns, improving data integration and analysis accuracy in healthcare research.
Contribution
The paper presents Macomss, a new imputation framework with theoretical guarantees, tailored for structured and sporadic missingness in EHR data, validated through simulations and real-world datasets.
Findings
Outperforms existing imputation methods in simulations
Achieves lowest imputation errors in DUHS datasets
Provides superior or comparable downstream prediction performance
Abstract
Objectives: We propose a novel imputation method tailored for Electronic Health Records (EHRs) with structured and sporadic missingness. Such missingness frequently arises in the integration of heterogeneous EHR datasets for downstream clinical applications. By addressing these gaps, our method provides a practical solution for integrated analysis, enhancing data utility and advancing the understanding of population health. Materials and Methods: We begin by demonstrating structured and sporadic missing mechanisms in the integrated analysis of EHR data. Following this, we introduce a novel imputation framework, Macomss, specifically designed to handle structurally and heterogeneously occurring missing data. We establish theoretical guarantees for Macomss, ensuring its robustness in preserving the integrity and reliability of integrated analyses. To assess its empirical performance, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Chronic Disease Management Strategies · Artificial Intelligence in Healthcare
