ELMV: an Ensemble-Learning Approach for Analyzing Electrical Health Records with Significant Missing Values
Lucas J. Liu, Hongwei Zhang, Jianzhong Di, Jin Chen

TL;DR
The paper introduces ELMV, an ensemble-learning framework designed to analyze electronic health records with high missing data, reducing bias and improving accuracy over traditional imputation methods.
Contribution
ELMV is a novel ensemble-learning approach that constructs multiple low-missing-rate data subsets and uses a support set to enhance analysis of incomplete EHR data.
Findings
ELMV outperforms traditional imputation methods in real-world healthcare data.
ELMV achieves higher accuracy in outcome prediction across various missing data rates.
ELMV effectively identifies critical features despite substantial missing values.
Abstract
Many real-world Electronic Health Record (EHR) data contains a large proportion of missing values. Leaving substantial portion of missing information unaddressed usually causes significant bias, which leads to invalid conclusion to be drawn. On the other hand, training a machine learning model with a much smaller nearly-complete subset can drastically impact the reliability and accuracy of model inference. Data imputation algorithms that attempt to replace missing data with meaningful values inevitably increase the variability of effect estimates with increased missingness, making it unreliable for hypothesis validation. We propose a novel Ensemble-Learning for Missing Value (ELMV) framework, which introduces an effective approach to construct multiple subsets of the original EHR data with a much lower missing rate, as well as mobilizing a dedicated support set for the ensemble learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Fault Diagnosis Techniques · Electricity Theft Detection Techniques · Energy Load and Power Forecasting
