Time-dependent Iterative Imputation for Multivariate Longitudinal Clinical Data
Omer Noy, Ron Shamir

TL;DR
This paper introduces TDI, a novel time-dependent iterative imputation method that effectively handles missing data in multivariate longitudinal clinical datasets, improving imputation accuracy and downstream risk prediction.
Contribution
The paper presents TDI, a new imputation approach combining forward-filling and iterative imputation with dynamic weighting tailored for clinical time-series data.
Findings
TDI outperformed state-of-the-art methods on clinical datasets.
TDI achieved lower root-mean-squared-error in imputations.
Imputation with TDI improved risk prediction accuracy.
Abstract
Missing data is a major challenge in clinical research. In electronic medical records, often a large fraction of the values in laboratory tests and vital signs are missing. The missingness can lead to biased estimates and limit our ability to draw conclusions from the data. Additionally, many machine learning algorithms can only be applied to complete datasets. A common solution is data imputation, the process of filling-in the missing values. However, some of the popular imputation approaches perform poorly on clinical data. We developed a simple new approach, Time-Dependent Iterative imputation (TDI), which offers a practical solution for imputing time-series data. It addresses both multivariate and longitudinal data, by integrating forward-filling and Iterative Imputer. The integration employs a patient, variable, and observation-specific dynamic weighting strategy, based on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Statistical Methods and Bayesian Inference · Meta-analysis and systematic reviews
