Beyond Random Missingness: Clinically Rethinking for Healthcare Time Series Imputation
Linglong Qian, Yiyuan Yang, Wenjie Du, Jun Wang, Richard Dobsoni and, Zina Ibrahim

TL;DR
This paper highlights the importance of clinically-informed masking strategies in healthcare time series imputation, showing that current random masking approaches may not accurately reflect real missing data patterns and affect model performance.
Contribution
It introduces the analysis of different masking strategies in healthcare time series imputation, emphasizing the need for clinically-relevant evaluation methods.
Findings
Masking choices significantly impact imputation accuracy.
Recurrent models perform more consistently across masking strategies.
Imputation accuracy does not always improve downstream clinical predictions.
Abstract
This study investigates the impact of masking strategies on time series imputation models in healthcare settings. While current approaches predominantly rely on random masking for model evaluation, this practice fails to capture the structured nature of missing patterns in clinical data. Using the PhysioNet Challenge 2012 dataset, we analyse how different masking implementations affect both imputation accuracy and downstream clinical predictions across eleven imputation methods. Our results demonstrate that masking choices significantly influence model performance, while recurrent architectures show more consistent performance across strategies. Analysis of downstream mortality prediction reveals that imputation accuracy doesn't necessarily translate to optimal clinical prediction capabilities. Our findings emphasise the need for clinically-informed masking strategies that better…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting
