Benchmarking imputation strategies for missing time-series data in critical care using real-world-inspired scenarios
Michael Poette, Sandrine Mouysset, Daniel Ruiz, Vincent Pey, Jean-Marc Alliot, Vincent Minville

TL;DR
This paper compares methods for imputing missing values in ICU time-series data, finding that Transformer and GAN models perform best overall while linear interpolation remains a strong baseline.
Contribution
The study benchmarks time-series imputation strategies on MIMIC-IV monitoring data, using masking scenarios designed to reflect the missingness patterns observed in the ICU.
Findings
Transformer and GAN models achieved the best overall performance in imputing missing ICU data.
Linear interpolation remained a strong baseline despite its simplicity.
Results varied significantly depending on the type of missingness scenario.
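To illustrate why the missingness scenario matters, here is a minimal sketch that applies one method (LOCF) under two masks hiding the same number of points. The synthetic sine trace, the two masks ("scattered" isolated drops versus one contiguous "block" gap), and all function names are assumptions made for illustration. They are not the scenarios or the data used in the paper.

```python
import math

def locf(values):
    """Last observation carried forward; a leading gap takes the first observed value."""
    last = next(v for v in values if v is not None)
    out = []
    for v in values:
        if v is not None:
            last = v
        out.append(last)
    return out

def masked_mae(truth, mask):
    """Hide the points where mask is True, impute with LOCF, and score only those points."""
    observed = [None if m else v for v, m in zip(truth, mask)]
    imputed = locf(observed)
    hidden = [i for i, m in enumerate(mask) if m]
    return sum(abs(truth[i] - imputed[i]) for i in hidden) / len(hidden)

n = 120
# Hypothetical smooth vital-sign trace (one sample per time step).
truth = [80 + 10 * math.sin(2 * math.pi * t / 60) for t in range(n)]

# Two illustrative masks with the same number of hidden points (10 each).
scattered = [t % 12 == 6 for t in range(n)]   # isolated single-sample drops
block = [30 <= t < 40 for t in range(n)]      # one contiguous gap

scattered_mae = masked_mae(truth, scattered)
block_mae = masked_mae(truth, block)
print(f"scattered: {scattered_mae:.2f}  block: {block_mae:.2f}")
```

With the same number of hidden samples, the contiguous gap gives a larger error here because the carried-forward value drifts further from the signal the longer the gap lasts.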
Abstract
Handling missing data remains a central challenge in Intensive Care Unit (ICU) time-series analysis, where gaps frequently arise from non-random mechanisms such as sensor disconnections and workflow-driven interruptions. In this study, we benchmarked multiple imputation strategies on monitoring data from MIMIC-IV and designed masking scenarios that reflect ICU missingness patterns observed in the database, thereby approximating real-world conditions and clarifying how conclusions depend on both the chosen imputation method and the missingness scenario. We compared commonly used simple statistical approaches (mean, LOCF, interpolation), classical machine learning techniques (MICE, MissForest), and several deep learning architectures (Transformers, RNNs, GANs, VAEs). Transformer and GAN models achieved the best overall performance, whereas linear interpolation remained a strong baseline.…
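The three simple statistical baselines named in the abstract (mean, LOCF, linear interpolation) can be sketched in a few lines. The example below is a rough illustration, not the authors' implementation. The synthetic trace, the ten-sample gap standing in for a sensor disconnection, the use of mean absolute error, and all function names are assumptions.

```python
import math

def mean_impute(values):
    """Replace each None with the mean of the observed values."""
    observed = [v for v in values if v is not None]
    fill = sum(observed) / len(observed)
    return [fill if v is None else v for v in values]

def locf_impute(values):
    """Last observation carried forward; a leading gap takes the first observed value."""
    last = next(v for v in values if v is not None)
    out = []
    for v in values:
        if v is not None:
            last = v
        out.append(last)
    return out

def linear_impute(values):
    """Linear interpolation between the nearest observed neighbours; edges are held constant."""
    idx = [i for i, v in enumerate(values) if v is not None]
    out = list(values)
    for i, v in enumerate(values):
        if v is not None:
            continue
        left = max((j for j in idx if j < i), default=None)
        right = min((j for j in idx if j > i), default=None)
        if left is None:
            out[i] = values[right]
        elif right is None:
            out[i] = values[left]
        else:
            w = (i - left) / (right - left)
            out[i] = values[left] + w * (values[right] - values[left])
    return out

def mae(truth, imputed, hidden):
    """Mean absolute error over the artificially hidden positions only."""
    return sum(abs(truth[i] - imputed[i]) for i in hidden) / len(hidden)

# Hypothetical smooth vital-sign trace with one artificial contiguous gap.
truth = [80 + 10 * math.sin(2 * math.pi * t / 60) for t in range(120)]
gap = list(range(40, 50))
masked = [None if t in gap else v for t, v in enumerate(truth)]

scores = {
    "mean": mae(truth, mean_impute(masked), gap),
    "locf": mae(truth, locf_impute(masked), gap),
    "linear": mae(truth, linear_impute(masked), gap),
}
for name, score in scores.items():
    print(f"{name}: {score:.2f}")
```

Scoring only the hidden positions mirrors the general idea of masking known values and comparing the imputed values against them. The learned models in the benchmark (MICE, MissForest, and the deep learning architectures) would plug into the same loop as additional imputation functions.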
Taxonomy
Topics: Sepsis Diagnosis and Treatment · Healthcare Technology and Patient Monitoring · Machine Learning in Healthcare
