# Benchmarking imputation strategies for missing time-series data in critical care using real-world-inspired scenarios

**Authors:** Michael Poette, Sandrine Mouysset, Daniel Ruiz, Vincent Pey, Jean-Marc Alliot, Vincent Minville

PMC · DOI: 10.1038/s41598-026-39035-z · 2026-02-10

## TL;DR

This paper compares different methods for filling in missing data in ICU time-series data, finding that deep learning models perform best but simpler methods like linear interpolation are often nearly as good.

## Contribution

The study introduces a practical framework for evaluating time-series imputation strategies under realistic ICU conditions.

## Key findings

- Transformer and GAN models achieved the best overall performance in imputing missing ICU data.
- Linear interpolation remained a strong baseline despite its simplicity.
- Results varied significantly depending on the type of missingness scenario.

## Abstract

Handling missing data remains a central challenge in Intensive Care Units (ICU) time-series analysis, where gaps frequently arise from non-random mechanisms such as sensor disconnections and workflow-driven interruptions. In this study, we benchmarked multiple imputation strategies on monitoring data from MIMIC-IV and designed masking scenarios that reflect ICU missingness patterns observed in the database, thereby approximating real-world conditions and clarifying how conclusions depend on both the chosen imputation method and the missingness scenario. We compared commonly used simple statistical approaches (mean, LOCF, interpolation), classical machine learning techniques (MICE, MissForest), and several deep learning architectures (Transformers, RNNs, GANs, VAEs). Transformer and GAN models achieved the best overall performance, whereas linear interpolation remained a strong baseline. Crucially, results were scenario-dependent: MCAR produced optimistic error estimates and compressed differences between methods, whereas structured gaps revealed clearer performance separations. Our findings suggest that, while deep learning methods improve overall imputation accuracy, linear interpolation is often nearly as effective and offers a lighter, more interpretable approach. This work introduces a practical framework for evaluating time-series imputation strategies under realistic constraints, with a focus on clinical relevance. Further analysis of downstream impact under clinically realistic scenarios and using tailored imputation strategies by variable type remains needed.

## Full-text entities

- **Genes:** Mbp (myelin basic protein) [NCBI Gene 17196] {aka Hmbpr, golli-mbp, jve, mld, shi}
- **Diseases:** MIMIC-IV (MESH:D006011)
- **Chemicals:** SpO (-), Oxygen (MESH:D010100)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Mutations:** S093336572300101X

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12960818/full.md

---
Source: https://tomesphere.com/paper/PMC12960818