A Reproducible Post-Valve-Replacement EHR Cohort for Comparative AI Studies
Malte Blattmann, Mika Katalinic, Adrian Lindenmeyer, Stefan Franke, Thomas Neumuth, Daniel Schneider

TL;DR
This paper introduces a reproducible EHR dataset for valve replacement patients to evaluate AI models in predicting postoperative risks.
Contribution
The novel contribution is a publicly available pipeline and benchmark for longitudinal EHR analysis in post-valve replacement care.
Findings
ICU readmission predicted in-hospital and 100-day outcomes like mortality and complications.
A sequential Transformer model outperformed non-sequential models with 0.87 AUROC and 0.69 AUPRC.
Abstract
Background/Objectives: Valve replacement (VR) patients are at high risk of postoperative complications, but reproducible Electronic Health Record (EHR) benchmarks for evaluating sequential AI models in this setting are lacking. We develop a reproducible pipeline that extracts two EHR datasets from MIMIC-IV (a general-purpose and a predictive benchmark dataset) capturing perioperative histories, high-resolution time-series, and clinically motivated outcome labels. Methods: The cohort comprises 3890 VR patients with clinician-guided feature selection across diagnoses, procedures, laboratory measurements, medications, and physiological monitoring. As an exemplary use case, we define ICU readmission at first ICU discharge as a surrogate for postoperative risk and derive a predictive benchmark under strict label-leakage control. We then compare a Transformer model trained on tokenized…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Sepsis Diagnosis and Treatment · Electronic Health Records Systems
