A Multidatabase ExTRaction PipEline (METRE) for Facile Cross Validation in Critical Care Research
Wei Liao, Joel Voldman

TL;DR
This paper introduces METRE, an open-source extraction pipeline that simplifies cross-institutional EHR data analysis from MIMIC-IV and eICU, enabling robust model validation for critical care research.
Contribution
The paper presents a novel, versatile extraction pipeline that supports cross-validation across multiple ICU databases, addressing limitations of prior tools and facilitating clinical model deployment.
Findings
Extracted over 38,000 MIMIC-IV ICU records and 126,000 eICU records.
Achieved comparable AUC performance (0.723-0.888) on mortality prediction tasks.
Demonstrated minimal AUC variation when applying models across datasets.
Abstract
Transforming raw EHR data into machine learning model-ready inputs requires considerable effort. One widely used EHR database is Medical Information Mart for Intensive Care (MIMIC). Prior work on MIMIC-III cannot query the updated and improved MIMIC-IV version. Besides, the need to use multicenter datasets further highlights the challenge of EHR data extraction. Therefore, we developed an extraction pipeline that works on both MIMIC-IV and eICU Collaborative Research Database and allows for model cross validation using these 2 databases. Under the default choices, the pipeline extracted 38766 and 126448 ICU records for MIMIC-IV and eICU, respectively. Using the extracted time-dependent variables, we compared the Area Under the Curve (AUC) performance with prior works on clinically relevant tasks such as in-hospital mortality prediction. METRE achieved comparable performance with AUC…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Sepsis Diagnosis and Treatment
