Protocol for processing multivariate time-series electronic health records of COVID-19 patients
Zixiang Wang, Yinghao Zhu, Dehao Sui, Tianlong Wang, Yuntao Zhang, Yasha Wang, Chengwei Pan, Junyi Gao, Liantao Ma, Ling Wang, Xiaoyun Zhang

TL;DR
This paper introduces a standardized protocol for processing complex electronic health records of COVID-19 patients to improve AI-based predictions of hospital outcomes.
Contribution
A detailed, reproducible protocol for standardizing and processing multivariate time-series EHR data for AI model training in the context of COVID-19.
Findings
The protocol includes steps for data standardization, formatting, and model training.
It focuses on predicting in-hospital mortality and length of stay for COVID-19 patients.
The method aims to improve the accuracy of predictive models by addressing data processing inconsistencies.
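As a rough illustration of what standardizing and formatting multivariate time-series EHR data can involve, the sketch below reshapes long-format visit records into one row per patient visit, fills gaps within each patient, and attaches the two prediction targets. It is not the authors' implementation. The column names (`patient_id`, `record_time`, `feature`, `value`, `discharge_time`, `died`), the function name `standardize_ehr`, the forward-fill imputation, and the definition of length of stay as days remaining until discharge are all assumptions made for this example.

```python
import pandas as pd


def standardize_ehr(records: pd.DataFrame, admissions: pd.DataFrame) -> pd.DataFrame:
    """Hypothetical helper: long-format records -> one row per patient visit."""
    # Pivot long records to wide; repeated measurements at one time are averaged.
    wide = records.pivot_table(
        index=["patient_id", "record_time"],
        columns="feature",
        values="value",
        aggfunc="mean",
    ).reset_index()
    wide.columns.name = None
    wide = wide.sort_values(["patient_id", "record_time"])

    # Forward-fill missing values within each patient only (assumed imputation).
    feature_cols = [c for c in wide.columns if c not in ("patient_id", "record_time")]
    wide[feature_cols] = wide.groupby("patient_id")[feature_cols].ffill()

    # Attach targets: mortality flag and remaining length of stay in days.
    wide = wide.merge(admissions, on="patient_id", how="left")
    remaining = wide["discharge_time"] - wide["record_time"]
    wide["los_days"] = remaining.dt.total_seconds() / 86400
    return wide


# Toy data, invented purely for illustration.
records = pd.DataFrame({
    "patient_id": ["p1", "p1", "p1", "p1", "p2", "p2"],
    "record_time": pd.to_datetime([
        "2020-02-01", "2020-02-01", "2020-02-02",
        "2020-02-03", "2020-02-01", "2020-02-02",
    ]),
    "feature": ["hr", "hr", "temp", "hr", "temp", "hr"],
    "value": [80.0, 90.0, 37.5, 100.0, 38.0, 70.0],
})
admissions = pd.DataFrame({
    "patient_id": ["p1", "p2"],
    "discharge_time": pd.to_datetime(["2020-02-05", "2020-02-04"]),
    "died": [0, 1],
})

out = standardize_ehr(records, admissions)
print(out)
```

Filling within each patient rather than across the whole table keeps one patient's measurements from leaking into another's record; values that are missing before a patient's first measurement stay missing and would need a separate, explicitly chosen default.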
Abstract
The lack of standardized techniques for processing complex health data from COVID-19 patients hinders the development of accurate predictive models in healthcare. To address this, we present a protocol for utilizing real-world multivariate time-series electronic health records of COVID-19 patients. We describe steps covering the necessary setup, data standardization, and formatting. We then provide detailed instructions for creating datasets and for training and evaluating AI models designed to predict two key outcomes: in-hospital mortality and length of stay. For complete details on the use and execution of this protocol, please refer to Gao et al.1
• Steps for standardizing multivariate time-series EHR data format of COVID-19 patients
• Instructions for processing EHR data of COVID-19 patients for training AI models
• Guidance on training and evaluating AI models through tailored…
Taxonomy
Topics: Machine Learning in Healthcare · Anomaly Detection Techniques and Applications · Time Series Analysis and Forecasting
