Prognostic Significance of AI-Enhanced ECG for Emergency Department Patients
Yu-Te Su, Sy-Jou Chen, Chin Lin, Chin-Sheng Lin, Hsiao-Feng Hu

TL;DR
AI-enhanced ECGs before discharge better predict patient mortality risk than initial ECGs in emergency departments.
Contribution
Demonstrates that follow-up AI-ECG risk scores before discharge improve mortality prediction in ED patients.
Findings
Follow-up AI-ECG before discharge had an AUC of 83.3% for mortality risk prediction.
Patients transitioning to higher risk at discharge had significantly increased 90-day mortality.
AI-ECG supports safer discharge decisions but does not replace full clinical assessment.
Abstract
Background/Objectives: Artificial intelligence (AI)-enabled electrocardiogram (ECG) analysis may assist in objective and reproducible risk stratification. However, the prognostic utility of serial ECGs, particularly the follow-up ECG prior to discharge, has not been extensively studied. This study aimed to evaluate whether dynamic changes in AI-predicted ECG risk scores could enhance prediction of post-discharge outcomes. Methods: This retrospective cohort study included 11,508 ED visits from a single medical center where patients underwent two ECGs and were directly discharged. We stratified the mortality risk of patients as low risk, medium risk, and high risk based on the first and follow-up ECG prior to discharge using AI-enabled ECG models. The Area Under the Curve (AUC) was calculated for the predictive performance of the two ECGs. Kaplan–Meier (KM) curves were used for 90-day…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsECG Monitoring and Analysis · Phonocardiography and Auscultation Techniques · Emergency and Acute Care Studies
1. Introduction
Efficient patient disposition is an important challenge in emergency departments (EDs), where clinicians must balance the need for prompt care with the imperative to accurately assess risk. Despite thorough assessment and treatment, patients discharged from the ED may experience adverse outcomes that were not anticipated at the time of discharge. A previous study found that the overall 72 h revisit rate to the ED was 5.47% [1]. Furthermore, missed and delayed diagnoses in the ED can lead to serious adverse outcomes, including death, with stroke, myocardial infarction, and aortic aneurysm/dissection among the most commonly overlooked [2]. These statistics highlight the challenge of accurately identifying patients at elevated risk during the disposition process and underscore the need for more effective risk stratification tools.
The discharge process from the ED involves a complex interplay among patient factors, clinician judgment, and system constraints [3]. Traditional diagnostic tools and risk scores, while valuable, ultimately rely on clinician interpretation and can sometimes fail to capture a patient’s true clinical risk, potentially resulting in missed diagnoses or inappropriate disposition [4]. This limitation is particularly evident in the interpretation of ECGs, which have become fundamental to risk assessment in the ED, especially for patients with cardiovascular history or presenting with symptoms such as chest pain or dyspnea. In addition to conventional ECG interpretation, heart rate variability (HRV) measures such as the standard deviation of normal-to-normal intervals (SDNN) and very low frequency (VLF) have been established as predictors [5]. However, HRV captures limited aspects of cardiac dynamics and requires preprocessing, whereas deep learning models can directly analyze raw waveforms to extract richer prognostic information. This highlights the need for more objective and reliable methods to enhance ECG analysis and improve patient risk stratification.
Artificial intelligence (AI) has emerged as a promising solution to these challenges by offering capabilities that extend beyond traditional clinical assessment. Advanced AI algorithms can detect subtle patterns and abnormalities in ECG data that might escape human observation, enabling prediction of various cardiovascular conditions including heart failure, atrial fibrillation, and overall mortality risk [6,7,8]. Recent studies have demonstrated the practical application of these capabilities, showing that AI-ECG algorithms can effectively predict left ventricular systolic dysfunction and stratify in-hospital mortality risk for intensive care patients [9,10]. Furthermore, an AI-enabled ECG alert system has been shown to significantly improve patient outcomes, with one study reporting a 31% reduction in the relative risk of 90-day all-cause mortality compared to standard care based on a single ECG analysis [11]. These findings suggest that AI-enhanced ECG interpretation could provide a more accurate, data-driven foundation for clinical decision-making in the ED.
In the context of high-acuity EDs, where rapid and accurate evaluation is essential, AI-enhanced ECG interpretation could be particularly valuable. Current ED practices often involve performing multiple ECGs and serial laboratory tests to evaluate higher-risk patients, especially those presenting with cardiopulmonary symptoms. However, no studies have investigated the comparative prognostic value of initial versus follow-up ECGs during an ED visit, nor how AI-enhanced interpretation might optimize this process. This study aims to address this gap by assessing the prognostic value of changes in AI-enabled ECG risk stratification for ED patients. We hypothesize that the predictive accuracy of follow-up ECGs will be superior to that of initial ECGs, potentially offering a more reliable basis for disposition decisions and improving patient outcomes.
2. Materials and Methods
2.1. Study Design and Patients
This retrospective cohort study was conducted in the ED of Tri-Service General Hospital (TSGH) from July 2012 to December 2022. The study included patients who underwent at least two ECG examinations during their ED visit. The exclusion criteria were as follows: patients with a time interval exceeding 1 h from admission to the first ECG, as this interval may not accurately represent the initial condition; patients with a time interval of less than 1 h between the first and second ECGs, as this short interval may not allow for significant physiological changes to manifest; patients with a time interval exceeding 2 h from the second ECG to discharge, as this may not accurately reflect the condition at discharge; patients with a total ED stay exceeding 48 h, as this may involve more complex medical conditions and treatments; and patients who died during their ED visit. Patients who met inclusion criteria and were directly discharged alive formed the final study cohort. This study was approved by the Institutional Review Board of TSGH, Taipei, Taiwan (IRB No. C202105049), which waived the requirement for informed consent due to the retrospective nature of the research.
2.2. ECG Acquisition and Preprocessing
All electrocardiograms were acquired as standard 12-lead 10 s recordings using a digital ECG system with a sampling frequency of 500 Hz and an amplitude resolution of 5 microvolts per bit. The raw ECG waveforms underwent preprocessing that included bandpass filtering between 0.5 and 40 Hz to reduce baseline wander and high-frequency noise, followed by normalization to zero mean and unit variance across each lead. Each ECG was segmented into uniform 10 s matrices containing 12 leads, forming 5000 samples. In addition to waveform-level data, structured ECG features were extracted from machine-generated ECG interpretations using a standardized diagnostic classification system implemented in the Philips ECG platform, as described in our previous study [12]. These included 31 diagnostic pattern classes (e.g., atrial fibrillation, ventricular premature complexes, QT prolongation) and 8 continuous ECG measurements (e.g., heart rate, PR/QRS/QTc intervals, T/QRS axes). These features were used in secondary analyses to evaluate their potential association with AI-ECG risk predictions.
2.3. AI Model Architecture and Training
We developed a deep learning model based on an 82-layer convolutional neural network (CNN) architecture adapted from our previous studies [12], which originally demonstrated robust prognostic performance in large-scale ECG datasets. Briefly, the network incorporated multiple convolutional blocks with ReLU activations and batch normalization, followed by fully connected layers and a final sigmoid output neuron generating continuous probability scores. The dataset was partitioned by patient into development, tuning, and internal validation subsets to prevent cross-contamination. Model training was performed using the Adam optimizer (initial learning rate 0.001), a batch size of 32, L2 regularization, oversampling to address class imbalance, and early stopping based on validation loss. Model development and inference were implemented in Python 3.8 (TensorFlow and Keras). The model was trained on more than 450,000 ECGs with all-cause mortality as the primary outcome label, using survival data that included censored events. Its predictive performance was evaluated in an external validation dataset, achieving an Area Under the Curve (AUC) of 0.886 for 90-day mortality prediction. We applied gradient boosting models (XGBoost) using structured ECG features and patient demographics to estimate their relative importance in predicting AI-derived risk scores, as described in our earlier study [11]. A representative visual example of the AI model’s output and interpretability interface is presented in Figure S1 (Supplementary Materials).
2.4. AI-Enabled ECG Risk Stratification
For each ECG, the model generated a continuous probability score predicting 90-day all-cause mortality, which was transformed into percentile ranks relative to a hospital-wide reference population and categorized into three risk groups: low (<75th percentile), medium (75th–94th percentile), and high (≥95th percentile). For each patient, both the initial ECG at ED presentation and the follow-up ECG prior to discharge were analyzed. To assess temporal changes in risk classification, four transition groups were defined: low-to-low, medium/high-to-low, low-to-medium/high, and medium/high-to-medium/high. Patient survival status was determined using electronic medical records (EMRs). Although deaths occurring outside our institution may not have been captured, this limitation was considered minimal because only 0.16% of readmissions occurred at external hospitals during the reference period [13].
2.5. Statistical Analysis
The predictive performance of the AI-enabled ECG model was assessed by calculating the AUC. Cumulative survival probabilities within 90 days were estimated using Kaplan–Meier methods and compared across groups using the log-rank test. Associations between risk categories and all-cause mortality were assessed using Cox proportional hazards regression, and hazard ratios with 95% confidence intervals were reported. All statistical analyses were performed using R version 3.4.4, and a two-sided p-value less than 0.05 was considered statistically significant.
3. Results
3.1. Study Flowchart and AI-Enabled ECG Risk Categorization
There were 37,027 patients who underwent at least two ECG examinations during the study period, and 21,866 patients were excluded. A total of 15,161 patients departed from the ED, with 11,508 patients discharged directly from the ED included in the final analysis. The AI-enabled ECG risk prediction model, based on the initial and follow-up ECGs prior to discharge, indicated transitions among different risk groups, as illustrated in Figure 1. Of note, numbers in the medium- and high-risk groups were decreased from 2665 and 643 to 2229 and 342 by subsequent AI-enabled ECG risk classification.
3.2. Baseline Characteristics of Patients
The demographic characteristics of these 11,508 patients are presented in Table 1. The mean age of the population was 63.6 ± 16.6 years, with males accounting for 53.6%. Level 3 triage accounted for the majority of cases (56.0%) in the ED. The mean AI-enabled ECG mortality risk score was 68.8 ± 24.1 for the first ECG, and 65.7 ± 24.1 for the subsequent ECG prior to discharge. The average duration from admission to discharge was 5.5 ± 4.4 h. Approximately 0.9% of patients experienced mortality during the 90-day follow-up period. The characteristics of the three risk categories predicted by AI-enabled ECG upon admission to the ED are detailed in Table 2. Significant differences were observed across the risk groups in age, sex distribution, vital signs, ECG-derived risk scores, triage levels, and clinical outcomes. Patients in the high-risk group were generally older (mean age: 69.98 years) and exhibited more severe triage classifications, with 10.3% assigned to level 1 acuity, compared to only 1.5% in the low-risk group. The second ECG mortality risk score, obtained prior to discharge, remained elevated in the high-risk cohort (mean: 87.4). The 90-day all-cause mortality rate increased proportionally with risk category: 0.3% in the low-risk group, 1.7% in the medium-risk group, and 3.9% in the high-risk group (p < 0.001). The characteristics of different risk transitions are presented in Appendix A Table A1, Table A2 and Table A3.
3.3. AI-Enabled ECG Risk Prediction
Figure 2 illustrates the receiver operating characteristic (ROC) curves of the ECG prediction models of all-cause mortality within 90 days, which demonstrate that the AUC for the first and follow-up ECGs prior to discharge were 78.6% and 83.3%, respectively. The positive predictive values were 2.1% and 2.8%, respectively, while the negative predictive values were both 99.7%. The follow-up ECG indicated better predictive ability compared to the initial ECG.
3.4. Mortality Risk of AI-Enabled ECG Categorical Changes
Cumulative death estimates for the four risk categories, based on risk scores generated by the AI model from the two ECGs, are shown as Kaplan–Meier curves in Figure 3. The incidences of 90-day mortality were 3.3% in the medium/high-to-medium/high risk group and 1.3% in the low-to-medium/high group, which were significantly higher (HR: 12.44; CI: 4.62–33.49; HR: 6.01; CI: 1.70–21.27) compared to the reference group (0.2%). Conversely, the incidence in the medium/high-to-low-risk group was 0.5%, which showed no significant difference (HR: 1.91; CI: 0.45–8.14) compared to the reference group. Overall, the concordance index (C-index) was 0.911.
4. Discussion
This study demonstrated that follow-up ECGs prior to discharge, analyzed by an AI-enhanced model, provided superior predictive accuracy for 90-day all-cause mortality compared to the initial ECG obtained at ED arrival. Importantly, patients who were initially classified as low-risk but transitioned to medium- or high-risk through follow-up ECGs had significantly increased mortality, highlighting the dynamic nature of clinical risk and the value of serial assessment.
The enhanced predictive accuracy of the follow-up ECG conducted prior to discharge can be attributed to several key factors. Firstly, patients with unstable conditions presenting at the ED may have masked signs due to rising catecholamine levels in acute stress, which makes the initial ECG less reliable for risk prediction [14]. Secondly, a patient’s condition may fluctuate or deteriorate progressively during their stay, and the initial ECG may not capture these dynamic changes [15]. Thirdly, patients may become more stable after appropriate management, which the follow-up ECG can capture more effectively than the initial ECG. For instance, follow-up ECGs are often obtained after specific interventions in the ED, such as treatments for paroxysmal supraventricular tachycardia or atrial fibrillation with rapid ventricular response. These interventions stabilize the patient, allowing the follow-up ECG to more accurately reflect their true clinical condition at the time of discharge. The results of our study reveal that performing a follow-up ECG is more precise in risk stratification, ensuring that patients receive a more accurate assessment prior to discharge.
Our findings suggest that patients classified as low-risk at discharge can generally be safely sent home, whereas those in the medium/high-risk categories should be monitored closely, particularly those who were initially classified as low-risk. Bounce-back admissions and deaths within 7 days after ED discharge have been reported in approximately 2.6% and 0.05% of cases, respectively [16,17,18]. Clinical risk stratification tools, such as the HEART (History, ECG, Age, Risk Factors, and Troponin) score and the EDACS (Emergency Department Assessment of Chest Pain Score) are widely used to assess patients presenting with chest pain and suspected of acute coronary syndrome (ACS). However, both HEART and EDACS scores rely on discrete clinical and biochemical variables, which may not capture the dynamic physiological changes that occur over time [19,20]. Dynamic assessment for risk evaluation can better identify critical changes that impact clinical outcomes for patients in the ED. For example, the biomarkers N-terminal pro-brain natriuretic peptide and cardiac troponin require serial evaluations to track risk changes accurately, necessitating prompt treatment [21,22,23]. While initial ECGs provide valuable information, they may underestimate risk due to acute changes in a patient’s condition. Performing serial ECGs during the ED stay allows for more precise and timely risk assessment, offering a more accurate prediction of patient outcomes [24]. This approach reduces the risk of underestimation and improves patient outcomes by ensuring that individuals who require additional monitoring or intervention receive appropriate care.
In our study, of the 88 patients who died within the 90-day follow-up period, 27 (30.7%) deaths were attributed to pulmonary causes, 26 (29.5%) to malignancies, 13 (14.8%) to cardiogenic causes, 9 (10.2%) to infectious etiologies, and 13 (14.8%) to other causes. Notably, 9.1% of patients transitioned from the low-risk to the medium/high-risk group between their initial and follow-up ECGs. Two patients who presented to the ED in out-of-hospital cardiac arrest due to fatal hyperkalemia and severe gastrointestinal bleeding. Of the deaths, forty-one were potentially preventable and were associated with conditions such as ischemic heart disease, congestive heart failure, arrhythmias, chronic obstructive pulmonary disease, and hyperkalemia. The average time to death following ED discharge was 49.1 ± 21.6 days, with the shortest time being 12 days, indicating that these deaths were likely influenced by both acute and subacute underlying issues. These findings highlight the important role of AI-enabled ECGs in predicting short-term mortality risks of various etiologies, providing clinical support for timely referrals and follow-up treatments. Therefore, the implementation of follow-up AI-enabled ECGs in the ED is recommended, as it has the potential to enhance decision-making processes and optimize resource allocation.
Previous research has demonstrated the potential of AI-driven ECG models in accurately predicting adverse outcomes, such as mortality and major cardiac events including acute myocardial infarction, stroke, and heart failure [12,25]. ECGs are valuable for risk stratification, and their integration with other diagnostic tools, such as chest X-rays, can significantly enhance risk prediction in emergency settings [26,27]. In cases of acute myocardial infarction, the prognostic importance of the initial ECG is well established [28]. However, studies have shown that dynamic ECG changes offer superior predictive value for sudden cardiac death compared to baseline and static ECGs [29]. Additionally, dynamic ECGs have been shown to predict the risk of heart failure hospitalization [30]. AI-enabled ECGs in the ED significantly enhance risk stratification, enabling timely interventions and ensuring that high-risk patients receive appropriate care, which may ultimately reduce readmissions and improve patient outcomes [31,32]. Our study further underscores the importance of follow-up ECGs in the ED, as they capture dynamic changes in a patient’s condition, providing a more accurate risk stratification. This sequential approach emphasizes the benefits of repeated ECG assessments in predicting adverse outcomes and guiding clinical decision-making more effectively.
AI-enhanced ECG analysis has advanced significantly, with deep learning-based models and large language models (LLMs) demonstrating effectiveness in detecting cardiac conditions, predicting adverse events, and improving clinical decision-making [33,34]. However, the clinical adoption of these models is often limited by the need for extensive computational resources and structured multimodal datasets. Our AI-enabled ECG model relies exclusively on ECG waveform analysis, eliminating the need for additional clinical inputs while maintaining strong predictive performance. This approach offers key advantages, including real-time applicability and dynamic risk stratification, making it a potentially valuable tool for ED decision support. By providing timely and accurate risk assessment, our model has the potential to optimize ED workflows, improve patient disposition, and enhance clinical outcomes.
This study has several limitations that should be acknowledged. Firstly, the data were derived from a single medical center, which may limit the generalizability of our findings to other settings with different patient populations. Secondly, the study included patients who underwent two ECGs during their ED visit. These patients generally presented with cardiopulmonary symptoms, making them a higher-risk group that may not fully represent the broader ED population. This selection bias could have influenced the study’s outcomes and may limit the applicability of the findings to lower-risk patients. Thirdly, the retrospective design inherently carries the risk of biases related to data collection, missing information, and the inability to control for all confounding variables. Although we attempted to control the known confounders, there may be unmeasured variables that influenced the results. In particular, because mortality was captured only within our hospital records, some out-of-system deaths may have been missed and introduced bias. Additionally, the study did not evaluate the cost-effectiveness or feasibility of implementing AI-enabled ECG technology on a large scale in routine ED practice. Finally, while we included an analysis of ECG features and patient characteristics to assess model interpretability, future work should evaluate their clinical utility in prospective, multi-center settings.
5. Conclusions
The AI-enabled follow-up ECG demonstrated superior predictive accuracy compared to the initial ECG, highlighting the potential value of performing a follow-up ECG to enhance risk stratification for ED patients. The follow-up ECG, conducted prior to discharge, may provide a more accurate assessment of short-term mortality risk, particularly for those initially deemed low risk. Patients being considered for discharge from the ED could benefit from an additional ECG to support a more precise evaluation of their risk profile. Although AI-enabled ECG interpretation alone may not replace comprehensive risk assessment, it offers a practical tool to support clinical judgment, particularly in the dynamic ED environment, by supporting safer discharge decisions. Future research should aim to validate these findings across multiple centers, using prospective study designs and assessing the cost-effectiveness and real-world implementation of AI-enabled ECGs in diverse clinical settings.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Calder L. Pozgay A. Riff S. Rothwell D. Youngson E. Mojaverian N. Cwinn A. Forster A. Adverse events in patients with return emergency department visits BMJ Qual. Saf.20152414214810.1136/bmjqs-2014-00319425540424 PMC 4316869 · doi ↗ · pubmed ↗
- 2Newman-Toker D.E. Peterson S.M. Badihian S. Hassoon A. Nassery N. Parizadeh D. Wilson L.M. Jia Y. Omron R. Tharmarajah S. AHRQ Comparative Effectiveness Reviews Diagnostic Errors in the Emergency Department: A Systematic Review Agency for Healthcare Research and Quality Rockville, MD, USA 202236574484 · pubmed ↗
- 3Capan M. Pigeon J. Marco D. Powell J. Groner K. We all make choices: A decision analysis framework for disposition decision in the ED Am. J. Emerg. Med.20183645045410.1016/j.ajem.2017.11.01829174450 · doi ↗ · pubmed ↗
- 4Hussain F. Cooper A. Carson-Stevens A. Donaldson L. Hibbert P. Hughes T. Edwards A. Diagnostic error in the emergency department: Learning from national patient safety incident report analysis BMC Emerg. Med.2019197710.1186/s 12873-019-0289-331801474 PMC 6894198 · doi ↗ · pubmed ↗
- 5Jarczok M.N. Weimer K. Braun C. Williams D.P. Thayer J.F. Gündel H.O. Balint E.M. Heart rate variability in the prediction of mortality: A systematic review and meta-analysis of healthy and patient populations Neurosci. Biobehav. Rev.202214310490710.1016/j.neubiorev.2022.10490736243195 · doi ↗ · pubmed ↗
- 6Sun X. Yin Y. Yang Q. Huo T. Artificial intelligence in cardiovascular diseases: Diagnostic and therapeutic perspectives Eur. J. Med. Res.20232824210.1186/s 40001-023-01065-y 37475050 PMC 10360247 · doi ↗ · pubmed ↗
- 7Sehrawat O. Kashou A.H. Noseworthy P.A. Artificial intelligence and atrial fibrillation J. Cardiovasc. Electrophysiol.2022331932194310.1111/jce.1544035258136 PMC 9717694 · doi ↗ · pubmed ↗
- 8Zhang P.I. Hsu C.C. Kao Y. Chen C.J. Kuo Y.W. Hsu S.L. Liu T.L. Lin H.J. Wang J.J. Liu C.F. Real-time AI prediction for major adverse cardiac events in emergency department patients with chest pain Scand. J. Trauma. Resusc. Emerg. Med.2020289310.1186/s 13049-020-00786-x 32917261 PMC 7488862 · doi ↗ · pubmed ↗
