# Artificial Intelligence Enhanced Electrocardiogram Analysis for Age and Sex Classification in Youth

**Authors:** Honggen Zhang, Mohammad Zaeri-Amirani, Mojtaba Abolfazli, Narayana P. Santhanam, June Zhang, Anders Høst-Madsen, Chieko Kimata, James C. Perry, Andras Bratincsak

PMC · DOI: 10.21203/rs.3.rs-7512909/v1 · Research Square · 2025-10-19

## TL;DR

This study uses machine learning to improve ECG analysis in children and adolescents by creating age- and sex-specific standards.

## Contribution

The novel use of machine learning models to derive age- and sex-specific ECG standards in youth.

## Key findings

- Support vector machines achieved the highest accuracy in predicting age and sex from ECG data.
- Key ECG features like heart rate and QRS duration were most predictive of age and sex.
- Age-group classification improved significantly when allowing one-group misclassification.

## Abstract

Electrocardiogram (ECG) values vary significantly across age and sex, particularly during childhood and adolescence. While age- and sex-specific ECG standards exist, they often fail to capture complex multi-dimensional relationships and have not been applied in machine learning (ML) enhanced ECG analysis. Accuracy of automated ECG analysis in clinical practice improved significantly by applying ML models, however there is a paucity of such studies in the pediatric population. Our aim was to create age- and sex-specific standards for children by ML modeling.

We analyzed 29,408 curated resting 12-lead ECGs from healthy subjects aged 0–21 years using 177 digitized ECG variables combined with various ML models including regression and classification analyses and semi-supervised neural networks. Primary outcome variables were age and sex. Model performance was evaluated using F1-score, AUROC, and confusion matrices across repeated train-test splits.

Support vector machine (SVM) achieved the highest accuracy in modeling both age and sex. Key predictive features included heart rate, PR interval, QRS duration, and T-wave amplitude. Age-group classification achieved an average true positive rate of 60% with SVM, improving to 94% when allowing one-group misclassification. Sex classification reached F1-scores of 0.91 and AUROC of 0.95 in adolescents and young adults, and moderate accuracy in younger children.

Traditional supervised ML models can accurately model physiologic ECG changes related to age and sex, outperforming semi-supervised models, particularly in smaller subgroups. These findings support the development of age- and sex-specific ML-enhanced ECG standards to aid future research and clinical applications in pediatric cardiology.

## Full-text entities

- **Diseases:** syncope (MESH:D013575), transposition of the great arteries (MESH:D014188), tachycardia (MESH:D013610), irregular heartbeat (MESH:D005117), AI (MESH:C538142), conditions (MESH:D020763), heart murmur (MESH:D006337), dizziness (MESH:D004244), conduction defects (MESH:D019955), left ventricular dysfunction (MESH:D018487), fever (MESH:D005334), bradycardia (MESH:D001919), ventricular dysfunction (MESH:D018754), congenital heart defects (MESH:D006330), LQTS (MESH:D008133), HCM (MESH:D002312), cardiomyopathies (MESH:D009202), arrhythmia (MESH:D001145), cardiac anomaly (MESH:D006331), ventricular enlargement or hypertrophy (MESH:D024741)
- **Chemicals:** T-BiGAN (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12633205/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12633205/full.md

## References

17 references — full list in the complete paper: https://tomesphere.com/paper/PMC12633205/full.md

---
Source: https://tomesphere.com/paper/PMC12633205