# Toward automated neonatal EEG analysis: multi-center validation of a reliable deep learning pipeline

**Authors:** Tim Hermans, Anneleen Dereymaeker, Katrien Lemmens, Katrien Jansen, Fatima Usman, Shellie Robinson, Gunnar Naulaers, Maarten De Vos, Caroline Hartley

PMC · DOI: 10.3389/fnins.2026.1750045 · 2026-02-27

## TL;DR

This paper introduces NeoNaid, an automated tool for analyzing neonatal EEGs, validated across two hospitals to ensure reliability and accuracy in estimating brain age and sleep stages.

## Contribution

The novel contribution is a deep learning pipeline with integrated quality control that reliably estimates functional brain age and sleep stages in neonatal EEGs across different clinical settings.

## Key findings

- NeoNaid achieved median absolute FBA errors of 0.50 weeks in the internal dataset and 0.55 weeks in the external dataset.
- Sleep staging accuracy had Cohen’s Kappa values of 0.89 and 0.87 for quiet sleep detection in internal and external datasets, respectively.
- Quality control routines reduced extreme errors and improved reliability across both datasets.

## Abstract

To evaluate the reliability and generalization of NeoNaid, a fully automated software tool for neonatal EEG analysis, based on functional brain age (FBA) estimation and sleep staging.

NeoNaid combines a multi-task deep learning model with proposed quality control routines detecting artifacts, out-of-distribution inputs, and uncertain predictions. Based on a raw EEG input, it outputs one global FBA estimate and a continuous 2-state hypnogram. We validated performance on two independent hospital settings: an internal dataset (33 EEGs, 17 infants, median 900 min/recording) and an external dataset (38 EEGs, 24 infants, median 124 min/recording).

Quality control rejected a comparable number of segments in the internal and external datasets, reducing extreme errors in FBA estimation, and modestly improving sleep staging accuracy. Across the internal and external data, NeoNaid achieved median absolute FBA errors of 0.50 and 0.55 weeks and Cohen’s Kappa values of 0.89 and 0.87 for quiet sleep detection, respectively.

NeoNaid demonstrated improved reliability through integrated quality control and maintained performance across two independent datasets. By focusing on validation and trustworthiness, this work takes an essential step toward clinical adoption of automated neonatal EEG analysis and supports its utility for both NICU practice and large-scale research.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12982431/full.md

---
Source: https://tomesphere.com/paper/PMC12982431