# Biomarker Discovery for Autism Prediction Using Massive Feature Extraction Based on EEG Signals

**Authors:** Nauman Hafeez, Abdul Rehman Aslam, Muhammad Awais Bin Altaf

PMC · DOI: 10.3390/s26061862 · Sensors (Basel, Switzerland) · 2026-03-16

## TL;DR

This paper introduces a new method using EEG signals and machine learning to accurately predict autism with high accuracy by extracting and analyzing a large number of features.

## Contribution

The novel contribution is a framework combining HCTSA-based feature extraction and hybrid feature selection for ASD classification from resting-state EEG.

## Key findings

- The proposed framework achieved 100% accuracy with ≥50 features on a balanced dataset of 56 participants.
- The most discriminating EEG channels and features were identified using Shapley values for model explainability.

## Abstract

Autism spectrum disorder (ASD) is a heterogeneous neurodevelopmental disorder that requires early diagnosis for better intervention. However, current clinical behavioural examinations are time-consuming and prone to human error. Objective and effective biomarkers are essential for the diagnosis and prognosis of the disorder. Electroencephalography (EEG) is a non-invasive and inexpensive brain-imaging technique that is widely applied in the diagnosis of ASD. Feature-based methods have shown better performance in EEG-based applications. Here, we present a prediction framework based on massive feature extraction using the highly comparative time-series analysis (HCTSA) method and a hybrid feature selection method for the classification of ASD from resting-state EEG. Machine-learning models are trained and tested on a different number of selected features. Our models demonstrated 100% accuracy with ≥50 features on a balanced dataset of 56 participants. The most discriminating EEG channels and features were used in the prediction process, as well as those using Shapley values to provide explainability of our framework. Whilst these results are promising, we acknowledge the limitations of a single small-scale dataset and emphasise the need for validation on larger independent cohorts before clinical translation.

## Linked entities

- **Diseases:** autism spectrum disorder (MONDO:0005258), ASD (MONDO:0006664)

## Full-text entities

- **Diseases:** Autism (MESH:D001321), ASD (MESH:D000067877), neurodevelopmental disorder (MESH:D002658)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13030706/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13030706/full.md

## References

49 references — full list in the complete paper: https://tomesphere.com/paper/PMC13030706/full.md

---
Source: https://tomesphere.com/paper/PMC13030706