# Balancing noise reduction and neural signature preservation in EEG biometrics

**Authors:** Muhammad Usman, Nadia Sultan, Ammara Nasim, Beenish Ayaz, Joddat Fatima, Faryal Nosheen

PMC · DOI: 10.1038/s41598-026-36840-4 · Scientific Reports · 2026-01-30

## TL;DR

This paper introduces a framework for improving EEG-based biometric identification by balancing noise reduction and preserving neural signatures, achieving high accuracy with consumer-grade hardware.

## Contribution

A novel framework integrating lenient preprocessing, spectral features, and ensemble classification for robust EEG biometrics.

## Key findings

- XGBoost achieved 98% accuracy using Visual Evoked Potential Complex stimulation on cleaned data.
- The proposed pipeline improved robustness compared to raw and conventionally processed data.
- Rest Closed Eyes emerged as the most stable paradigm for cross-session evaluation.

## Abstract

EEG-based subject identification is an emerging biometric approach with strong potential for secure authentication, but reliable performance requires optimisation of the entire processing pipeline. The key difficulty lies in improving signal quality while preserving the subtle neural signatures that uniquely distinguish individuals . In this study, we propose a complete framework that integrates lenient preprocessing, spectral feature extraction, and ensemble classification. Using the Brain Encoding Dataset(BED), we evaluated three data variants: raw EEG recordings, signals processed with a modified Pre-processing (PREP) pipeline using relaxed thresholds, and expert-curated pre-extracted features. All datasets were analyzed with mel-frequency cepstral coefficients(MFCC), and classification was performed within an ensemble architecture that combined decision trees, random forests, support vector machines, and XGBoost. The experiments covered 21 subjects, 33 sessions, and twelve stimulus conditions including resting state, cognitive tasks, and visual evoked potentials. XGBoost achieved peak accuracy of 98.00% using Visual Evoked Potential Complex stimulation at 10 Hz on cleaned data, representing a 5.3% improvement over raw signals and an 8.4% improvement over pre-extracted features. Statistical validation confirmed that these improvements are robust across all experimental conditions at (\documentclass[12pt]{minimal}
				\usepackage{amsmath}
				\usepackage{wasysym} 
				\usepackage{amsfonts} 
				\usepackage{amssymb} 
				\usepackage{amsbsy}
				\usepackage{mathrsfs}
				\usepackage{upgreek}
				\setlength{\oddsidemargin}{-69pt}
				\begin{document}$$p < 0.01$$\end{document}). Cross-session evaluation further demonstrated the expected temporal variability in EEG-based biometrics but showed that the proposed pipeline improves robustness compared with both raw and conventionally processed data, with Rest Closed Eyes emerging as the most stable paradigm. These findings establish a principled framework for EEG-based subject identification and provide practical guidelines for optimizing preprocessing, feature extraction, classification, and stimulus paradigms for real-world deployment with consumer-grade hardware and system approach.

## Full-text entities

- **Genes:** SHROOM4 (shroom family member 4) [NCBI Gene 57477] {aka MRXSSDS, SHAP, shrm4}
- **Diseases:** PSD (MESH:C536311), fatigue (MESH:D005221), Confusion (MESH:D003221), BED (MESH:C564021)
- **Chemicals:** VEP (MESH:C047598), HAPPE (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12914031/full.md

## Figures

12 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12914031/full.md

## References

3 references — full list in the complete paper: https://tomesphere.com/paper/PMC12914031/full.md

---
Source: https://tomesphere.com/paper/PMC12914031