NeuroIncept Decoder for High-Fidelity Speech Reconstruction from Neural Activity

Owais Mujtaba Khanday; Jos\'e L. P\'erez-C\'ordoba; Mohd Yaqub Mir; Ashfaq Ahmad Najar; Jose A. Gonzalez-Lopez

arXiv:2501.03757·cs.SD·May 21, 2025

NeuroIncept Decoder for High-Fidelity Speech Reconstruction from Neural Activity

Owais Mujtaba Khanday, Jos\'e L. P\'erez-C\'ordoba, Mohd Yaqub Mir, Ashfaq Ahmad Najar, Jose A. Gonzalez-Lopez

PDF

Open Access 1 Repo

TL;DR

This study presents a novel neural network-based algorithm that reconstructs speech spectrograms from invasive EEG recordings, aiming to improve communication for individuals with speech impairments.

Contribution

The paper introduces the NeuroIncept Decoder, combining CNNs and GRUs to enhance speech reconstruction from neural activity, advancing brain-computer interface capabilities.

Findings

01

Robust correlation between predicted and actual spectrograms

02

Inter-subject variability in neural decoding performance

03

Potential for restoring communication in speech-impaired individuals

Abstract

This paper introduces a novel algorithm designed for speech synthesis from neural activity recordings obtained using invasive electroencephalography (EEG) techniques. The proposed system offers a promising communication solution for individuals with severe speech impairments. Central to our approach is the integration of time-frequency features in the high-gamma band computed from EEG recordings with an advanced NeuroIncept Decoder architecture. This neural network architecture combines Convolutional Neural Networks (CNNs) and Gated Recurrent Units (GRUs) to reconstruct audio spectrograms from neural patterns. Our model demonstrates robust mean correlation coefficients between predicted and actual spectrograms, though inter-subject variability indicates distinct neural processing mechanisms among participants. Overall, our study highlights the potential of neural decoding techniques to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

owaismujtaba/NeuroInceptDecoder
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Speech and Audio Processing · Speech Recognition and Synthesis