NeuroIncept Decoder for High-Fidelity Speech Reconstruction from Neural Activity
Owais Mujtaba Khanday, Jos\'e L. P\'erez-C\'ordoba, Mohd Yaqub Mir, Ashfaq Ahmad Najar, Jose A. Gonzalez-Lopez

TL;DR
This study presents a novel neural network-based algorithm that reconstructs speech spectrograms from invasive EEG recordings, aiming to improve communication for individuals with speech impairments.
Contribution
The paper introduces the NeuroIncept Decoder, combining CNNs and GRUs to enhance speech reconstruction from neural activity, advancing brain-computer interface capabilities.
Findings
Robust correlation between predicted and actual spectrograms
Inter-subject variability in neural decoding performance
Potential for restoring communication in speech-impaired individuals
Abstract
This paper introduces a novel algorithm designed for speech synthesis from neural activity recordings obtained using invasive electroencephalography (EEG) techniques. The proposed system offers a promising communication solution for individuals with severe speech impairments. Central to our approach is the integration of time-frequency features in the high-gamma band computed from EEG recordings with an advanced NeuroIncept Decoder architecture. This neural network architecture combines Convolutional Neural Networks (CNNs) and Gated Recurrent Units (GRUs) to reconstruct audio spectrograms from neural patterns. Our model demonstrates robust mean correlation coefficients between predicted and actual spectrograms, though inter-subject variability indicates distinct neural processing mechanisms among participants. Overall, our study highlights the potential of neural decoding techniques to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Speech and Audio Processing · Speech Recognition and Synthesis
