Spoken Speech Enhancement using EEG

Gautam Krishna; Co Tran; Yan Han; Mason Carnahan; Ahmed H Tewfik

arXiv:1909.09132·eess.AS·April 21, 2020

Spoken Speech Enhancement using EEG

Gautam Krishna, Co Tran, Yan Han, Mason Carnahan, Ahmed H Tewfik

PDF

Open Access

TL;DR

This paper introduces a novel approach to speech enhancement using EEG signals and various neural network models, demonstrating significant improvements over traditional methods in noisy environments.

Contribution

First demonstration of spoken speech enhancement using EEG signals with multiple neural network models, showing improved quality over traditional algorithms.

Findings

01

EEG features can effectively clean speech in noisy settings.

02

GAN, GRU, TCN, and mixed models outperform traditional log MMSE.

03

First use of EEG for speech enhancement in parallel recordings.

Abstract

In this paper we demonstrate spoken speech enhancement using electroencephalography (EEG) signals using a generative adversarial network (GAN) based model, gated recurrent unit (GRU) regression based model, temporal convolutional network (TCN) regression model and finally using a mixed TCN GRU regression model. We compare our EEG based speech enhancement results with traditional log minimum mean-square error (MMSE) speech enhancement algorithm and our proposed methods demonstrate significant improvement in speech enhancement quality compared to the traditional method. Our overall results demonstrate that EEG features can be used to clean speech recorded in presence of background noise. To the best of our knowledge this is the first time a spoken speech enhancement is demonstrated using EEG features recorded in parallel with spoken speech.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Blind Source Separation Techniques · Advanced Adaptive Filtering Techniques

MethodsGated Recurrent Unit