End-to-end translation of human neural activity to speech with a   dual-dual generative adversarial network

Yina Guo; Xiaofei Zhang; Zhenying Gong; Anhong Wang; Wenwu Wang

arXiv:2110.06634·cs.SD·March 29, 2022

End-to-end translation of human neural activity to speech with a dual-dual generative adversarial network

Yina Guo, Xiaofei Zhang, Zhenying Gong, Anhong Wang, Wenwu Wang

PDF

Open Access

TL;DR

This paper introduces an end-to-end neural network model using a dual-dual GAN to translate human neural activity directly into speech, improving accuracy over previous two-step methods.

Contribution

It proposes a novel end-to-end translation model with a dual-dual GAN and a transition domain, enabling one-to-one neural-to-speech translation without information loss.

Findings

01

Outperforms state-of-the-art methods on word and sentence translation tasks.

02

Successfully translates neural activity sequences of varying lengths into speech.

03

Creates a new EEG dataset with attention detection for better neural signal quality.

Abstract

In a recent study of auditory evoked potential (AEP) based brain-computer interface (BCI), it was shown that, with an encoder-decoder framework, it is possible to translate human neural activity to speech (T-CAS). However, current encoder-decoder-based methods achieve T-CAS often with a two-step method where the information is passed between the encoder and decoder with a shared dimension reduction vector, which may result in a loss of information. A potential approach to this problem is to design an end-to-end method by using a dual generative adversarial network (DualGAN) without dimension reduction of passing information, but it cannot realize one-to-one signal-to-signal translation (see Fig.1 (a) and (b)). In this paper, we propose an end-to-end model to translate human neural activity to speech directly, create a new electroencephalogram (EEG) datasets for participants with good…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEEG and Brain-Computer Interfaces · Neural dynamics and brain function · Blind Source Separation Techniques