End-to-end translation of human neural activity to speech with a dual-dual generative adversarial network
Yina Guo, Xiaofei Zhang, Zhenying Gong, Anhong Wang, Wenwu Wang

TL;DR
This paper introduces an end-to-end neural network model using a dual-dual GAN to translate human neural activity directly into speech, improving accuracy over previous two-step methods.
Contribution
It proposes a novel end-to-end translation model with a dual-dual GAN and a transition domain, enabling one-to-one neural-to-speech translation without information loss.
Findings
Outperforms state-of-the-art methods on word and sentence translation tasks.
Successfully translates neural activity sequences of varying lengths into speech.
Creates a new EEG dataset with attention detection for better neural signal quality.
Abstract
In a recent study of auditory evoked potential (AEP) based brain-computer interface (BCI), it was shown that, with an encoder-decoder framework, it is possible to translate human neural activity to speech (T-CAS). However, current encoder-decoder-based methods achieve T-CAS often with a two-step method where the information is passed between the encoder and decoder with a shared dimension reduction vector, which may result in a loss of information. A potential approach to this problem is to design an end-to-end method by using a dual generative adversarial network (DualGAN) without dimension reduction of passing information, but it cannot realize one-to-one signal-to-signal translation (see Fig.1 (a) and (b)). In this paper, we propose an end-to-end model to translate human neural activity to speech directly, create a new electroencephalogram (EEG) datasets for participants with good…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEEG and Brain-Computer Interfaces · Neural dynamics and brain function · Blind Source Separation Techniques
