NeuSpeech: Decode Neural signal as Speech
Yiqian Yang, Yiqun Duan, Qiang Zhang, Hyejeong Jo, Jinni Zhou, Won Hee, Lee, Renjing Xu, Hui Xiong

TL;DR
This paper introduces NeuSpeech, a novel cross-attention-based model for decoding speech directly from non-invasive MEG neural signals, achieving high accuracy without teacher forcing and advancing brain-computer interface capabilities.
Contribution
First to apply a cross-attention 'whisper' model for MEG-based speech decoding without teacher forcing, outperforming previous methods on major datasets.
Findings
Achieved BLEU-1 scores of 60.30 and 52.89 on two datasets.
Demonstrated effectiveness of the model without pretraining or teacher forcing.
Provided comprehensive analysis of neural speech decoding techniques.
Abstract
Decoding language from brain dynamics is an important open direction in the realm of brain-computer interface (BCI), especially considering the rapid growth of large language models. Compared to invasive-based signals which require electrode implantation surgery, non-invasive neural signals (e.g. EEG, MEG) have attracted increasing attention considering their safety and generality. However, the exploration is not adequate in three aspects: 1) previous methods mainly focus on EEG but none of the previous works address this problem on MEG with better signal quality; 2) prior works have predominantly used during generative decoding, which is impractical; 3) prior works are mostly not fully auto-regressive, which performs better in other sequence tasks. In this paper, we explore the brain-to-text translation of MEG signals in a speech-decoding formation.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDeception detection and forensic psychology
MethodsSparse Evolutionary Training · Focus
