BrainECHO: Semantic Brain Signal Decoding through Vector-Quantized Spectrogram Reconstruction for Whisper-Enhanced Text Generation
Jilong Li, Zhenxi Song, Jiaqi Wang, Meishan Zhang, Honghai Liu, Min Zhang, Zhiguo Zhang

TL;DR
BrainECHO introduces a multi-stage framework for decoding brain signals into text, overcoming robustness, noise sensitivity, and alignment issues, achieving state-of-the-art results on EEG and MEG datasets.
Contribution
It presents a novel decoupled representation learning approach with discrete autoencoding, frozen alignment, and constrained decoding fine-tuning for brain-to-text translation.
Findings
Achieves 3.65% BLEU-4 improvement with vector-quantized reconstruction.
Maintains high decoding accuracy (74%-89% BLEU scores) without teacher forcing.
Demonstrates robustness across sessions, subjects, and noise conditions.
Abstract
Current EEG/MEG-to-text decoding systems suffer from three key limitations: (1) reliance on teacher-forcing methods, which compromises robustness during inference, (2) sensitivity to session-specific noise, hindering generalization across subjects, and (3) misalignment between brain signals and linguistic representations due to pre-trained language model over-dominance. To overcome these challenges, we propose BrainECHO (Brain signal decoding via vEctor-quantized speCtrogram reconstruction for WHisper-enhanced text generatiOn), a multi-stage framework that employs decoupled representation learning to achieve state-of-the-art performance on both EEG and MEG datasets. Specifically, BrainECHO consists of three stages: (1) Discrete autoencoding, which transforms continuous Mel spectrograms into a finite set of high-quality discrete representations for subsequent stages. (2) Frozen…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques
MethodsSparse Evolutionary Training
