SSM2Mel: State Space Model to Reconstruct Mel Spectrogram from the EEG

Cunhang Fan; Sheng Zhang; Jingjing Zhang; Zexu Pan; Zhao Lv

arXiv:2501.10402·eess.SP·January 22, 2025

SSM2Mel: State Space Model to Reconstruct Mel Spectrogram from the EEG

Cunhang Fan, Sheng Zhang, Jingjing Zhang, Zexu Pan, Zhao Lv

PDF

Open Access

TL;DR

This paper introduces SSM2Mel, a novel state space model that reconstructs continuous speech mel spectrograms from EEG signals, significantly improving decoding accuracy for imagined speech.

Contribution

The paper presents a new SSM-based framework with modules like Mamba, S4-UNet, and ESM to enhance continuous speech reconstruction from EEG, addressing previous limitations.

Findings

01

Achieved a Pearson correlation of 0.069, a 38% improvement over baseline.

02

Effectively models long EEG sequences with the Mamba module.

03

Enhances local feature extraction using S4-UNet.

Abstract

Decoding speech from brain signals is a challenging research problem that holds significant importance for studying speech processing in the brain. Although breakthroughs have been made in reconstructing the mel spectrograms of audio stimuli perceived by subjects at the word or letter level using noninvasive electroencephalography (EEG), there is still a critical gap in precisely reconstructing continuous speech features, especially at the minute level. To address this issue, this paper proposes a State Space Model (SSM) to reconstruct the mel spectrogram of continuous speech from EEG, named SSM2Mel. This model introduces a novel Mamba module to effectively model the long sequence of EEG signals for imagined speech. In the SSM2Mel model, the S4-UNet structure is used to enhance the extraction of local features of EEG signals, and the Embedding Strength Modulator (ESM) module is used to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Neural dynamics and brain function · EEG and Brain-Computer Interfaces

MethodsMamba: Linear-Time Sequence Modeling with Selective State Spaces