EEGMamba: Bidirectional State Space Model with Mixture of Experts for EEG Multi-task Classification
Yiyu Gui, MingZhi Chen, Yuqi Su, Guibo Luo, Yuchao Yang

TL;DR
EEGMamba is a universal multi-task EEG classification network that combines adaptive feature extraction, bidirectional Mamba, and mixture of experts to handle various EEG tasks efficiently and accurately.
Contribution
This paper introduces EEGMamba, the first universal EEG classification model capable of multi-task learning across different EEG applications with a unified framework.
Findings
Outperforms existing models on eight EEG datasets.
Effective in seizure detection, emotion recognition, sleep staging, and motor imagery.
Balances accuracy, speed, and memory efficiency.
Abstract
In recent years, with the development of deep learning, electroencephalogram (EEG) classification networks have achieved certain progress. Transformer-based models can perform well in capturing long-term dependencies in EEG signals. However, their quadratic computational complexity poses a substantial computational challenge. Moreover, most EEG classification models are only suitable for single tasks and struggle with generalization across different tasks, particularly when faced with variations in signal length and channel count. In this paper, we introduce EEGMamba, the first universal EEG classification network to truly implement multi-task learning for EEG applications. EEGMamba seamlessly integrates the Spatio-Temporal-Adaptive (ST-Adaptive) module, bidirectional Mamba, and Mixture of Experts (MoE) into a unified framework. The proposed ST-Adaptive module performs unified feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · EEG and Brain-Computer Interfaces · Fault Detection and Control Systems
MethodsSparse Evolutionary Training · Convolution · Mixture of Experts · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Mamba: Linear-Time Sequence Modeling with Selective State Spaces
