MEBM-Speech: Multi-scale Enhanced BrainMagic for Robust MEG Speech Detection
Li Songyi, Zheng Linze, Liang Jinghua, Zhang Zifeng

TL;DR
MEBM-Speech is a multi-scale neural decoder that enhances MEG-based speech detection by integrating various temporal modeling techniques, achieving high accuracy and robustness for cognitive and clinical applications.
Contribution
The paper introduces MEBM-Speech, a novel multi-scale neural architecture with combined temporal modules and strategies for robust MEG speech activity detection.
Findings
Achieved 89.3% F1 macro on LibriBrain benchmark.
Demonstrated robustness in boundary detection and onset timing.
Outperformed existing methods in MEG speech decoding.
Abstract
We propose MEBM-Speech, a multi-scale enhanced neural decoder for speech activity detection from non-invasive magnetoencephalography (MEG) signals. Built upon the BrainMagic backbone, MEBM-Speech integrates three complementary temporal modeling mechanisms: a multi-scale convolutional module for short-term pattern extraction, a bidirectional LSTM (BiLSTM) for long-range context modeling, and a depthwise separable convolutional layer for efficient cross-scale feature fusion. A lightweight temporal jittering strategy and average pooling further improve onset robustness and boundary stability. The model performs continuous probabilistic decoding of MEG signals, enabling fine-grained detection of speech versus silence states - an ability crucial for both cognitive neuroscience and clinical applications. Comprehensive evaluations on the LibriBrain Competition 2025 Track1 benchmark demonstrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEEG and Brain-Computer Interfaces · Emotion and Mood Recognition · Speech Recognition and Synthesis
