MEBM-Speech: Multi-scale Enhanced BrainMagic for Robust MEG Speech Detection

Li Songyi; Zheng Linze; Liang Jinghua; Zhang Zifeng

arXiv:2603.02255·cs.SD·March 4, 2026

MEBM-Speech: Multi-scale Enhanced BrainMagic for Robust MEG Speech Detection

Li Songyi, Zheng Linze, Liang Jinghua, Zhang Zifeng

PDF

Open Access

TL;DR

MEBM-Speech is a multi-scale neural decoder that enhances MEG-based speech detection by integrating various temporal modeling techniques, achieving high accuracy and robustness for cognitive and clinical applications.

Contribution

The paper introduces MEBM-Speech, a novel multi-scale neural architecture with combined temporal modules and strategies for robust MEG speech activity detection.

Findings

01

Achieved 89.3% F1 macro on LibriBrain benchmark.

02

Demonstrated robustness in boundary detection and onset timing.

03

Outperformed existing methods in MEG speech decoding.

Abstract

We propose MEBM-Speech, a multi-scale enhanced neural decoder for speech activity detection from non-invasive magnetoencephalography (MEG) signals. Built upon the BrainMagic backbone, MEBM-Speech integrates three complementary temporal modeling mechanisms: a multi-scale convolutional module for short-term pattern extraction, a bidirectional LSTM (BiLSTM) for long-range context modeling, and a depthwise separable convolutional layer for efficient cross-scale feature fusion. A lightweight temporal jittering strategy and average pooling further improve onset robustness and boundary stability. The model performs continuous probabilistic decoding of MEG signals, enabling fine-grained detection of speech versus silence states - an ability crucial for both cognitive neuroscience and clinical applications. Comprehensive evaluations on the LibriBrain Competition 2025 Track1 benchmark demonstrate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEEG and Brain-Computer Interfaces · Emotion and Mood Recognition · Speech Recognition and Synthesis