Transfer Learning from ImageNet for MEG-Based Decoding of Imagined Speech

Soufiane Jhilal; St\'ephanie Martin; Anne-Lise Giraud

arXiv:2601.15909·cs.CL·January 23, 2026

Transfer Learning from ImageNet for MEG-Based Decoding of Imagined Speech

Soufiane Jhilal, St\'ephanie Martin, Anne-Lise Giraud

PDF

Open Access

TL;DR

This paper demonstrates that transforming MEG signals into image-like representations and applying pretrained vision models significantly improves decoding of imagined speech, achieving high accuracy and revealing shared neural patterns.

Contribution

It introduces an innovative image-based approach for MEG decoding of imagined speech using pretrained vision models, outperforming classical methods.

Findings

01

Achieved up to 90.4% accuracy in imagined speech detection.

02

Pretrained vision models capture shared neural representations across subjects.

03

Temporal analysis localized discriminative information to specific intervals.

Abstract

Non-invasive decoding of imagined speech remains challenging due to weak, distributed signals and limited labeled data. Our paper introduces an image-based approach that transforms magnetoencephalography (MEG) signals into time-frequency representations compatible with pretrained vision models. MEG data from 21 participants performing imagined speech tasks were projected into three spatial scalogram mixtures via a learnable sensor-space convolution, producing compact image-like inputs for ImageNet-pretrained vision architectures. These models outperformed classical and non-pretrained models, achieving up to 90.4% balanced accuracy for imagery vs. silence, 81.0% vs. silent reading, and 60.6% for vowel decoding. Cross-subject evaluation confirmed that pretrained models capture shared neural representations, and temporal analyses localized discriminative information to imagery-locked…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPhonetics and Phonology Research · Speech Recognition and Synthesis · Emotion and Mood Recognition