Zero-Shot Imagined Speech Decoding via Imagined-to-Listened MEG Mapping
Maryam Maghsoudi, Shihab Shamma

TL;DR
This study introduces a novel three-stage pipeline for decoding imagined speech from MEG recordings by leveraging listening responses, demonstrating significant above-chance decoding and scalability with training data.
Contribution
It presents a new approach that maps imagined MEG responses to listened responses, enabling effective decoding of imagined speech using a multi-stage neural decoding pipeline.
Findings
Imagined words are decodable significantly above chance.
Performance improves with increased training data.
The approach is scalable and applicable to brain-computer interfaces.
Abstract
Decoding imagined speech from non-invasive brain recordings is challenging because imagined datasets are scarce and difficult to align temporally across subjects and sessions In this work, we propose a new approach to the decoding of imagined speech that leverages the richer and more reliably labeled recordings during listening to speech. We collected paired listened and imagined MEG recordings to rhythmic melodic and spoken stimuli from trained musicians. Using trained musicians helped improve temporal alignment across conditions. We then developed a three-stage decoding pipeline that revealed consistent and meaningful relationships between neural activity evoked by imagining and listening to the same stimuli. First, we trained six linear and neural models to map imagined MEG responses to listened responses. We evaluated these models against a null baseline from unseen subjects to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
