BrainWhisperer: Leveraging Large-Scale ASR Models for Neural Speech Decoding
Tommaso Boccato, Michal Olak, Matteo Ferrante

TL;DR
BrainWhisperer is a novel neural speech decoder that combines intracortical recordings with a large pretrained ASR model, achieving improved accuracy and generalization for brain-computer interfaces in speech decoding.
Contribution
It introduces a hybrid neural decoding model that integrates MEA data with a modified Whisper ASR, incorporating domain-informed modifications for better cross-subject and cross-dataset generalization.
Findings
Outperforms prior state-of-the-art decoders on MEA datasets.
Cross-dataset training enhances performance without fine-tuning.
Supports dual decoding paths for accuracy and speed.
Abstract
Decoding continuous speech from intracortical recordings is a central challenge for brain-computer interfaces (BCIs), with transformative potential for individuals with conditions that impair their ability to speak. While recent microelectrode array (MEA) decoders achieve impressive accuracy, their performance is fundamentally limited by the small size of existing datasets, they remain brittle to session-to-session variability, and their ability to generalize across participants remains unexplored. We introduce BrainWhisperer, a neural speech decoder that integrates high-resolution MEA recordings with a large pretrained automatic speech recognition (ASR) model. Building on interpretability findings showing that Whisper's encoder learns phoneme-selective representations with localized attention, we train a customized version of Whisper, modified to process neural features, using a hybrid…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEEG and Brain-Computer Interfaces · Speech Recognition and Synthesis · Neurobiology of Language and Bilingualism
