Decoding the decoder: Contextual sequence-to-sequence modeling for intracortical speech decoding
Michal Olak, Tommaso Boccato, Matteo Ferrante

TL;DR
This paper introduces a Transformer-based sequence-to-sequence model with a novel calibration method for intracortical speech decoding, achieving state-of-the-art accuracy and providing insights into neural speech processing.
Contribution
It presents a multitask Transformer model with a new calibration module that enhances speech decoding robustness and interpretability from intracortical recordings.
Findings
Achieved 14.3% phoneme error rate on Willett dataset.
Word decoding accuracy reached 25.6% WER with direct decoding.
The Neural Hammer Scalpel (NHS) calibration improved decoding performance.
Abstract
Speech brain--computer interfaces require decoders that translate intracortical activity into linguistic output while remaining robust to limited data and day-to-day variability. While prior high-performing systems have largely relied on framewise phoneme decoding combined with downstream language models, it remains unclear what contextual sequence-to-sequence decoding contributes to sublexical neural readout, robustness, and interpretability. We evaluated a multitask Transformer-based sequence-to-sequence model for attempted speech decoding from area 6v intracortical recordings. The model jointly predicts phoneme sequences, word sequences, and auxiliary acoustic features. To address day-to-day nonstationarity, we introduced the Neural Hammer Scalpel (NHS) calibration module, which combines global alignment with feature-wise modulation. We further analyzed held-out-day generalization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEEG and Brain-Computer Interfaces · Functional Brain Connectivity Studies · Neurobiology of Language and Bilingualism
