MoDAl: Self-Supervised Neural Modality Discovery via Decorrelation for Speech Neuroprosthesis
Yuanhao Chen, Peter Chin

TL;DR
MoDAl introduces a self-supervised framework that discovers diverse neural modalities for speech decoding by balancing alignment with language models and decorrelation to prevent redundancy, improving accuracy in speech neuroprosthesis.
Contribution
It proposes a novel modality discovery method using contrastive and decorrelation losses, revealing functional specialization in neural signals for speech decoding.
Findings
Reduced word error rate from 26.3% to 21.6% on Brain-to-Text Benchmark '24.
Discovered that area 44 encodes structural and syntactic information.
Decorrelation mechanism enhances the diversity of neural modalities.
Abstract
Speech neuroprosthesis systems decode intended speech from neural activity in the absence of audible output, offering a path to restoring communication for individuals with speech-impairing conditions. Current approaches decode predominantly from motor cortical areas, discarding others -- such as area 44, part of Broca's area -- that may encode complementary linguistic information. We introduce MoDAl (Modality Decorrelation and Alignment), a framework that discovers complementary neural modalities through the interplay of two objectives in a shared projection space. A contrastive loss aligns each of several parallel brain encoders with the text embeddings of a pretrained large language model (LLM), while a decorrelation loss prevents the encoders from coalescing to duplicative representations. We prove that these objectives are in productive tension: Contrastive alignment induces…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
