ALIGN: Adversarial Learning for Generalizable Speech Neuroprosthesis
Zhanqi Zhang, Shun Li, Bernardo L. Sabatini, Mikio Aoi, Gal Mishne

TL;DR
ALIGN introduces an adversarial learning framework that enhances the generalization of intracortical speech decoding models across different recording sessions, addressing session variability issues in brain-computer interfaces.
Contribution
The paper proposes a novel session-invariant learning method using multi-domain adversarial neural networks for semi-supervised cross-session adaptation in BCIs.
Findings
ALIGN improves phoneme and word error rates on unseen sessions.
Adversarial domain alignment effectively mitigates session-level distribution shifts.
The method enhances robustness of speech decoding over time.
Abstract
Intracortical brain-computer interfaces (BCIs) can decode speech from neural activity with high accuracy when trained on data pooled across recording sessions. In realistic deployment, however, models must generalize to new sessions without labeled data, and performance often degrades due to cross-session nonstationarities (e.g., electrode shifts, neural turnover, and changes in user strategy). In this paper, we propose ALIGN, a session-invariant learning framework based on multi-domain adversarial neural networks for semi-supervised cross-session adaptation. ALIGN trains a feature encoder jointly with a phoneme classifier and a domain classifier operating on the latent representation. Through adversarial optimization, the encoder is encouraged to preserve task-relevant information while suppressing session-specific cues. We evaluate ALIGN on intracortical speech decoding and find that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEEG and Brain-Computer Interfaces · Speech Recognition and Synthesis · Domain Adaptation and Few-Shot Learning
