ALIGN: Adversarial Learning for Generalizable Speech Neuroprosthesis

Zhanqi Zhang; Shun Li; Bernardo L. Sabatini; Mikio Aoi; Gal Mishne

arXiv:2603.18299·cs.LG·March 20, 2026

ALIGN: Adversarial Learning for Generalizable Speech Neuroprosthesis

Zhanqi Zhang, Shun Li, Bernardo L. Sabatini, Mikio Aoi, Gal Mishne

PDF

Open Access

TL;DR

ALIGN introduces an adversarial learning framework that enhances the generalization of intracortical speech decoding models across different recording sessions, addressing session variability issues in brain-computer interfaces.

Contribution

The paper proposes a novel session-invariant learning method using multi-domain adversarial neural networks for semi-supervised cross-session adaptation in BCIs.

Findings

01

ALIGN improves phoneme and word error rates on unseen sessions.

02

Adversarial domain alignment effectively mitigates session-level distribution shifts.

03

The method enhances robustness of speech decoding over time.

Abstract

Intracortical brain-computer interfaces (BCIs) can decode speech from neural activity with high accuracy when trained on data pooled across recording sessions. In realistic deployment, however, models must generalize to new sessions without labeled data, and performance often degrades due to cross-session nonstationarities (e.g., electrode shifts, neural turnover, and changes in user strategy). In this paper, we propose ALIGN, a session-invariant learning framework based on multi-domain adversarial neural networks for semi-supervised cross-session adaptation. ALIGN trains a feature encoder jointly with a phoneme classifier and a domain classifier operating on the latent representation. Through adversarial optimization, the encoder is encouraged to preserve task-relevant information while suppressing session-specific cues. We evaluate ALIGN on intracortical speech decoding and find that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEEG and Brain-Computer Interfaces · Speech Recognition and Synthesis · Domain Adaptation and Few-Shot Learning