Towards unified brain-to-text decoding across speech production and perception
Zhizhang Yuan, Yang Yang, Gaorui Zhang, Baowen Cheng, Zehan Wu, Yuhao Xu, Xiaoying Liu, Liang Chen, Ying Mao, Meng Li

TL;DR
This study introduces a unified brain-to-text decoding framework for Mandarin Chinese that generalizes across speech production and perception, leveraging neural signals and large language models to decode sentences from single-character training data.
Contribution
The paper presents a novel unified decoding framework for both speech production and perception in Mandarin, demonstrating strong generalization and improved decoding performance using a large language model.
Findings
Decoding involves classifying syllable components from neural signals.
Neural responses differ across cortical regions and modalities.
Decoding performance is similar across hemispheres.
Abstract
Speech production and perception are the main ways humans communicate daily. Prior brain-to-text decoding studies have largely focused on a single modality and alphabetic languages. Here, we present a unified brain-to-sentence decoding framework for both speech production and perception in Mandarin Chinese. The framework exhibits strong generalization ability, enabling sentence-level decoding when trained only on single-character data and supporting characters and syllables unseen during training. In addition, it allows direct and controlled comparison of neural dynamics across modalities. Mandarin speech is decoded by first classifying syllable components in Hanyu Pinyin, namely initials and finals, from neural signals, followed by a post-trained large language model (LLM) that maps sequences of toneless Pinyin syllables to Chinese sentences. To enhance LLM decoding, we designed a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeurobiology of Language and Bilingualism · Phonetics and Phonology Research · Language Development and Disorders
