Towards Homogeneous Lexical Tone Decoding from Heterogeneous   Intracranial Recordings

Di Wu; Siyuan Li; Chen Feng; Lu Cao; Yue Zhang; Jie Yang; Mohamad; Sawan

arXiv:2410.12866·cs.CL·February 19, 2025

Towards Homogeneous Lexical Tone Decoding from Heterogeneous Intracranial Recordings

Di Wu, Siyuan Li, Chen Feng, Lu Cao, Yue Zhang, Jie Yang, Mohamad, Sawan

PDF

Open Access

TL;DR

This paper introduces H2DiLR, a novel framework that disentangles neural homogeneity and heterogeneity to improve lexical tone decoding from intracranial recordings across multiple subjects, advancing brain-computer interface capabilities for speech-impaired tonal language speakers.

Contribution

The paper presents H2DiLR, a new method that effectively captures shared and individual neural features for improved cross-subject lexical tone decoding.

Findings

01

H2DiLR significantly outperforms traditional heterogeneous decoding methods.

02

H2DiLR effectively captures both homogeneity and heterogeneity in neural representations.

03

The framework demonstrates robust decoding across multiple Mandarin syllables.

Abstract

Recent advancements in brain-computer interfaces (BCIs) have enabled the decoding of lexical tones from intracranial recordings, offering the potential to restore the communication abilities of speech-impaired tonal language speakers. However, data heterogeneity induced by both physiological and instrumental factors poses a significant challenge for unified invasive brain tone decoding. Traditional subject-specific models, which operate under a heterogeneous decoding paradigm, fail to capture generalized neural representations and cannot effectively leverage data across subjects. To address these limitations, we introduce Homogeneity-Heterogeneity Disentangled Learning for neural Representations (H2DiLR), a novel framework that disentangles and learns both the homogeneity and heterogeneity from intracranial recordings across multiple subjects. To evaluate H2DiLR, we collected…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Music and Audio Processing · Speech and Audio Processing