CAT-Net: A Cross-Attention Tone Network for Cross-Subject EEG-EMG Fusion Tone Decoding
Yifan Zhuang, Calvin Huang, Zepeng Yu, Yongjie Zou, Jiawei Ju

TL;DR
This paper introduces CAT-Net, a novel cross-attention neural network that fuses EEG and EMG signals for Mandarin tone decoding, achieving high accuracy with minimal channels and good cross-subject generalization.
Contribution
The study proposes a new multimodal BCI framework with a cross-attention mechanism and domain-adversarial training for effective cross-subject Mandarin tone classification.
Findings
Achieved over 87% accuracy in tone classification.
Demonstrated effective decoding with minimal EEG and EMG channels.
Maintained strong cross-subject performance with over 83% accuracy.
Abstract
Brain-computer interface (BCI) speech decoding has emerged as a promising tool for assisting individuals with speech impairments. In this context, the integration of electroencephalography (EEG) and electromyography (EMG) signals offers strong potential for enhancing decoding performance. Mandarin tone classification presents particular challenges, as tonal variations convey distinct meanings even when phonemes remain identical. In this study, we propose a novel cross-subject multimodal BCI decoding framework that fuses EEG and EMG signals to classify four Mandarin tones under both audible and silent speech conditions. Inspired by the cooperative mechanisms of neural and muscular systems in speech production, our neural decoding architecture combines spatial-temporal feature extraction branches with a cross-attention fusion mechanism, enabling informative interaction between modalities.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsEEG and Brain-Computer Interfaces · Emotion and Mood Recognition · Muscle activation and electromyography studies
