Confidence-Based Self-Training for EMG-to-Speech: Leveraging Synthetic EMG for Robust Modeling

Xiaodan Chen; Xiaoxue Gao; Mathias Quoy; Alexandre Pitti; Nancy F.Chen

arXiv:2506.11862·cs.SD·January 13, 2026

Confidence-Based Self-Training for EMG-to-Speech: Leveraging Synthetic EMG for Robust Modeling

Xiaodan Chen, Xiaoxue Gao, Mathias Quoy, Alexandre Pitti, Nancy F.Chen

PDF

Open Access

TL;DR

This paper introduces a confidence-based self-training method for EMG-to-speech conversion, utilizing synthetic EMG data and a new dataset to improve speech reconstruction accuracy despite limited real data.

Contribution

It presents a novel confidence-based multi-speaker self-training approach and a curated dataset to enhance EMG-to-speech models with synthetic data filtering.

Findings

01

Improved phoneme accuracy in EMG-to-speech models

02

Reduced phonological confusion

03

Lowered word error rate

Abstract

Voiced Electromyography (EMG)-to-Speech (V-ETS) models reconstruct speech from muscle activity signals, facilitating applications such as neurolaryngologic diagnostics. Despite its potential, the advancement of V-ETS is hindered by a scarcity of paired EMG-speech data. To address this, we propose a novel Confidence-based Multi-Speaker Self-training (CoM2S) approach, along with a newly curated Libri-EMG dataset. This approach leverages synthetic EMG data generated by a pre-trained model, followed by a proposed filtering mechanism based on phoneme-level confidence to enhance the ETS model through the proposed self-training techniques. Experiments demonstrate our method improves phoneme accuracy, reduces phonological confusion, and lowers word error rate, confirming the effectiveness of our CoM2S approach for V-ETS. In support of future research, we will release the codes and the proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Muscle activation and electromyography studies · Speech Recognition and Synthesis