Towards EMG-to-Speech with a Necklace Form Factor

Peter Wu; Ryan Kaveh; Raghav Nautiyal; Christine Zhang; Albert Guo; Anvitha Kachinthaya; Tavish Mishra; Bohan Yu; Alan W Black; Rikky Muller; Gopala Krishna Anumanchipalli

arXiv:2407.21345·eess.AS·August 1, 2024·1 cites

Open Access

TL;DR

This paper investigates a neck-worn, dry-electrode EMG device for speech decoding. Classifiers for 11-word, multi-speaker voiced EMG reach 92.7% accuracy, and ablations show the importance of using more than two neck electrodes, pointing toward more convenient speech interfaces.

Contribution

It introduces a neck-worn EMG device for speech decoding, reports 92.7% accuracy on 11-word voiced EMG classification, and analyzes electrode importance, phonological confusions, and speech-EMG correlations.

Findings

01 92.7% accuracy for 11-word, multi-speaker voiced EMG classification with the neck device

02 Using more than two electrodes on the neck is important for performance

03 Many EMG spectrogram frequency bins relate linearly to self-supervised speech representation dimensions

Abstract

Electrodes for decoding speech from electromyography (EMG) are typically placed on the face, requiring adhesives that are inconvenient and skin-irritating if used regularly. We explore a different device form factor, where dry electrodes are placed around the neck instead. 11-word, multi-speaker voiced EMG classifiers trained on data recorded with this device achieve 92.7% accuracy. Ablation studies reveal the importance of having more than two electrodes on the neck, and phonological analyses reveal similar classification confusions between neck-only and neck-and-face form factors. Finally, speech-EMG correlation experiments demonstrate a linear relationship between many EMG spectrogram frequency bins and self-supervised speech representation dimensions.
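The speech-EMG correlation analysis mentioned in the abstract can be pictured with a small sketch. This is not the authors' code: it assumes time-aligned frames, uses Pearson correlation as one plausible measure of linear relationship, and runs on synthetic stand-in data. The function name `cross_correlation` and all array shapes are hypothetical choices made for illustration.

```python
import numpy as np


def cross_correlation(emg_spec, speech_feats):
    """Pearson r between each EMG frequency bin and each speech feature dimension.

    emg_spec:     (T, F) array, one spectrogram frame per row
    speech_feats: (T, D) array, assumed time-aligned with emg_spec
    returns:      (F, D) array of correlation coefficients
    """
    # Center each column, then scale to unit norm so the dot product is Pearson r.
    x = emg_spec - emg_spec.mean(axis=0)
    y = speech_feats - speech_feats.mean(axis=0)
    x /= np.linalg.norm(x, axis=0) + 1e-12
    y /= np.linalg.norm(y, axis=0) + 1e-12
    return x.T @ y


# Synthetic stand-in data (not from the paper).
rng = np.random.default_rng(0)
T, F, D = 500, 8, 4
emg = rng.standard_normal((T, F))
speech = rng.standard_normal((T, D))
# Plant one linear relationship: speech dimension 0 follows EMG bin 3.
speech[:, 0] = 2.0 * emg[:, 3] + 0.1 * rng.standard_normal(T)

r = cross_correlation(emg, speech)
strongest_bin = int(np.argmax(np.abs(r[:, 0])))
print("bin most correlated with speech dim 0:", strongest_bin)
```

In this toy setup the planted pair stands out in the (F, D) correlation matrix, which is the kind of pattern a bin-by-dimension analysis would look for in real recordings.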

Taxonomy

Topics: Digital Communication and Language · Assistive Technology in Communication and Mobility