From Continuous sEMG Signals to Discrete Muscle State Tokens: A Robust and Interpretable Representation Framework
Yuepeng Chen, Kaili Zheng, Ji Wu, Zhuangzhuang Li, Ye Ma, Dongwei Liu, Chenyi Guo, and Xiangling Fu

TL;DR
This paper introduces a novel discrete, physiologically-informed tokenization framework for sEMG signals, enabling robust, interpretable, and efficient muscle activity representation across subjects, with strong performance in action recognition and movement assessment.
Contribution
The study presents a new tokenization method for sEMG signals, a large-scale benchmark dataset, and extensive evaluations demonstrating improved consistency, interpretability, and accuracy over raw signals.
Findings
High inter-subject consistency (Cohen's Kappa = 0.82)
Improved action recognition accuracy (75.5% with ViT)
Tokens reveal muscle activation patterns for movement quality assessment
Abstract
Surface electromyography (sEMG) signals exhibit substantial inter-subject variability and are highly susceptible to noise, posing challenges for robust and interpretable decoding. To address these limitations, we propose a discrete representation of sEMG signals based on a physiology-informed tokenization framework. The method employs a sliding window aligned with the minimal muscle contraction cycle to isolate individual muscle activation events. From each window, ten time-frequency features, including root mean square (RMS) and median frequency (MDF), are extracted, and K-means clustering is applied to group segments into representative muscle-state tokens. We also introduce a large-scale benchmark dataset, ActionEMG-43, comprising 43 diverse actions and sEMG recordings from 16 major muscle groups across the body. Based on this dataset, we conduct extensive evaluations to assess the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMuscle activation and electromyography studies · Advanced Sensor and Energy Harvesting Materials · Motor Control and Adaptation
