Masked Autoencoder with Swin Transformer Network for Mitigating Electrode Shift in HD-EMG-based Gesture Recognition
Kasra Laamerad, Mehran Shabanpour, Md. Rabiul Islam, Arash Mohammadi

TL;DR
This paper introduces MAST, a novel self-supervised learning framework using masked autoencoders and Swin Transformer architecture to enhance gesture recognition robustness against electrode shift in high-density sEMG signals.
Contribution
The paper proposes a new MAST framework that employs multiple masking strategies and a multi-path Swin-Unet architecture for improved electrode shift robustness in HD-sEMG gesture recognition.
Findings
MAST outperforms existing methods in robustness against electrode shift.
Multi-mask strategies enhance the learned representations.
Self-supervised pre-training improves generalization across sessions.
Abstract
Multi-channel surface Electromyography (sEMG), also referred to as high-density sEMG (HD-sEMG), plays a crucial role in improving gesture recognition performance for myoelectric control. Pattern recognition models developed based on HD-sEMG, however, are vulnerable to changing recording conditions (e.g., signal variability due to electrode shift). This has resulted in significant degradation in performance across subjects, and sessions. In this context, the paper proposes the Masked Autoencoder with Swin Transformer (MAST) framework, where training is performed on a masked subset of HDsEMG channels. A combination of four masking strategies, i.e., random block masking; temporal masking; sensor-wise random masking, and; multi-scale masking, is used to learn latent representations and increase robustness against electrode shift. The masked data is then passed through MAST's three-path…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · Muscle activation and electromyography studies
MethodsLinear Layer · Dense Connections · Stochastic Depth · Multi-Head Attention · Adam · Softmax · Dropout · Absolute Position Encodings · Label Smoothing · Byte Pair Encoding
