IsoNet: Causal Analysis of Multimodal Transformers for Neuromuscular Gesture Classification
Eion Tyacke, Kunal Gupta, Jay Patel, Rui Li

TL;DR
This paper evaluates multimodal fusion strategies in transformer architectures for neuromuscular gesture classification, demonstrating that attention-based hierarchical fusion significantly improves accuracy and provides insights into modality interactions.
Contribution
It introduces an Isolation Network to quantify modality interactions and compares fusion strategies across multiple architectures, advancing understanding of multimodal biosignal decoding.
Findings
Hierarchical Transformer with attention fusion outperforms baselines by over 10%.
Cross-modal interactions contribute about 30% of decision signals.
Attention-driven fusion enhances biosignal classification accuracy.
Abstract
Hand gestures are a primary output of the human motor system, yet the decoding of their neuromuscular signatures remains a bottleneck for basic neuroscience and assistive technologies such as prosthetics. Traditional human-machine interface pipelines rely on a single biosignal modality, but multimodal fusion can exploit complementary information from sensors. We systematically compare linear and attention-based fusion strategies across three architectures: a Multimodal MLP, a Multimodal Transformer, and a Hierarchical Transformer, evaluating performance on scenarios with unimodal and multimodal inputs. Experiments use two publicly available datasets: NinaPro DB2 (sEMG and accelerometer) and HD-sEMG 65-Gesture (high-density sEMG and force). Across both datasets, the Hierarchical Transformer with attention-based fusion consistently achieved the highest accuracy, surpassing the multimodal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMuscle activation and electromyography studies · Advanced Sensor and Energy Harvesting Materials · EEG and Brain-Computer Interfaces
MethodsLayer Normalization · Dropout · Absolute Position Encodings · Dense Connections · Byte Pair Encoding · Softmax · Label Smoothing · Transformer
