Pose Uncertainty Aware Movement Synchrony Estimation via Spatial-Temporal Graph Transformer
Jicheng Li, Anjana Bhat, Roghayeh Barmaki

TL;DR
This paper introduces a novel skeleton-based graph transformer model that estimates movement synchrony with pose uncertainty awareness, leveraging a new dataset and techniques like contrastive learning and knowledge distillation for improved accuracy and privacy.
Contribution
It presents a specialized spatial-temporal graph transformer for movement synchrony estimation that incorporates pose confidence scores and a temporal similarity matrix, advancing prior methods.
Findings
Achieved 88.98% accuracy on PT13 dataset.
Surpassed existing approaches significantly in movement synchrony estimation.
Effectively integrated pose uncertainty and privacy-preserving techniques.
Abstract
Movement synchrony reflects the coordination of body movements between interacting dyads. The estimation of movement synchrony has been automated by powerful deep learning models such as transformer networks. However, instead of designing a specialized network for movement synchrony estimation, previous transformer-based works broadly adopted architectures from other tasks such as human activity recognition. Therefore, this paper proposed a skeleton-based graph transformer for movement synchrony estimation. The proposed model applied ST-GCN, a spatial-temporal graph convolutional neural network for skeleton feature extraction, followed by a spatial transformer for spatial feature generation. The spatial transformer is guided by a uniquely designed joint position embedding shared between the same joints of interacting individuals. Besides, we incorporated a temporal similarity matrix in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStroke Rehabilitation and Recovery · EEG and Brain-Computer Interfaces · Muscle activation and electromyography studies
MethodsAttention Is All You Need · Linear Layer · Multi-Head Attention · Softmax · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Layer Normalization · Adam · Byte Pair Encoding · Label Smoothing
