Multi-Dimensional Refinement Graph Convolutional Network with Robust Decouple Loss for Fine-Grained Skeleton-Based Action Recognition
Sheng-Lan Liu, Yu-Ning Ding, Jin-Rong Zhang, Kai-Yuan Liu, Si-Fan, Zhang, Fei-Long Wang, and Gao Huang

TL;DR
This paper introduces a novel graph convolutional network with a specialized attention mechanism and a robust loss function to improve fine-grained skeleton-based action recognition, effectively handling noisy data and inter-class similarities.
Contribution
It proposes the MDR-GCN model with CVSTA attention block and RDL loss, advancing fine-grained action recognition accuracy over existing methods.
Findings
Outperforms state-of-the-art on FineGym99 and FSD-10 datasets.
Effectively reduces noise impact in skeleton data.
Enhances discrimination among subtle action classes.
Abstract
Graph convolutional networks have been widely used in skeleton-based action recognition. However, existing approaches are limited in fine-grained action recognition due to the similarity of inter-class data. Moreover, the noisy data from pose extraction increases the challenge of fine-grained recognition. In this work, we propose a flexible attention block called Channel-Variable Spatial-Temporal Attention (CVSTA) to enhance the discriminative power of spatial-temporal joints and obtain a more compact intra-class feature distribution. Based on CVSTA, we construct a Multi-Dimensional Refinement Graph Convolutional Network (MDR-GCN), which can improve the discrimination among channel-, joint- and frame-level features for fine-grained actions. Furthermore, we propose a Robust Decouple Loss (RDL), which significantly boosts the effect of the CVSTA and reduces the impact of noise. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Stroke Rehabilitation and Recovery · Medical Imaging and Analysis
