TL;DR
This paper introduces RA-GCN, a multi-stream graph convolutional network that enhances robustness in skeleton-based action recognition by leveraging rich joint features and class activation maps to mitigate effects of occlusion and noise.
Contribution
The paper proposes a novel RA-GCN architecture that uses sequential streams guided by class activation maps to focus on unactivated joints, improving robustness against incomplete or noisy skeleton data.
Findings
Achieves comparable accuracy to state-of-the-art on NTU datasets.
Significantly reduces performance loss under occlusion and jittering.
Demonstrates robustness in synthetic noisy skeleton scenarios.
Abstract
Current methods for skeleton-based human action recognition usually work with complete skeletons. However, in real scenarios, it is inevitable to capture incomplete or noisy skeletons, which could significantly deteriorate the performance of current methods when some informative joints are occluded or disturbed. To improve the robustness of action recognition models, a multi-stream graph convolutional network (GCN) is proposed to explore sufficient discriminative features spreading over all skeleton joints, so that the distributed redundant representation reduces the sensitivity of the action models to non-standard skeletons. Concretely, the backbone GCN is extended by a series of ordered streams which is responsible for learning discriminative features from the joints less activated by preceding streams. Here, the activation degrees of skeleton joints of each GCN stream are measured by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsGraph Convolutional Network
