Constructing Stronger and Faster Baselines for Skeleton-based Action Recognition
Yi-Fan Song, Zhang Zhang, Caifeng Shan, Liang Wang

TL;DR
This paper introduces an efficient, scalable GCN baseline for skeleton-based action recognition that outperforms state-of-the-art methods in accuracy and speed while maintaining smaller model size.
Contribution
The paper proposes a novel efficient GCN baseline with a compound scaling strategy, significantly improving accuracy and efficiency over existing models.
Findings
EfficientGCN-B4 achieves 91.7% accuracy on NTU 60 cross-subject benchmark.
The proposed model is 3.15x smaller and 3.21x faster than MS-G3D.
The method outperforms other SOTA models on large-scale datasets.
Abstract
One essential problem in skeleton-based action recognition is how to extract discriminative features over all skeleton joints. However, the complexity of the recent State-Of-The-Art (SOTA) models for this task tends to be exceedingly sophisticated and over-parameterized. The low efficiency in model training and inference has increased the validation costs of model architectures in large-scale datasets. To address the above issue, recent advanced separable convolutional layers are embedded into an early fused Multiple Input Branches (MIB) network, constructing an efficient Graph Convolutional Network (GCN) baseline for skeleton-based action recognition. In addition, based on such the baseline, we design a compound scaling strategy to expand the model's width and depth synchronously, and eventually obtain a family of efficient GCN baselines with high accuracies and small amounts of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Stroke Rehabilitation and Recovery · Medical Imaging and Analysis
MethodsGraph Convolutional Network
