Are Spatial-Temporal Graph Convolution Networks for Human Action Recognition Over-Parameterized?

Jianyang Xie; Yitian Zhao; Yanda Meng; He Zhao; Anh Nguyen; Yalin Zheng

arXiv:2505.10679·cs.CV·May 19, 2025

Are Spatial-Temporal Graph Convolution Networks for Human Action Recognition Over-Parameterized?

Jianyang Xie, Yitian Zhao, Yanda Meng, He Zhao, Anh Nguyen, Yalin Zheng

PDF

Open Access 1 Repo

TL;DR

This paper investigates the over-parameterization of spatial-temporal graph convolutional networks for human action recognition and introduces a sparse architecture that maintains performance with significantly fewer parameters.

Contribution

The authors propose a novel sparse ST-GCN generator and multi-level sparsity models that reduce parameters while preserving or improving recognition accuracy.

Findings

01

Sparse ST-GCNs achieve comparable performance with 95% fewer parameters.

02

Multi-level sparsity ST-GCNs outperform dense models with only 66% of the parameters.

03

Experiments on four datasets validate the effectiveness of the sparse architectures.

Abstract

Spatial-temporal graph convolutional networks (ST-GCNs) showcase impressive performance in skeleton-based human action recognition (HAR). However, despite the development of numerous models, their recognition performance does not differ significantly after aligning the input settings. With this observation, we hypothesize that ST-GCNs are over-parameterized for HAR, a conjecture subsequently confirmed through experiments employing the lottery ticket hypothesis. Additionally, a novel sparse ST-GCNs generator is proposed, which trains a sparse architecture from a randomly initialized dense network while maintaining comparable performance levels to the dense components. Moreover, we generate multi-level sparsity ST-GCNs by integrating sparse structures at various sparsity levels and demonstrate that the assembled model yields a significant enhancement in HAR performance. Thorough…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

davelailai/sparse-st-gcn
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Context-Aware Activity Recognition Systems · Advanced Technologies in Various Fields