Temporal Attention-Augmented Graph Convolutional Network for Efficient Skeleton-Based Human Action Recognition
Negar Heidari, Alexandros Iosifidis

TL;DR
This paper introduces a temporal attention module integrated into a lightweight graph convolutional network to efficiently recognize human actions from skeleton data, significantly reducing computational costs while maintaining high accuracy.
Contribution
It proposes a novel temporal attention module for skeleton-based action recognition that enhances efficiency and reduces computation in GCN models.
Findings
Outperforms baseline GCN with 2.9x fewer computations
Achieves comparable accuracy to state-of-the-art with 9.6x less computation
Demonstrates effectiveness on benchmark datasets
Abstract
Graph convolutional networks (GCNs) have been very successful in modeling non-Euclidean data structures, like sequences of body skeletons forming actions modeled as spatio-temporal graphs. Most GCN-based action recognition methods use deep feed-forward networks with high computational complexity to process all skeletons in an action. This leads to a high number of floating point operations (ranging from 16G to 100G FLOPs) to process a single sample, making their adoption in restricted computation application scenarios infeasible. In this paper, we propose a temporal attention module (TAM) for increasing the efficiency in skeleton-based action recognition by selecting the most informative skeletons of an action at the early layers of the network. We incorporate the TAM in a light-weight GCN topology to further reduce the overall number of computations. Experimental results on two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTemporal Adaptive Module · Graph Convolutional Network
