Skeleton-Based Action Recognition with Multi-Stream Adaptive Graph   Convolutional Networks

Lei Shi; Yifan Zhang; Jian Cheng; Hanqing Lu

arXiv:1912.06971·cs.CV·December 2, 2020

Skeleton-Based Action Recognition with Multi-Stream Adaptive Graph Convolutional Networks

Lei Shi, Yifan Zhang, Jian Cheng, Hanqing Lu

PDF

2 Repos

TL;DR

This paper introduces MS-AAGCN, a flexible graph convolutional network with adaptive topology and multi-stream attention, significantly improving skeleton-based action recognition accuracy on large datasets.

Contribution

The work presents a novel adaptive graph construction method and a multi-stream framework with attention modules, enhancing recognition performance over fixed-topology GCNs.

Findings

01

Outperforms state-of-the-art on NTU-RGBD and Kinetics-Skeleton datasets.

02

Adaptive graph learning improves model flexibility and accuracy.

03

Multi-stream attention enhances focus on key joints and features.

Abstract

Graph convolutional networks (GCNs), which generalize CNNs to more generic non-Euclidean structures, have achieved remarkable performance for skeleton-based action recognition. However, there still exist several issues in the previous GCN-based models. First, the topology of the graph is set heuristically and fixed over all the model layers and input data. This may not be suitable for the hierarchy of the GCN model and the diversity of the data in action recognition tasks. Second, the second-order information of the skeleton data, i.e., the length and orientation of the bones, is rarely investigated, which is naturally more informative and discriminative for the human action recognition. In this work, we propose a novel multi-stream attention-enhanced adaptive graph convolutional neural network (MS-AAGCN) for skeleton-based action recognition. The graph topology in our model can be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsGraph Convolutional Network