Revisiting Skeleton-based Action Recognition

Haodong Duan; Yue Zhao; Kai Chen; Dahua Lin; Bo Dai

arXiv:2104.13586·cs.CV·April 5, 2022·22 cites

Revisiting Skeleton-based Action Recognition

Haodong Duan, Yue Zhao, Kai Chen, Dahua Lin, Bo Dai

PDF

Open Access 4 Repos

TL;DR

This paper introduces PoseC3D, a novel 3D heatmap-based approach for skeleton-based action recognition that outperforms GCN methods in robustness, scalability, and multi-person scenarios.

Contribution

PoseC3D replaces graph convolutional networks with 3D heatmaps, improving feature learning, robustness, and multi-modal integration in skeleton-based action recognition.

Findings

01

PoseC3D outperforms GCN-based methods on four datasets.

02

PoseC3D is robust to pose estimation noise.

03

PoseC3D effectively handles multi-person scenarios.

Abstract

Human skeleton, as a compact representation of human action, has received increasing attention in recent years. Many skeleton-based action recognition methods adopt graph convolutional networks (GCN) to extract features on top of human skeletons. Despite the positive results shown in previous works, GCN-based methods are subject to limitations in robustness, interoperability, and scalability. In this work, we propose PoseC3D, a new approach to skeleton-based action recognition, which relies on a 3D heatmap stack instead of a graph sequence as the base representation of human skeletons. Compared to GCN-based methods, PoseC3D is more effective in learning spatiotemporal features, more robust against pose estimation noises, and generalizes better in cross-dataset settings. Also, PoseC3D can handle multiple-person scenarios without additional computation cost, and its features can be easily…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Gait Recognition and Analysis · Hand Gesture Recognition Systems

MethodsGraph Convolutional Networks · Heatmap