Ske2Grid: Skeleton-to-Grid Representation Learning for Action   Recognition

Dongqi Cai; Yangyuxuan Kang; Anbang Yao; Yurong Chen

arXiv:2308.07571·cs.CV·August 16, 2023·2 cites

Ske2Grid: Skeleton-to-Grid Representation Learning for Action Recognition

Dongqi Cai, Yangyuxuan Kang, Anbang Yao, Yurong Chen

PDF

Open Access 1 Repo 1 Video

TL;DR

Ske2Grid introduces a novel grid-based skeleton representation with a learnable convolution framework, significantly improving skeleton-based action recognition accuracy across multiple datasets.

Contribution

The paper proposes Ske2Grid, a new grid representation learning method with a graph-node index transform, up-sampling transform, and progressive learning strategy for enhanced action recognition.

Findings

01

Outperforms existing GCN-based methods on six datasets

02

Achieves significant accuracy improvements without complex techniques

03

Demonstrates the effectiveness of grid-based skeleton representations

Abstract

This paper presents Ske2Grid, a new representation learning framework for improved skeleton-based action recognition. In Ske2Grid, we define a regular convolution operation upon a novel grid representation of human skeleton, which is a compact image-like grid patch constructed and learned through three novel designs. Specifically, we propose a graph-node index transform (GIT) to construct a regular grid patch through assigning the nodes in the skeleton graph one by one to the desired grid cells. To ensure that GIT is a bijection and enrich the expressiveness of the grid representation, an up-sampling transform (UPT) is learned to interpolate the skeleton graph nodes for filling the grid patch to the full. To resolve the problem when the one-step UPT is aggressive and further exploit the representation capability of the grid patch with increasing spatial size, a progressive learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

osvai/ske2grid
pytorchOfficial

Videos

Ske2Grid: Skeleton-to-Grid Representation Learning for Action Recognition· slideslive

Taxonomy

TopicsHuman Pose and Action Recognition · Multimodal Machine Learning Applications · Stroke Rehabilitation and Recovery

MethodsConvolution