Joint Temporal Pooling for Improving Skeleton-based Action Recognition

Shanaka Ramesh Gunasekara; Wanqing Li; Jack Yang; Philip Ogunbona

arXiv:2408.09356 · cs.CV · August 20, 2024

TL;DR

This paper introduces Joint Motion Adaptive Temporal Pooling (JMAP), a temporal pooling method for skeleton-based action recognition that adaptively emphasizes discriminative motion segments and outperforms conventional pooling methods on NTU RGB+D 120 and PKU-MMD.

Contribution

The paper proposes a new Joint Motion Adaptive Temporal Pooling (JMAP) method with frame-wise and joint-wise variants for enhanced action recognition.

Findings

01

JMAP improves recognition accuracy on NTU RGB+D 120 and PKU-MMD datasets.

02

JMAP effectively captures discriminative motion segments.

03

Experimental results demonstrate the superiority of JMAP over conventional pooling methods.

Abstract

In skeleton-based human action recognition, temporal pooling is a critical step for capturing the spatiotemporal relationships of joint dynamics. Conventional pooling methods treat every frame equally and do not preserve motion information. However, in an action sequence, only a few segments of frames carry discriminative information related to the action. This paper presents a novel Joint Motion Adaptive Temporal Pooling (JMAP) method for improving skeleton-based action recognition. Two variants of JMAP, frame-wise pooling and joint-wise pooling, are introduced. The efficacy of JMAP has been validated through experiments on the popular NTU RGB+D 120 and PKU-MMD datasets.
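
The abstract does not say how JMAP computes its pooling weights, so the sketch below is not the authors' method. It is a minimal illustration of the general idea the abstract describes: contrasting uniform average pooling, which treats every frame equally, with a pooling that weights frames by motion magnitude. The function names, the use of frame-to-frame displacement as the motion measure, the equal-width windows, and the `per_joint` switch (loosely mirroring the frame-wise and joint-wise variants) are all assumptions made for illustration.

```python
import numpy as np

def uniform_temporal_pool(x, out_len):
    """Average-pool a (T, J, C) skeleton sequence into out_len equal-width windows.

    Every frame in a window contributes equally. Assumes T >= out_len.
    """
    T = x.shape[0]
    edges = np.linspace(0, T, out_len + 1).astype(int)
    return np.stack([x[a:b].mean(axis=0) for a, b in zip(edges[:-1], edges[1:])])

def motion_weighted_pool(x, out_len, per_joint=False, eps=1e-8):
    """Hypothetical motion-weighted pooling (an assumption, not the paper's JMAP).

    Frames are weighted by the magnitude of their displacement from the
    previous frame. With per_joint=False one weight is shared by all joints
    in a frame; with per_joint=True each joint gets its own weight.
    """
    T = x.shape[0]
    # Frame-to-frame displacement; the first frame has zero motion by construction.
    diff = np.diff(x, axis=0, prepend=x[:1])          # (T, J, C)
    motion = np.linalg.norm(diff, axis=-1)            # (T, J)
    if not per_joint:
        # Share the summed motion of a frame across all of its joints.
        motion = np.broadcast_to(motion.sum(axis=1, keepdims=True), motion.shape)
    edges = np.linspace(0, T, out_len + 1).astype(int)
    pooled = []
    for a, b in zip(edges[:-1], edges[1:]):
        w = motion[a:b] + eps                         # eps keeps static windows well-defined
        w = w / w.sum(axis=0, keepdims=True)          # normalize over time within the window
        pooled.append((w[..., None] * x[a:b]).sum(axis=0))
    return np.stack(pooled)                           # (out_len, J, C)

# Usage: a random sequence with 64 frames, 25 joints, 3D coordinates.
seq = np.random.default_rng(0).normal(size=(64, 25, 3))
frame_wise = motion_weighted_pool(seq, out_len=16)
joint_wise = motion_weighted_pool(seq, out_len=16, per_joint=True)
```

In this sketch a window with no motion falls back to a plain average, because the `eps` term makes all weights equal; windows containing movement are dominated by the frames where the movement occurs.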

Taxonomy

Topics: Human Pose and Action Recognition · Anomaly Detection Techniques and Applications · Gait Recognition and Analysis