Skeleton based Zero Shot Action Recognition in Joint Pose-Language   Semantic Space

Bhavan Jasani; Afshaan Mazagonwalla

arXiv:1911.11344·cs.CV·November 27, 2019·19 cites

Skeleton based Zero Shot Action Recognition in Joint Pose-Language Semantic Space

Bhavan Jasani, Afshaan Mazagonwalla

PDF

Open Access

TL;DR

This paper introduces a zero shot action recognition method using joint pose-language semantic space, enabling the classification of unseen actions based on pose features and natural language descriptions, demonstrated on NTU RGB-D dataset.

Contribution

The work presents a novel pose-based zero shot action recognition network that jointly models visual pose features and natural language semantics for unseen action classification.

Findings

01

Effective recognition of unseen actions in NTU RGB-D dataset

02

Joint pose-language semantic space encodes useful knowledge

03

Model outperforms baseline methods in zero shot setting

Abstract

How does one represent an action? How does one describe an action that we have never seen before? Such questions are addressed by the Zero Shot Learning paradigm, where a model is trained on only a subset of classes and is evaluated on its ability to correctly classify an example from a class it has never seen before. In this work, we present a body pose based zero shot action recognition network and demonstrate its performance on the NTU RGB-D dataset. Our model learns to jointly encapsulate visual similarities based on pose features of the action performer as well as similarities in the natural language descriptions of the unseen action class names. We demonstrate how this pose-language semantic space encodes knowledge which allows our model to correctly predict actions not seen during training.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Gait Recognition and Analysis