Learning a Pose Lexicon for Semantic Action Recognition

Lijuan Zhou; Wanqing Li; and Philip Ogunbona

arXiv:1604.00147·cs.CV·November 15, 2016

Learning a Pose Lexicon for Semantic Action Recognition

Lijuan Zhou, Wanqing Li, and Philip Ogunbona

PDF

TL;DR

This paper introduces a method to learn a semantic pose lexicon linking textual instructions with visual poses, enabling improved action recognition, including zero-shot scenarios, through probabilistic mapping and sequence matching.

Contribution

It proposes a novel approach to learn a pose lexicon that bridges semantic and visual representations for action recognition.

Findings

01

Effective in zero-shot action recognition

02

Validated on MSRC-12 and WorkoutSu-10 datasets

03

Improves semantic understanding of actions

Abstract

This paper presents a novel method for learning a pose lexicon comprising semantic poses defined by textual instructions and their associated visual poses defined by visual features. The proposed method simultaneously takes two input streams, semantic poses and visual pose candidates, and statistically learns a mapping between them to construct the lexicon. With the learned lexicon, action recognition can be cast as the problem of finding the maximum translation probability of a sequence of semantic poses given a stream of visual pose candidates. Experiments evaluating pre-trained and zero-shot action recognition conducted on MSRC-12 gesture and WorkoutSu-10 exercise datasets were used to verify the efficacy of the proposed method.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.