Dynamic Feature Description in Human Action Recognition

Ruoyun Gao; Michael S. Lew; Ling Shao

arXiv:1101.0234 · cs.HC · January 4, 2011 · 5 cites

Open Access

TL;DR

This paper introduces new feature description methods for human action recognition that enhance the discriminative power of spatial-temporal interest point features in video sequences.

Contribution

It proposes novel description techniques that improve the discriminative ability of features for human action recognition based on interest points and cuboids.

Findings

01 Enhanced feature descriptors increase recognition accuracy.

02 Interest point-based descriptions capture structural and informational content.

03 Proposed methods outperform existing approaches in discriminability.

Abstract

This work presents novel description methods for human action recognition. Generally, a video sequence can be represented as a collection of spatio-temporal words by detecting space-time interest points and describing the unique features around the detected points (the Bag of Words representation). Interest points, together with the cuboids around them, are considered informative for feature description in two respects: the structural distribution of the interest points and the information content inside the cuboids. Our proposed description approaches build on this idea to make the feature descriptors more discriminative.
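The Bag of Words representation mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' method: it assumes cuboid descriptors have already been extracted around the interest points and that a codebook of visual words is already available (for example from k-means clustering, which is an assumption here). The function name `bag_of_words_histogram` and the toy data are hypothetical.

```python
import numpy as np

def bag_of_words_histogram(descriptors, codebook):
    """Map each descriptor to its nearest codeword and return a normalized histogram.

    descriptors: (n, d) array, one row per cuboid descriptor (assumed precomputed).
    codebook:    (k, d) array, one row per visual word (assumed precomputed).
    """
    # Squared Euclidean distance from every descriptor to every codeword.
    diffs = descriptors[:, None, :] - codebook[None, :, :]
    dists = np.sum(diffs ** 2, axis=2)

    # Index of the closest codeword for each descriptor.
    words = np.argmin(dists, axis=1)

    # Count word occurrences, then normalize so the histogram sums to 1.
    hist = np.bincount(words, minlength=codebook.shape[0]).astype(float)
    total = hist.sum()
    return hist / total if total > 0 else hist

# Toy example: three 2-D codewords and four descriptors (illustrative values only).
codebook = np.array([[0.0, 0.0], [1.0, 1.0], [5.0, 5.0]])
descriptors = np.array([[0.1, -0.1], [0.9, 1.2], [1.1, 0.8], [4.8, 5.1]])
hist = bag_of_words_histogram(descriptors, codebook)
print(hist)
```

In this sketch a whole video is reduced to one fixed-length histogram, which discards where the interest points occurred. The structural distribution of interest points that the abstract refers to is exactly the information such a plain histogram leaves out.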

Taxonomy

Topics: Human Pose and Action Recognition · Video Analysis and Summarization · Multimodal Machine Learning Applications