Dynamic Feature Description in Human Action Recognition
Ruoyun Gao, Michael S. Lew, Ling Shao

TL;DR
This paper introduces new feature description methods for human action recognition that enhance the discriminative power of spatial-temporal interest point features in video sequences.
Contribution
It proposes novel description techniques that improve the discriminative ability of features for human action recognition based on interest points and cuboids.
Findings
Enhanced feature descriptors increase recognition accuracy.
Interest point-based descriptions capture structural and informational content.
Proposed methods outperform existing approaches in discriminability.
Abstract
This work aims to present novel description methods for human action recognition. Generally, a video sequence can be represented as a collection of spatial temporal words by detecting space-time interest points and describing the unique features around the detected points (Bag of Words representation). Interest points as well as the cuboids around them are considered informative for feature description in terms of both the structural distribution of interest points and the information content inside the cuboids. Our proposed description approaches are based on this idea and making the feature descriptors more discriminative.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Analysis and Summarization · Multimodal Machine Learning Applications
