Dominant Codewords Selection with Topic Model for Action Recognition
Hirokatsu Kataoka, Masaki Hayashi, Kenji Iwata, Yutaka Satoh,, Yoshimitsu Aoki, Slobodan Ilic

TL;DR
This paper introduces a novel action recognition framework that leverages dominant codewords and topic modeling to improve the representation of human activities by focusing on motion primitives and reducing noise.
Contribution
It presents a new method combining LDA-based topic modeling with dominant codewords for more accurate human activity recognition.
Findings
Effective in recognizing actions across four datasets
Improves motion primitive representation by eliminating noise
Enhances activity classification accuracy
Abstract
In this paper, we propose a framework for recognizing human activities that uses only in-topic dominant codewords and a mixture of intertopic vectors. Latent Dirichlet allocation (LDA) is used to develop approximations of human motion primitives; these are mid-level representations, and they adaptively integrate dominant vectors when classifying human activities. In LDA topic modeling, action videos (documents) are represented by a bag-of-words (input from a dictionary), and these are based on improved dense trajectories. The output topics correspond to human motion primitives, such as finger moving or subtle leg motion. We eliminate the impurities, such as missed tracking or changing light conditions, in each motion primitive. The assembled vector of motion primitives is an improved representation of the action. We demonstrate our method on four different datasets.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Gait Recognition and Analysis · Video Analysis and Summarization
MethodsLinear Discriminant Analysis
