Shrinkage Optimized Directed Information using Pictorial Structures for Action Recognition
Xu Chen, Alfred Hero, Silvio Savarese

TL;DR
This paper introduces a novel action recognition framework combining pictorial structures, shrinkage optimized directed information, and Markov Random Fields to model temporal and spatial dependencies, achieving superior accuracy on multiple datasets.
Contribution
The paper presents a new SODA+MRF model that effectively captures directional temporal and spatial dependencies for improved action recognition.
Findings
Outperforms several state-of-the-art methods on multiple datasets
Robust to viewpoint transformations
Accurately detects complex human interactions
Abstract
In this paper, we propose a novel action recognition framework. The method uses pictorial structures and shrinkage optimized directed information assessment (SODA) coupled with Markov Random Fields called SODA+MRF to model the directional temporal dependency and bidirectional spatial dependency. As a variant of mutual information, directional information captures the directional information flow and temporal structure of video sequences across frames. Meanwhile, within each frame, Markov random fields are utilized to model the spatial relations among different parts of a human body and the body parts of different people. The proposed SODA+MRF model is robust to view point transformations and detect complex interactions accurately. We compare the proposed method against several baseline methods to highlight the effectiveness of the SODA+MRF model. We demonstrate that our algorithm has…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Video Analysis and Summarization
