Covariance of Motion and Appearance Featuresfor Spatio Temporal Recognition Tasks
Subhabrata Bhattacharya, Nasim Souly, Mubarak Shah

TL;DR
This paper presents a novel covariance-based video analysis framework that combines motion and appearance features for improved spatio-temporal recognition, applicable to event and gesture recognition tasks.
Contribution
The paper introduces a new covariance descriptor for video analysis, integrating motion and appearance features, and formulates sparse recognition as a determinant maximization problem on Riemannian manifolds.
Findings
Effective in low-level event recognition in unconstrained scenarios
Successful in gesture recognition with one-shot learning
Provides promising results for large-scale video analysis
Abstract
In this paper, we introduce an end-to-end framework for video analysis focused towards practical scenarios built on theoretical foundations from sparse representation, including a novel descriptor for general purpose video analysis. In our approach, we compute kinematic features from optical flow and first and second-order derivatives of intensities to represent motion and appearance respectively. These features are then used to construct covariance matrices which capture joint statistics of both low-level motion and appearance features extracted from a video. Using an over-complete dictionary of the covariance based descriptors built from labeled training samples, we formulate low-level event recognition as a sparse linear approximation problem. Within this, we pose the sparse decomposition of a covariance matrix, which also conforms to the space of semi-positive definite matrices, as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Advanced Vision and Imaging · Video Surveillance and Tracking Methods
