Deep Discriminative Model for Video Classification
Mohammad Tavakolian, Abdenour Hadid

TL;DR
This paper introduces a novel deep learning framework for video scene classification that combines unsupervised pre-training, sparse spatiotemporal feature extraction, and class-specific modeling to improve accuracy across diverse datasets.
Contribution
It proposes a Heterogeneous Deep Discriminative Model with layer-wise pre-training and a new sparse pattern representation for effective video classification.
Findings
Achieves superior accuracy on multiple datasets
Outperforms state-of-the-art methods consistently
Demonstrates robustness across diverse video types
Abstract
This paper presents a new deep learning approach for video-based scene classification. We design a Heterogeneous Deep Discriminative Model (HDDM) whose parameters are initialized by performing an unsupervised pre-training in a layer-wise fashion using Gaussian Restricted Boltzmann Machines (GRBM). In order to avoid the redundancy of adjacent frames, we extract spatiotemporal variation patterns within frames and represent them sparsely using Sparse Cubic Symmetrical Pattern (SCSP). Then, a pre-initialized HDDM is separately trained using the videos of each class to learn class-specific models. According to the minimum reconstruction error from the learnt class-specific models, a weighted voting strategy is employed for the classification. The performance of the proposed method is extensively evaluated on two action recognition datasets; UCF101 and Hollywood II, and three dynamic texture…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Anomaly Detection Techniques and Applications
