Discriminative Video Representation Learning Using Support Vector Classifiers
Jue Wang, Anoop Cherian

TL;DR
This paper introduces a discriminative pooling method for video action recognition that identifies and emphasizes the most relevant features using a support vector machine framework, achieving state-of-the-art results.
Contribution
It proposes a novel discriminative pooling technique based on SVMs and multiple instance learning, improving video representation for action recognition.
Findings
Achieved state-of-the-art performance on eight benchmark datasets.
Demonstrated effectiveness of discriminative pooling over traditional methods.
End-to-end trainable within deep learning frameworks.
Abstract
Most popular deep models for action recognition in videos generate independent predictions for short clips, which are then pooled heuristically to assign an action label to the full video segment. As not all frames may characterize the underlying action---many are common across multiple actions---pooling schemes that impose equal importance on all frames might be unfavorable. In an attempt to tackle this problem, we propose discriminative pooling, based on the notion that among the deep features generated on all short clips, there is at least one that characterizes the action. To identify these useful features, we resort to a negative bag consisting of features that are known to be irrelevant, for example, they are sampled either from datasets that are unrelated to our actions of interest or are CNN features produced via random noise as input. With the features from the video as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Multimodal Machine Learning Applications · Anomaly Detection Techniques and Applications
MethodsAverage Pooling
