Detecting Violence in Video using Subclasses
Xirong Li, Yujia Huo, Jieping Xu, Qin Jin

TL;DR
This paper introduces a subclass-based approach for violence detection in videos, leveraging fine-grained annotations to improve fusion of multi-modal features and outperform existing methods.
Contribution
It presents a novel subclass annotation method that enhances violence detection accuracy and generalization without requiring fine-grained test set labels.
Findings
Achieved AP of 0.303 and P100 of 0.55 on MediaEval 2015 dataset.
Motion features are not essential for violence detection.
Subclass annotations improve fusion and detection performance.
Abstract
This paper attacks the challenging problem of violence detection in videos. Different from existing works focusing on combining multi-modal features, we go one step further by adding and exploiting subclasses visually related to violence. We enrich the MediaEval 2015 violence dataset by \emph{manually} labeling violence videos with respect to the subclasses. Such fine-grained annotations not only help understand what have impeded previous efforts on learning to fuse the multi-modal features, but also enhance the generalization ability of the learned fusion to novel test data. The new subclass based solution, with AP of 0.303 and P100 of 0.55 on the MediaEval 2015 test set, outperforms several state-of-the-art alternatives. Notice that our solution does not require fine-grained annotations on the test set, so it can be directly applied on novel and fully unlabeled videos. Interestingly,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Video Surveillance and Tracking Methods
