TL;DR
This paper introduces a novel multi-task learning framework for video anomaly detection that combines self-supervised tasks and knowledge distillation to improve detection accuracy without requiring anomalous training data.
Contribution
It is the first to formulate video anomaly detection as a multi-task learning problem integrating multiple self-supervised and distillation tasks in a single architecture.
Findings
Outperforms state-of-the-art on Avenue, ShanghaiTech, and UCSD Ped2 datasets.
Demonstrates the effectiveness of combining self-supervised and distillation tasks.
Ablation study confirms the importance of multi-task learning approach.
Abstract
Anomaly detection in video is a challenging computer vision problem. Due to the lack of anomalous events at training time, anomaly detection requires the design of learning methods without full supervision. In this paper, we approach anomalous event detection in video through self-supervised and multi-task learning at the object level. We first utilize a pre-trained detector to detect objects. Then, we train a 3D convolutional neural network to produce discriminative anomaly-specific information by jointly learning multiple proxy tasks: three self-supervised and one based on knowledge distillation. The self-supervised tasks are: (i) discrimination of forward/backward moving objects (arrow of time), (ii) discrimination of objects in consecutive/intermittent frames (motion irregularity) and (iii) reconstruction of object-specific appearance information. The knowledge distillation task…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsKnowledge Distillation
