SSMTL++: Revisiting Self-Supervised Multi-Task Learning for Video Anomaly Detection
Antonio Barbalau, Radu Tudor Ionescu, Mariana-Iuliana Georgescu, Jacob, Dueholm, Bharathkumar Ramachandra, Kamal Nasrollahi, Fahad Shahbaz Khan,, Thomas B. Moeslund, Mubarak Shah

TL;DR
This paper enhances the self-supervised multi-task learning framework for video anomaly detection by integrating vision transformers, exploring new self-supervised tasks, and demonstrating improved performance across multiple datasets.
Contribution
The paper introduces SSMTL++ with updated detection methods, vision transformer backbones, and additional self-supervised tasks, achieving state-of-the-art results in video anomaly detection.
Findings
Improved detection of anomalies across datasets.
Vision transformer modules enhance model accuracy.
New self-supervised tasks contribute to performance gains.
Abstract
A self-supervised multi-task learning (SSMTL) framework for video anomaly detection was recently introduced in literature. Due to its highly accurate results, the method attracted the attention of many researchers. In this work, we revisit the self-supervised multi-task learning framework, proposing several updates to the original method. First, we study various detection methods, e.g. based on detecting high-motion regions using optical flow or background subtraction, since we believe the currently used pre-trained YOLOv3 is suboptimal, e.g. objects in motion or objects from unknown classes are never detected. Second, we modernize the 3D convolutional backbone by introducing multi-head self-attention modules, inspired by the recent success of vision transformers. As such, we alternatively introduce both 2D and 3D convolutional vision transformer (CvT) blocks. Third, in our attempt to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Digital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Dense Connections · Layer Normalization · Average Pooling · Convolution · Global Average Pooling · Softmax · k-Means Clustering
