Future Video Prediction from a Single Frame for Video Anomaly Detection
Mohammad Baradaran, Robert Bergevin

TL;DR
This paper introduces a novel approach for video anomaly detection by predicting future video frames from a single semantic segmentation map, improving long-term motion modeling and outperforming state-of-the-art methods.
Contribution
The paper proposes a new future video prediction proxy-task using semantic segmentation maps to enhance anomaly detection, addressing long-term motion modeling challenges.
Findings
Effective on benchmark datasets (ShanghaiTech, UCSD-Ped1, UCSD-Ped2)
Outperforms existing prediction-based VAD methods
Improves long-term motion understanding
Abstract
Video anomaly detection (VAD) is an important but challenging task in computer vision. The main challenge rises due to the rarity of training samples to model all anomaly cases. Hence, semi-supervised anomaly detection methods have gotten more attention, since they focus on modeling normals and they detect anomalies by measuring the deviations from normal patterns. Despite impressive advances of these methods in modeling normal motion and appearance, long-term motion modeling has not been effectively explored so far. Inspired by the abilities of the future frame prediction proxy-task, we introduce the task of future video prediction from a single frame, as a novel proxy-task for video anomaly detection. This proxy-task alleviates the challenges of previous methods in learning longer motion patterns. Moreover, we replace the initial and future raw frames with their corresponding semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Artificial Immune Systems Applications · Human Pose and Action Recognition
MethodsAttentive Walk-Aggregating Graph Neural Network · Focus
