Spatio-Temporal-based Context Fusion for Video Anomaly Detection
Chao Hu, Weibin Qiu, Weijie Wu, Liqiang Zhu

TL;DR
This paper introduces a spatio-temporal context fusion method for video anomaly detection, leveraging both spatial relationships and motion features to improve detection accuracy, achieving high AUC scores on benchmark datasets.
Contribution
The paper proposes a novel spatio-temporal dual-stream network that fuses spatial and temporal context for more accurate video anomaly detection.
Findings
Achieves 98.5% AUC on UCSDped2 dataset
Improves detection by 5.1% over temporal stream
Spatial context encoding enhances AUC by 1%
Abstract
Video anomaly detection aims to discover abnormal events in videos, and the principal objects are target objects such as people and vehicles. Each target in the video data has rich spatio-temporal context information. Most existing methods only focus on the temporal context, ignoring the role of the spatial context in anomaly detection. The spatial context information represents the relationship between the detection target and surrounding targets. Anomaly detection makes a lot of sense. To this end, a video anomaly detection algorithm based on target spatio-temporal context fusion is proposed. Firstly, the target in the video frame is extracted through the target detection network to reduce background interference. Then the optical flow map of two adjacent frames is calculated. Motion features are used multiple targets in the video frame to construct spatial context simultaneously,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Video Surveillance and Tracking Methods · Network Security and Intrusion Detection
