Anomaly Locality in Video Surveillance
Federico Landi, Cees G. M. Snoek, Rita Cucchiara

TL;DR
This paper investigates the importance of locality in video anomaly detection by using spatiotemporal tubes, introducing a new annotated dataset, and demonstrating improved detection performance and robustness over whole-frame methods.
Contribution
It introduces the first dataset with bounding box supervision for anomaly detection and shows that using spatiotemporal tubes enhances detection accuracy and robustness.
Findings
Spatiotemporal tubes improve anomaly detection performance.
Locality is robust to errors in tube extraction.
Network can generate proposals using only video-level labels.
Abstract
This paper strives for the detection of real-world anomalies such as burglaries and assaults in surveillance videos. Although anomalies are generally local, as they happen in a limited portion of the frame, none of the previous works on the subject has ever studied the contribution of locality. In this work, we explore the impact of considering spatiotemporal tubes instead of whole-frame video segments. For this purpose, we enrich existing surveillance videos with spatial and temporal annotations: it is the first dataset for anomaly detection with bounding box supervision in both its train and test set. Our experiments show that a network trained with spatiotemporal tubes performs better than its analogous model trained with whole-frame videos. In addition, we discover that the locality is robust to different kinds of errors in the tube extraction phase at test time. Finally, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Network Security and Intrusion Detection · Human Pose and Action Recognition
