Weakly-Supervised Spatiotemporal Anomaly Detection

Urvi Gianchandani; Praveen Tirupattur; Mubarak Shah

arXiv:2605.13746·cs.CV·May 14, 2026

Weakly-Supervised Spatiotemporal Anomaly Detection

Urvi Gianchandani, Praveen Tirupattur, Mubarak Shah

PDF

TL;DR

This paper introduces a weakly supervised approach for detecting anomalies in videos, leveraging only video-level labels to identify spatiotemporal regions of interest without detailed annotations.

Contribution

It proposes a novel weakly supervised method using multiple instance learning to localize anomalies in both space and time within videos.

Findings

01

Effective detection of anomalies using only video-level labels.

02

Utilization of multiple instance ranking loss for localization.

03

Validated on UCF Crime2Local Dataset with promising results.

Abstract

In this paper, we explore a weakly supervised method for anomaly detection. Since annotating videos is time-consuming, we only look at weak video-level labels during training. This means that given a video, we know that it is either normal or contains an anomaly, but no further annotations are used to train the network. Features are extracted from video clips that are either normal or anomalous. These features are used to determine anomaly scores for spatiotemporal regions of the clips based on a classifier and the implementation of a multiple instance ranking loss (MIL). We represent both anomalous and normal video clips as positive and negative bags, respectively, to apply MIL. Furthermore, since anomalies are usually localized to a part of a frame rather than the whole frame, we chose to explore temporal as well as spatial anomaly detection. We show our results on the UCF Crime2Local…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.