Dual Memory Units with Uncertainty Regulation for Weakly Supervised Video Anomaly Detection
Hang Zhou, Junqing Yu, Wei Yang

TL;DR
This paper introduces UR-DMU, a novel model for weakly supervised video anomaly detection that learns normal and abnormal representations using dual memory banks and uncertainty regulation, improving detection accuracy.
Contribution
The paper proposes an Uncertainty Regulated Dual Memory Units model with a GL-MHSA module and separate memory banks for normal and abnormal data, enhancing discriminative feature learning.
Findings
Outperforms state-of-the-art on XD-Violence and UCF-Crime datasets.
Effectively separates normal and abnormal prototypes.
Robust to noise from scene changes and camera switching.
Abstract
Learning discriminative features for effectively separating abnormal events from normality is crucial for weakly supervised video anomaly detection (WS-VAD) tasks. Existing approaches, both video and segment-level label oriented, mainly focus on extracting representations for anomaly data while neglecting the implication of normal data. We observe that such a scheme is sub-optimal, i.e., for better distinguishing anomaly one needs to understand what is a normal state, and may yield a higher false alarm rate. To address this issue, we propose an Uncertainty Regulated Dual Memory Units (UR-DMU) model to learn both the representations of normal data and discriminative features of abnormal data. To be specific, inspired by the traditional global and local structure on graph convolutional networks, we introduce a Global and Local Multi-Head Self Attention (GL-MHSA) module for the Transformer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Network Security and Intrusion Detection · Artificial Immune Systems Applications
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Dropout · Layer Normalization · Dense Connections · Position-Wise Feed-Forward Layer · Adam · Label Smoothing
