Rethinking Metrics and Benchmarks of Video Anomaly Detection
Zihao Liu, Xiaoyu Wu, Wenna Li, Linlin Yang, Shengjin Wang

TL;DR
This paper critically examines current evaluation metrics and benchmarks in Video Anomaly Detection, proposing new methods to address biases, reward early detection, and evaluate scene overfitting, thereby advancing the assessment standards in VAD research.
Contribution
It introduces three novel evaluation metrics and two benchmarks to improve the assessment of VAD models, focusing on bias mitigation, early detection, and overfitting evaluation.
Findings
Prob-AUC/AP reduces annotation bias effects.
LaAP rewards early and accurate detection.
New benchmarks evaluate scene overfitting effectively.
Abstract
Video Anomaly Detection (VAD), which aims to detect anomalies that deviate from expectation, has attracted increasing attention in recent years. Existing advancements in VAD primarily focus on model architectures and training strategies, while devoting insufficient attention to evaluation metrics and benchmarks. In this paper, we rethink VAD evaluation methods through comprehensive analyses, revealing three critical limitations in current practices: 1) existing metrics are significantly influenced by single annotation bias; 2) current metrics fail to reward early detection of anomalies; 3) available benchmarks lack the capability to evaluate scene overfitting of fully/weakly-supervised algorithms. To address these limitations, we propose three novel evaluation methods: first, we establish probabilistic AUC/AP (Prob-AUC/AP) metrics utlizing multi-round annotations to mitigate single…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Network Security and Intrusion Detection · Generative Adversarial Networks and Image Synthesis
MethodsSoftmax · Attention Is All You Need · Focus
