A Unified Reasoning Framework for Holistic Zero-Shot Video Anomaly Analysis
Dongheng Lin, Mengxue Qu, Kunyang Han, Jianbo Jiao, Xiaojie Jin, Yunchao Wei

TL;DR
This paper introduces a unified, zero-shot reasoning framework that integrates temporal detection, spatial localization, and semantic explanation for video anomaly analysis, enhancing interpretability and generalization without additional training.
Contribution
It presents a novel chained reasoning approach that connects multiple tasks in a zero-shot setting, improving holistic anomaly understanding in videos.
Findings
Achieves state-of-the-art zero-shot performance on multiple benchmarks.
Enhances interpretability through integrated spatial and semantic explanations.
Demonstrates effective reasoning power of foundation models in video analysis.
Abstract
Most video-anomaly research stops at frame-wise detection, offering little insight into why an event is abnormal, typically outputting only frame-wise anomaly scores without spatial or semantic context. Recent video anomaly localization and video anomaly understanding methods improve explainability but remain data-dependent and task-specific. We propose a unified reasoning framework that bridges the gap between temporal detection, spatial localization, and textual explanation. Our approach is built upon a chained test-time reasoning process that sequentially connects these tasks, enabling holistic zero-shot anomaly analysis without any additional training. Specifically, our approach leverages intra-task reasoning to refine temporal detections and inter-task chaining for spatial and semantic understanding, yielding improved interpretability and generalization in a fully zero-shot manner.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Human Pose and Action Recognition · Video Analysis and Summarization
