Vision-Language Models Assisted Unsupervised Video Anomaly Detection

Yalong Jiang; Liquan Mao

arXiv:2409.14109·cs.CV·September 27, 2024

Vision-Language Models Assisted Unsupervised Video Anomaly Detection

Yalong Jiang, Liquan Mao

PDF

Open Access

TL;DR

This paper introduces VLAVAD, a novel unsupervised video anomaly detection method that leverages cross-modal pre-trained models, language models, and semantic analysis to improve detection accuracy and interpretability.

Contribution

The paper proposes VLAVAD, integrating large language models and a sequence state space module to enhance unsupervised anomaly detection in videos, addressing prior limitations.

Findings

01

Achieves state-of-the-art performance on ShanghaiTech dataset.

02

Effectively detects elusive and temporally subtle anomalies.

03

Improves interpretability of anomaly detection results.

Abstract

Video anomaly detection is a subject of great interest across industrial and academic domains due to its crucial role in computer vision applications. However, the inherent unpredictability of anomalies and the scarcity of anomaly samples present significant challenges for unsupervised learning methods. To overcome the limitations of unsupervised learning, which stem from a lack of comprehensive prior knowledge about anomalies, we propose VLAVAD (Video-Language Models Assisted Anomaly Detection). Our method employs a cross-modal pre-trained model that leverages the inferential capabilities of large language models (LLMs) in conjunction with a Selective-Prompt Adapter (SPA) for selecting semantic space. Additionally, we introduce a Sequence State Space Module (S3M) that detects temporal inconsistencies in semantic features. By mapping high-dimensional visual features to low-dimensional…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Network Security and Intrusion Detection · Digital Media Forensic Detection

MethodsAdapter