Cross Pseudo Labeling For Weakly Supervised Video Anomaly Detection
Dayeon Lee, Donghyeong Kim, Chaewon Park, Sungmin Woo, Sangyoun Lee

TL;DR
This paper introduces CPL-VAD, a dual-branch framework using cross pseudo labeling for weakly supervised video anomaly detection, combining temporal localization and semantic recognition to improve performance.
Contribution
The paper presents a novel dual-branch framework with cross pseudo labeling that enhances weakly supervised video anomaly detection by integrating temporal and semantic information.
Findings
Achieves state-of-the-art results on XD-Violence and UCF-Crime datasets.
Effectively combines temporal localization with semantic recognition.
Demonstrates superior performance in anomaly detection and category classification.
Abstract
Weakly supervised video anomaly detection aims to detect anomalies and identify abnormal categories with only video-level labels. We propose CPL-VAD, a dual-branch framework with cross pseudo labeling. The binary anomaly detection branch focuses on snippet-level anomaly localization, while the category classification branch leverages vision-language alignment to recognize abnormal event categories. By exchanging pseudo labels, the two branches transfer complementary strengths, combining temporal precision with semantic discrimination. Experiments on XD-Violence and UCF-Crime demonstrate that CPL-VAD achieves state-of-the-art performance in both anomaly detection and abnormal category classification.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Human Pose and Action Recognition · Video Analysis and Summarization
