Cross-Modal Fusion and Attention Mechanism for Weakly Supervised Video Anomaly Detection
Ayush Ghadiya, Purbayan Kar, Vishal Chudasama, Pankaj Wasnik

TL;DR
This paper presents a novel multi-modal framework for weakly supervised video anomaly detection, utilizing a cross-modal fusion adapter and hierarchical graph attention to improve accuracy in identifying anomalies like violence and nudity.
Contribution
It introduces the Cross-modal Fusion Adapter and Hyperbolic Lorentzian Graph Attention, advancing multi-modal fusion and hierarchical feature modeling in weakly supervised video anomaly detection.
Findings
Achieves state-of-the-art results on benchmark datasets.
Effectively distinguishes between normal and abnormal features.
Enhances audio-visual feature relevance and separation.
Abstract
Recently, weakly supervised video anomaly detection (WS-VAD) has emerged as a contemporary research direction to identify anomaly events like violence and nudity in videos using only video-level labels. However, this task has substantial challenges, including addressing imbalanced modality information and consistently distinguishing between normal and abnormal features. In this paper, we address these challenges and propose a multi-modal WS-VAD framework to accurately detect anomalies such as violence and nudity. Within the proposed framework, we introduce a new fusion mechanism known as the Cross-modal Fusion Adapter (CFA), which dynamically selects and enhances highly relevant audio-visual features in relation to the visual modality. Additionally, we introduce a Hyperbolic Lorentzian Graph Attention (HLGAtt) to effectively capture the hierarchical relationships between normal and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Network Security and Intrusion Detection · Artificial Immune Systems Applications
MethodsSoftmax · Attention Is All You Need · Adapter
