Mixture of Experts Guided by Gaussian Splatters Matters: A new Approach to Weakly-Supervised Video Anomaly Detection
Giacomo D'Amicantonio, Snehashis Majhi, Quan Kong, Lorenzo Garattoni, Gianpiero Francesca, Fran\c{c}ois Bremond, Egor Bondarev

TL;DR
This paper introduces GS-MoE, a novel weakly-supervised video anomaly detection framework that employs specialized experts guided by Gaussian splatting to improve detection of complex anomalies.
Contribution
The paper proposes Gaussian Splatting-guided Mixture of Experts (GS-MoE), a new approach that leverages category-specific experts and temporal guidance to enhance weakly-supervised VAD performance.
Findings
Achieves 91.58% AUC on UCF-Crime dataset.
Outperforms existing methods on XD-Violence and MSAD datasets.
Demonstrates the effectiveness of category-specific experts and temporal guidance.
Abstract
Video Anomaly Detection (VAD) is a challenging task due to the variability of anomalous events and the limited availability of labeled data. Under the Weakly-Supervised VAD (WSVAD) paradigm, only video-level labels are provided during training, while predictions are made at the frame level. Although state-of-the-art models perform well on simple anomalies (e.g., explosions), they struggle with complex real-world events (e.g., shoplifting). This difficulty stems from two key issues: (1) the inability of current models to address the diversity of anomaly types, as they process all categories with a shared model, overlooking category-specific features; and (2) the weak supervision signal, which lacks precise temporal information, limiting the ability to capture nuanced anomalous patterns blended with normal events. To address these challenges, we propose Gaussian Splatting-guided Mixture…
Peer Reviews
Decision·Submitted to ICLR 2025
Innovation: This paper proposes a new framework for weakly-supervised video anomaly detection framework that combines Gaussian splatter loss and mixture-of-expert architecture. The idea has some novelty in the field of video anomaly detection. Performance: Experimental results show that the proposed GS-MoE achieves better performances than existing SOTA methods on both UCF Crime and XD Violence datasets with significant improvements, especially in handling complex abnormal events. Presentation:
My concern about this paper mainly focuses on the computational complexity due to the use of a mixture-of-expert architecture, where the authors assign an expert for each anomaly. It seems that the proposed approach may consume a significant amount of resources. I suggest the authors discuss on the inference speed of the proposed model (such as FPS) as well in Table 1, so that the advantages and disadvantages of the model can be better illustrated.
1. The paper is well-written and clearly structured, making the ideas easy to follow. 2. The proposed GS-MoE framework is well-motivated, effectively addressing key limitations in weakly-supervised video anomaly detection. It introduces Temporal Gaussian Splatting (TGS) to reduce over-dependency on the most abnormal snippets, allowing the model to capture subtle temporal patterns across a broader range of anomaly cues. The Mixture of Experts (MoE) architecture further enhances performance by le
1. In Table 3, there seems to be a labeling error. The column labeled "With skip connect" should likely be labeled "With task-aware features"
Pros: 1. Usage of Gaussian kernels extracted from the estimated abnormal scores to generate complete representation 2. Splatting the kernels along the temporal dimension for modeling anomalous temporal dependencies 3. Dedicated class-expert models focus on individual anomaly types 4. Improvements on benchmarks.
Cons: 1. Some inappropriate expressisons and lack in sufficient literature review 2. Each expert is trained only on refined features belonging to its assigned class and to the normal class. Why different classes are pre-defined like this, what if different classes are coupled? Anomalies are unexpected, it is difficult to define classes. The mixture of experts assumes that different anomaly types can be isolated effectively. However, in real-world applications, anomalies might not always fi
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Artificial Immune Systems Applications · Advanced Statistical Methods and Models
