Hybrid Architecture for Real-Time Video Anomaly Detection: Integrating Spatial and Temporal Analysis
Fabien Poirier

TL;DR
This paper introduces a hybrid architecture combining spatial and temporal models for real-time video anomaly detection, inspired by human behavior, and evaluates different configurations for improved accuracy.
Contribution
It presents a novel hybrid model integrating YOLOv7 and a CNN+GRU for enhanced real-time video anomaly detection, with comparative analysis of architectural setups.
Findings
Hybrid approach improves detection accuracy
Parallel and series configurations evaluated
Effective integration of spatial and temporal analysis
Abstract
In this paper, we propose a new architecture for real-time anomaly detection in video data, inspired by human behavior combining spatial and temporal analyses. This approach uses two distinct models: (i) for temporal analysis, a recurrent convolutional network (CNN + RNN) is employed, associating VGG19 and a GRU to process video sequences; (ii) regarding spatial analysis, it is performed using YOLOv7 to analyze individual images. These two analyses can be carried out either in parallel, with a final prediction that combines the results of both analysis, or in series, where the spatial analysis enriches the data before the temporal analysis. Some experimentations are been made to compare these two architectural configurations with each other, and evaluate the effectiveness of our hybrid approach in video anomaly detection.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Artificial Immune Systems Applications · Network Security and Intrusion Detection
MethodsGated Recurrent Unit
