Event and Activity Recognition in Video Surveillance for Cyber-Physical Systems
Swarnabja Bhaumik, Prithwish Jana, Partha Pratim Mohanta

TL;DR
This paper reviews methods for recognizing events and activities in video surveillance for cyber-physical systems, emphasizing motion patterns, deep learning architectures, and fusion strategies to improve classification accuracy on benchmark datasets.
Contribution
It introduces a hybrid CNN+RNN approach with multi-tier fusion for spatial and temporal features, achieving state-of-the-art results in event recognition.
Findings
Outperforms standard temporal CNNs in event recognition.
Fusion strategy improves classification precision.
Effective on multiple benchmark datasets.
Abstract
This chapter aims to aid the development of Cyber-Physical Systems (CPS) in automated understanding of events and activities in various applications of video-surveillance. These events are mostly captured by drones, CCTVs or novice and unskilled individuals on low-end devices. Being unconstrained, these videos are immensely challenging due to a number of quality factors. We present an extensive account of the various approaches taken to solve the problem over the years. This ranges from methods as early as Structure from Motion (SFM) based approaches to recent solution frameworks involving deep neural networks. We show that the long-term motion patterns alone play a pivotal role in the task of recognizing an event. Consequently each video is significantly represented by a fixed number of key-frames using a graph-based approach. Only the temporal features are exploited using a hybrid…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
