A comprehensive overview of deep learning models for object detection from videos/images
Sukana Zulfqar, Sadia Saeed, M. Azam Zia, Anjum Ali, Faisal Mehmood, Abid Ali

TL;DR
This paper provides a comprehensive review of deep learning techniques for object detection in videos and images, emphasizing architectural innovations, generative models, and temporal information to improve robustness and accuracy.
Contribution
It classifies and evaluates modern deep learning methods based on architectures, data strategies, and surveillance challenges, offering insights into current effectiveness and future trends.
Findings
CNN-based detectors are effective in various scenarios.
Generative models aid in reconstructing missing data and reducing occlusions.
Temporal fusion improves detection robustness in dynamic environments.
Abstract
Object detection in video and image surveillance is a well-established yet rapidly evolving task, strongly influenced by recent deep learning advancements. This review summarises modern techniques by examining architectural innovations, generative model integration, and the use of temporal information to enhance robustness and accuracy. Unlike earlier surveys, it classifies methods based on core architectures, data processing strategies, and surveillance specific challenges such as dynamic environments, occlusions, lighting variations, and real-time requirements. The primary goal is to evaluate the current effectiveness of semantic object detection, while secondary aims include analysing deep learning models and their practical applications. The review covers CNN-based detectors, GAN-assisted approaches, and temporal fusion methods, highlighting how generative models support tasks such…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Advanced Data and IoT Technologies
