Deep Learning Architectures for Face Recognition in Video Surveillance
Saman Bashbaghi, Eric Granger, Robert Sabourin, Mostafa Parchami

TL;DR
This paper reviews recent deep learning architectures for face recognition in video surveillance, focusing on methods that handle variations in uncontrolled environments using CNNs and autoencoders.
Contribution
It provides a comprehensive overview and comparison of recent deep learning models for still-to-video face recognition in surveillance scenarios.
Findings
Deep CNNs with triplet-loss improve face recognition accuracy.
Supervised autoencoders offer efficient face representation.
Comparative analysis highlights trade-offs in accuracy and complexity.
Abstract
Face recognition (FR) systems for video surveillance (VS) applications attempt to accurately detect the presence of target individuals over a distributed network of cameras. In video-based FR systems, facial models of target individuals are designed a priori during enrollment using a limited number of reference still images or video data. These facial models are not typically representative of faces being observed during operations due to large variations in illumination, pose, scale, occlusion, blur, and to camera inter-operability. Specifically, in still-to-video FR application, a single high-quality reference still image captured with still camera under controlled conditions is employed to generate a facial model to be matched later against lower-quality faces captured with video cameras under uncontrolled conditions. Current video-based FR systems can perform well on controlled…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Biometric Identification and Security · Video Surveillance and Tracking Methods
