Towards the Design of an End-to-End Automated System for Image and Video-based Recognition
Rama Chellappa, Jun-Cheng Chen, Rajeev Ranjan, Swami Sankaranarayanan,, Amit Kumar, Vishal M. Patel, Carlos D. Castillo

TL;DR
This paper reviews the evolution of computer vision and neural networks, and details a deep learning system for end-to-end face recognition, highlighting recent advances and open challenges.
Contribution
It provides a historical overview and presents a novel deep learning system for unconstrained face verification and recognition.
Findings
Deep CNNs have significantly improved object recognition performance.
Availability of large annotated datasets and GPUs has accelerated progress.
Open issues in DCNNs for object recognition are discussed.
Abstract
Over many decades, researchers working in object recognition have longed for an end-to-end automated system that will simply accept 2D or 3D image or videos as inputs and output the labels of objects in the input data. Computer vision methods that use representations derived based on geometric, radiometric and neural considerations and statistical and structural matchers and artificial neural network-based methods where a multi-layer network learns the mapping from inputs to class labels have provided competing approaches for image recognition problems. Over the last four years, methods based on Deep Convolutional Neural Networks (DCNNs) have shown impressive performance improvements on object detection/recognition challenge problems. This has been made possible due to the availability of large annotated data, a better understanding of the non-linear mapping between image and class…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
