Survey on Deep Neural Networks in Speech and Vision Systems
Mahbubul Alam, Manar D. Samad, Lasitha Vidyaratne, Alexander Glandon,, and Khan M. Iftekharuddin

TL;DR
This comprehensive survey reviews recent advances in deep neural network architectures and algorithms for vision and speech systems, emphasizing hardware constraints, industrial efforts, and emerging applications across various disciplines.
Contribution
It provides an extensive overview of state-of-the-art deep learning models, hardware challenges, and future trends in intelligent vision and speech systems, integrating software and hardware perspectives.
Findings
Deep neural networks have significantly advanced vision and speech applications.
Running neural networks efficiently on resource-constrained hardware remains a key challenge.
Emerging applications are expanding the impact of deep learning in diverse fields.
Abstract
This survey presents a review of state-of-the-art deep neural network architectures, algorithms, and systems in vision and speech applications. Recent advances in deep artificial neural network algorithms and architectures have spurred rapid innovation and development of intelligent vision and speech systems. With availability of vast amounts of sensor data and cloud computing for processing and training of deep neural networks, and with increased sophistication in mobile and embedded technology, the next-generation intelligent systems are poised to revolutionize personal and commercial computing. This survey begins by providing background and evolution of some of the most successful deep learning models for intelligent vision and speech systems to date. An overview of large-scale industrial research and development efforts is provided to emphasize future trends and prospects of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
