A Comprehensive Overview and Comparative Analysis on Deep Learning Models: CNN, RNN, LSTM, GRU
Farhad Mortezapour Shiri, Thinagaran Perumal, Norwati Mustapha,, Raihani Mohamed

TL;DR
This paper provides a comprehensive survey and comparative analysis of various deep learning models, examining their structures, applications, and performance across multiple datasets and tasks.
Contribution
It offers a detailed comparison of multiple deep learning architectures, including recent models like TCN and Transformer, on diverse datasets for different applications.
Findings
Transformers outperform traditional RNNs on text datasets.
ResNet and DenseNet excel in image classification tasks.
Bidirectional models improve performance in sequential data analysis.
Abstract
Deep learning (DL) has emerged as a powerful subset of machine learning (ML) and artificial intelligence (AI), outperforming traditional ML methods, especially in handling unstructured and large datasets. Its impact spans across various domains, including speech recognition, healthcare, autonomous vehicles, cybersecurity, predictive analytics, and more. However, the complexity and dynamic nature of real-world problems present challenges in designing effective deep learning models. Consequently, several deep learning models have been developed to address different problems and applications. In this article, we conduct a comprehensive survey of various deep learning models, including Convolutional Neural Network (CNN), Recurrent Neural Network (RNN), Temporal Convolutional Networks (TCN), Transformer, Kolmogorov-Arnold networks (KAN), Generative Models, Deep Reinforcement Learning (DRL),…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Anomaly Detection Techniques and Applications · Traffic Prediction and Management Techniques
MethodsTanh Activation · Sigmoid Activation · Gated Recurrent Unit · Long Short-Term Memory
