Deep Transformer Model with Pre-Layer Normalization for COVID-19 Growth   Prediction

Rizki Ramadhan Fitra; Novanto Yudistira; Wayan Firdaus Mahmudy

arXiv:2207.06356·cs.LG·July 14, 2022

Deep Transformer Model with Pre-Layer Normalization for COVID-19 Growth Prediction

Rizki Ramadhan Fitra, Novanto Yudistira, Wayan Firdaus Mahmudy

PDF

Open Access

TL;DR

This paper introduces a Deep Transformer model with pre-layer normalization for predicting COVID-19 case growth, demonstrating superior accuracy over LSTM and RNN models using MAPE as the evaluation metric.

Contribution

The study proposes a novel Deep Transformer architecture with pre-layer normalization for COVID-19 growth prediction, optimizing hyperparameters and outperforming traditional RNN-based models.

Findings

01

Deep Transformer with pre-layer normalization achieves MAPE of 18.83.

02

The Adamax optimizer yields the best performance.

03

Deep Transformer outperforms LSTM and RNN models.

Abstract

Coronavirus disease or COVID-19 is an infectious disease caused by the SARS-CoV-2 virus. The first confirmed case caused by this virus was found at the end of December 2019 in Wuhan City, China. This case then spread throughout the world, including Indonesia. Therefore, the COVID-19 case was designated as a global pandemic by WHO. The growth of COVID-19 cases, especially in Indonesia, can be predicted using several approaches, such as the Deep Neural Network (DNN). One of the DNN models that can be used is Deep Transformer which can predict time series. The model is trained with several test scenarios to get the best model. The evaluation is finding the best hyperparameters. Then, further evaluation was carried out using the best hyperparameters setting of the number of prediction days, the optimizer, the number of features, and comparison with the former models of the Long Short-Term…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Mining and Machine Learning Applications · Edcuational Technology Systems · Computer Science and Engineering

MethodsMulti-Head Attention · Attention Is All You Need · Test · Linear Layer · Softmax · Dense Connections · Absolute Position Encodings · Dropout · Byte Pair Encoding · Position-Wise Feed-Forward Layer