Evaluation of Time Series Forecasting Models for Predicting Lung Cancer Mortality Rates in the United States: A Comparison with Altuhaifa (2023) Study
E. Kubuafor, D. Baidoo, O. J. Okeke, R. Amevor, G. Arhin, J. T. Korley

TL;DR
This study compares and extends time series models for predicting US lung cancer mortality rates, demonstrating that ARIMA and Holt's Double Exponential Smoothing provide highly accurate forecasts with potential public health implications.
Contribution
It updates previous models with extended data up to 2021 and introduces an average model combining HDES and ARIMA for improved accuracy.
Findings
ARIMA (0,2,2) and HDES achieve lowest RMSE of 2.56
Extended dataset improves model accuracy and insights
Average HDES-ARIMA model maintains high forecast precision
Abstract
This paper evaluates the performance of the following time series forecasting models - Simple Exponential Smoothing (SES), Holt's Double Exponential Smoothing (HDES), and Autoregressive Integrated Moving Average (ARIMA) - in predicting lung cancer mortality rates in the United States. It builds upon the work of Altuhaifa, which used Surveillance, Epidemiology, and End Results (SEER) data from 1975-2018 to evaluate these models. Altuhaifa's study found that ARIMA (0,2,2), SES with smoothing parameter , and HDES with parameters and were the optimal models from their analysis, with HDES providing the lowest Root Mean Squared Error (RMSE) of 132.91. The paper extends the dataset to 2021 and re-evaluates the models. Using the same SEER data from 1975-2021, it identifies ARIMA (0,2,2), SES (), and HDES (, ) as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
