Evaluation of Time Series Forecasting Models for Predicting Lung Cancer Mortality Rates in the United States: A Comparison with Altuhaifa (2023) Study

E. Kubuafor; D. Baidoo; O. J. Okeke; R. Amevor; G. Arhin; J. T. Korley

arXiv:2508.16052·stat.AP·August 25, 2025

Evaluation of Time Series Forecasting Models for Predicting Lung Cancer Mortality Rates in the United States: A Comparison with Altuhaifa (2023) Study

E. Kubuafor, D. Baidoo, O. J. Okeke, R. Amevor, G. Arhin, J. T. Korley

PDF

TL;DR

This study compares and extends time series models for predicting US lung cancer mortality rates, demonstrating that ARIMA and Holt's Double Exponential Smoothing provide highly accurate forecasts with potential public health implications.

Contribution

It updates previous models with extended data up to 2021 and introduces an average model combining HDES and ARIMA for improved accuracy.

Findings

01

ARIMA (0,2,2) and HDES achieve lowest RMSE of 2.56

02

Extended dataset improves model accuracy and insights

03

Average HDES-ARIMA model maintains high forecast precision

Abstract

This paper evaluates the performance of the following time series forecasting models - Simple Exponential Smoothing (SES), Holt's Double Exponential Smoothing (HDES), and Autoregressive Integrated Moving Average (ARIMA) - in predicting lung cancer mortality rates in the United States. It builds upon the work of Altuhaifa, which used Surveillance, Epidemiology, and End Results (SEER) data from 1975-2018 to evaluate these models. Altuhaifa's study found that ARIMA (0,2,2), SES with smoothing parameter $α = 0.995$ , and HDES with parameters $α = 0.4$ and $β = 0.9$ were the optimal models from their analysis, with HDES providing the lowest Root Mean Squared Error (RMSE) of 132.91. The paper extends the dataset to 2021 and re-evaluates the models. Using the same SEER data from 1975-2021, it identifies ARIMA (0,2,2), SES ( $α = 0.999$ ), and HDES ( $α = 0.5221$ , $β = 0.5219$ ) as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.