# Developing count regression techniques for predicting the number of new type 2 diabetes cases in Saudi Arabia

**Authors:** Faten Al-hussein, Laleh Tafakori, Mali Abdollahian, Khalid Al-Shali

PMC · DOI: 10.1371/journal.pone.0341436 · PLOS One · 2026-01-23

## TL;DR

This paper develops a predictive model to estimate new type 2 diabetes cases in Saudi Arabia using count regression techniques and identifies key factors influencing the disease.

## Contribution

The study introduces a reduced count regression model using five key indicators for predicting T2D cases in Saudi Arabia.

## Key findings

- The full Negative Binomial Regression (NBR) model outperformed other models with R² of 0.88 and RMSE of 0.93.
- The reduced NBR model using five key variables (marital status, age, BMI, TC, HDL) also outperformed other reduced models.
- The reduced model was validated as adequate with no significant difference compared to the full model (p = 0.694).

## Abstract

Type 2 diabetes (T2D) is a chronic condition affecting millions globally. A robust predictive model to estimate the number of new cases of T2D can facilitate precise monitoring and effective intervention strategies. This study aims to predict the number of new T2D cases per month in Saudi Arabia and identify the Key Performance Indicators (KPIs) associated with T2D, using count regression models, Poisson Regression (PR), Negative Binomial Regression (NBR), Poisson Inverse Gaussian Regression (PIGR), and Bell Regression (BR). De-identified data from 1,000 patients with T2D in Saudi Arabia were used to develop the models. The performance of the full models, which include recommended Key Performance Indicators (KPIs), is compared using metrics such as the coefficient of determination (R2), root mean squared error (RMSE), mean absolute error (MAE), 10-fold cross-validation (CV-10), Akaike information criterion (AIC), and Bayesian information criterion (BIC). The most significant KPIs identified by the full models were utilized to develop the reduced models. The full NBR model outperformed other models, achieving R² of 0.88, RMSE of 0.93, MAE of 0.69, CV-10 of 1.21, AIC = 873.23, and BIC = 880. The reduced NBR model, focusing solely on the five most influential variables (marital status, age, body mass index (BMI), total cholesterol (TC), and high-density lipoprotein (HDL)), with R² = 0.84, RMSE = 1.10, MAE = 0.86, CV-10 = 1.37, AIC = 899, and BIC = 910, also outperformed other reduced models. The Likelihood Ratio Test (LRT) did not show a significant difference between the full and reduced NBR models (p = 0.694), supporting the adequacy of the reduced model. The proposed reduced model, utilizing only five significant KPIs, can help healthcare providers develop effective, targeted strategies by monitoring a smaller number of KPIs to reduce the rising number of T2D cases in Saudi Arabia.

## Linked entities

- **Diseases:** Type 2 diabetes (MONDO:0005148), T2D (MONDO:0005148)

## Full-text entities

- **Genes:** PIGR (polymeric immunoglobulin receptor) [NCBI Gene 5284], PGR (progesterone receptor) [NCBI Gene 5241] {aka NR3C3, PR}, ARHGEF16 (Rho guanine nucleotide exchange factor 16) [NCBI Gene 27237] {aka GEF16, NBR}, UQCC6 (ubiquinol-cytochrome c reductase complex assembly factor 6) [NCBI Gene 728568] {aka BR, BRAWNIN, C12orf73}, INS (insulin) [NCBI Gene 3630] {aka IDDM, IDDM1, IDDM2, ILPR, IRDN, MODY10}
- **Diseases:** vitamin D deficiency (MESH:D014808), PR (MESH:C537770), T2D (MESH:D003924), hypertension (MESH:D006973), amputation (MESH:C565682), Obesity (MESH:D009765), restricted mobility (MESH:D014086), inflammatory (MESH:D007249), dyslipidemia (MESH:D050171), nutritional insufficiency (MESH:D000309), metabolic dysregulation (MESH:D021081), prediabetes (MESH:D011236), overweight (MESH:D050177), Diabetes (MESH:D003920), metabolic disturbances (MESH:D024821), type 1 diabetes (MESH:D003922), insulin resistance (MESH:D007333)
- **Chemicals:** TC (-), glucose (MESH:D005947), lipid (MESH:D008055), vitamin D (MESH:D014807), TG (MESH:D014280), Cholesterol (MESH:D002784)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12829827/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12829827/full.md

## References

52 references — full list in the complete paper: https://tomesphere.com/paper/PMC12829827/full.md

---
Source: https://tomesphere.com/paper/PMC12829827