# Type 2 Diabetes in Taiwan: Unmasking Influential Factors Through Advanced Predictive Modeling

**Authors:** Shih-Tsung Chang, Ying-Hsiang Chou, Oswald Ndi Nfor, Ji-Han Zhong, Chien-Ning Huang, Yung-Po Liaw

PMC · DOI: 10.1155/jdr/5531934 · Journal of Diabetes Research · 2025-05-27

## TL;DR

This study uses machine learning to identify key factors influencing Type 2 Diabetes in Taiwan, finding that waist–hip ratio is the most important predictor.

## Contribution

The study introduces advanced predictive modeling to uncover influential factors for T2D in the Taiwanese population.

## Key findings

- WHR was the most influential predictor of T2D with an odds ratio of 1.10.
- Random forest and ensemble classifiers showed the best predictive performance, especially for females.
- BMI's importance varied by gender, being more significant in women than in men.

## Abstract

Background: Type 2 diabetes (T2D) is influenced by lifestyle, genetics, and environmental conditions. By utilizing machine learning techniques, we can enhance the precision of T2D risk prediction by analyzing the complex interactions among these variables. This study was aimed at identifying and predicting key variables linked to T2D within the Taiwanese population.

Methods: The study included 3623 individuals with T2D and 14,492 without. Data on lifestyle and anthropometric measures were obtained from the Taiwan Biobank. Statistical analyses were performed using Base SAS software and SAS Viya.

Results: Traditional models identified body mass index (BMI) and waist–hip ratio (WHR) as significant risk factors for T2D, with odds ratios (OR) of 1.10 (95% confidence interval (CI) 1.09–1.12) and 1.10 (95% CI 1.09–1.11), respectively. These variables remained crucial in predictive models, with the WHR being the most influential. In the overall population, BMI's relative importance was 0.57, differing by gender (0.23 in men and 0.62 in women). While cigarette smoking and certain genetic variants (CDKAL1, SLC30A8, CDKN2B, KCNQ1, HHEX, and TCF7L2) were significant in traditional models, their importance decreased in predictive models.

Conclusions: Among various factors, the WHR emerged as the most critical attribute for T2D, underscoring the complexity of T2D etiology. Overall, the random forest and ensemble classifiers emerge as the most effective models, especially in mixed and female categories, highlighting their robustness in predictive performance.

## Linked entities

- **Genes:** CDKAL1 (CDKAL1 threonylcarbamoyladenosine tRNA methylthiotransferase) [NCBI Gene 54901], SLC30A8 (solute carrier family 30 member 8) [NCBI Gene 169026], CDKN2B (cyclin dependent kinase inhibitor 2B) [NCBI Gene 1030], KCNQ1 (potassium voltage-gated channel subfamily Q member 1) [NCBI Gene 3784], HHEX (hematopoietically expressed homeobox) [NCBI Gene 3087], TCF7L2 (transcription factor 7 like 2) [NCBI Gene 6934]
- **Diseases:** Type 2 diabetes (MONDO:0005148)

## Full-text entities

- **Genes:** KCNQ1 (potassium voltage-gated channel subfamily Q member 1) [NCBI Gene 3784] {aka ATFB1, ATFB3, JLNS1, KCNA8, KCNA9, KVLQT1}, CDKAL1 (CDKAL1 threonylcarbamoyladenosine tRNA methylthiotransferase) [NCBI Gene 54901], SLC30A8 (solute carrier family 30 member 8) [NCBI Gene 169026] {aka ZNT8, ZnT-8}, CDKN2B (cyclin dependent kinase inhibitor 2B) [NCBI Gene 1030] {aka CDK4I, INK4B, MTS2, P15, TP15, p15INK4b}, TCF7L2 (transcription factor 7 like 2) [NCBI Gene 6934] {aka TCF-4, TCF4}, HHEX (hematopoietically expressed homeobox) [NCBI Gene 3087] {aka HEX, HMPH, HOX11L-PEN, PRH, PRHX}
- **Diseases:** T2D (MESH:D003924)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12133368/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12133368/full.md

## References

33 references — full list in the complete paper: https://tomesphere.com/paper/PMC12133368/full.md

---
Source: https://tomesphere.com/paper/PMC12133368