# Construction and validation of a risk prediction model for metabolic syndrome: a cross-sectional study based on randomized sampling

**Authors:** Jiannan Zhao, Xinhua An, Ling Liu, Jia Meng, Liyong Liu, Yongliang Mu

PMC · DOI: 10.3389/fendo.2025.1761342 · 2026-01-16

## TL;DR

This study builds a model to predict metabolic syndrome risk in Chinese adults, showing that factors like age, BMI, and occupation are key predictors.

## Contribution

A novel column-line graph prediction model for metabolic syndrome risk with validated discriminative and predictive performance.

## Key findings

- The model achieved AUCs of 0.815 (training) and 0.787 (validation), showing strong discrimination.
- BMI and age were identified as the most significant risk factors for metabolic syndrome.
- The model's calibration improved after Platt Scaling, with good fit confirmed by statistical tests.

## Abstract

The prevalence of metabolic syndrome is high among Chinese residents, and it is crucial to understand the current situation and intervene promptly. In this study, we investigated the current status of metabolic syndrome in some regions of China, analyzed related risk factors, and developed a risk prediction model to guide preventive measures.

A multistage stratified cluster random sampling method was used to select 3541 permanent residents aged 18-79 years from a district in Beijing for face-to-face questionnaire surveys, physical examinations, and laboratory tests. All participants were randomly divided into training and validation sets. Correlation analysis and multivariate logistic regression were employed to identify risk factors for metabolic syndrome, and a column-line graph prediction model was developed. The discriminative ability and predictive accuracy of the model were assessed by receiver operating characteristic (ROC) curve and calibration curves.

The prevalence of metabolic syndrome in this study was 18.4%. The results of multivariate logistic regression analysis showed that increasing age, being male (OR = 1.827), being overweight (OR = 4.865), being obese (OR = 11.482), hazardous alcohol consumption (OR = 1.673), marital/cohabitation history, and specific occupations (agriculture, forestry, fisheries, and water production, and unemployed) were independent risk factors for metabolic syndrome (P < 0.05). The column-line graph prediction model, constructed accordingly, performed well, and the model indicated that BMI and age were the most significant risk factors for metabolic syndrome. The results of model validation showed that the AUCs of the training and validation sets were 0.815 (95% CI: 0.795-0.836) and 0.787 (95% CI: 0.756-0.818), respectively, indicating that the model performed well in discriminating. The model exhibited a good fit on the training set (with 1000 resamples via the Bootstrap method; calibration slope = 1.002, calibration intercept = 0.005; Hosmer-Lemeshow test, P = 0.127 > 0.05). The initial validation set showed signs of slight overfitting, and after probability calibration using the Platt Scaling method, the model’s calibration performance was significantly improved (calibration slope = 0.92, calibration intercept = -0.03; Hosmer-Lemeshow test, P = 0.082 > 0.05). The Brier scores of the training set and validation set were 0.120 and 0.136, respectively, with mean absolute errors (MAE) of 0.240 and 0.256, indicating that the model had favorable predictive accuracy and stability.

The metabolic syndrome column-line diagram risk prediction model constructed in this study, based on multivariate logistic regression analysis, has good discriminative ability and high prediction accuracy. The model shows that a large proportion of the current risk factors for metabolic syndrome are modifiable, and that the risk of metabolic syndrome in high-risk groups, such as the elderly, men, people with marital/cohabitation histories, and people with specific occupations, can be reduced through behavioral and lifestyle interventions. This model can provide a scientific basis for the early identification of high-risk groups for metabolic syndrome and has an important guiding value for targeted preventive interventions.

## Linked entities

- **Diseases:** metabolic syndrome (MONDO:0000816)
- **Species:** Homo sapiens (taxon 9606)

## Full-text entities

- **Diseases:** obese (MESH:D009765), metabolic syndrome (MESH:D024821), overweight (MESH:D050177)
- **Chemicals:** alcohol (MESH:D000438)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12855082/full.md

---
Source: https://tomesphere.com/paper/PMC12855082