# Radiomic features and carotid stenosis in periodontitis a two stage bootstrap and multimodal machine learning study

**Authors:** Mengqiang Zhang, Jing Cai, Qian Cao, Zhipeng Chen, Subinuer Maimaitiaili, Shaoxun Yuan, Tao Yang, Zhen Li, Zhen Zhang, Yun Yang, Tong Qiao

PMC · DOI: 10.1038/s41598-026-38463-1 · Scientific Reports · 2026-02-10

## TL;DR

This study uses CBCT radiomic features and machine learning to detect carotid atherosclerosis in periodontitis patients, achieving strong predictive performance.

## Contribution

A novel two-stage bootstrap and multimodal machine learning approach for early detection of carotid atherosclerosis in periodontitis patients.

## Key findings

- A random forest model achieved an AUC of 0.892 in detecting carotid atherosclerosis in periodontitis patients.
- Feature selection identified 20 optimal radiomic features through bootstrap and AIC-based methods.
- The model demonstrated high sensitivity (0.957) and moderate specificity (0.710) in cross-validation.

## Abstract

This study aims to develop and validate a deep learning model based on Cone Beam Computed Tomography (CBCT) radiomic features to achieve early detection of potential carotid atherosclerosis in periodontitis patients. The study utilised data from 279 observations, each with 206 features, to distinguish between periodontitis patients with and without concomitant carotid atherosclerosis. To address class imbalance, Synthetic Minority Over-sampling Technique(SMOTE) oversampling was applied (dup_size = 1), increasing the sample size to 390 observations. A bootstrap method (n_bootstrap = 1000) was employed for feature selection. In each iteration, a dataset was created by resampling with replacement. Features were first filtered using Spearman’s rank correlation to remove redundant variables (correlation coefficient > 0.8), followed by Lasso regression with ten-fold cross-validation to select predictive variables based on non-zero coefficients. High-frequency features identified through 1000 iterations underwent a second round of bootstrap analysis, where Logistic Regression combined with the Akaike Information Criterion (AIC) was used to determine the final variable set. This rigorous process ensured optimal feature selection for developing an effective early detection model for carotid atherosclerosis in periodontitis patients. The study analyzed data from 279 observations, with each observation characterized by 206 features, to differentiate between periodontitis patients with concurrent carotid atherosclerosis and those without. After SMOTE oversampling, the dataset was increased to 390 observations. As stated in the Methods, SMOTE was applied after baseline analysis to augment the dataset for model development. Feature selection through bootstrap methods identified 26 high-frequency features (> 500 times), which were further refined to a final set of 20 features using Logistic Regression combined with AIC. Three machine learning models—Logistic Regression (LR), Support Vector Machine (SVM), and Random Forest (RF)—were developed and evaluated using five-fold cross-validation. The best-performing model was the RF model, achieving an Area Under the Curve(AUC) of 0.892, sensitivity of 0.957, specificity of 0.710, and accuracy of 0.859. Receiver Operating Characteristic(ROC) curves and calibration plots demonstrated good predictive performance and model calibration across all three models. Decision curve analysis showed that the RF model provided the highest net benefit across a range of risk thresholds, indicating its potential for clinical utility in early detection of carotid atherosclerosis in periodontitis patients. This study developed a random forest model using CBCT radiomics to detect carotid atherosclerosis in periodontitis patients early. After rigorous feature selection and five-fold cross-validation, it achieved an AUC of 0.892, with sensitivity of 0.957 and specificity of 0.710. The model shows high predictive performance and clinical utility, offering an effective tool for early detection.

## Linked entities

- **Diseases:** periodontitis (MONDO:0005076)

## Full-text entities

- **Diseases:** periodontitis (MESH:D010518), carotid stenosis (MESH:D016893)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12963598/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12963598/full.md

## References

10 references — full list in the complete paper: https://tomesphere.com/paper/PMC12963598/full.md

---
Source: https://tomesphere.com/paper/PMC12963598