# Study on differentiating benign and malignant thyroid nodules based on CT multi-phase artificial intelligence models

**Authors:** Daoxiong Xiao, Xianzhong Wu, Peng Xie, Binglin Lai, Jianping Zhong, Junyuan Zhong, Xianjun Zeng

PMC · DOI: 10.3389/fendo.2025.1738342 · 2026-01-16

## TL;DR

This study uses AI models with CT scans and clinical data to accurately distinguish between benign and malignant thyroid nodules.

## Contribution

The study introduces AI models combining multiphase CT radiomics and clinical data to improve thyroid nodule classification.

## Key findings

- AI models using CT imaging and clinical data outperformed clinical-only models with an AUC of 0.811.
- Nomograms integrating radiomics scores or AI scores with clinical data achieved AUCs up to 0.885.
- Integrated AI models show high potential for clinical decision-making in thyroid nodule diagnosis.

## Abstract

The rising global incidence of thyroid nodules necessitates improved non-invasive methods for differentiating benign from malignant lesions. However, research on artificial intelligence (AI) models using multiphase CT imaging to differentiate benign from malignant thyroid nodules is limited.

This retrospective study analyzed multiphase CT data (noncontrast, arterial, and venous phases) from 604 patients with thyroid nodules confirmed by postoperative pathology. We developed and compared multiple machine learning and deep learning models using extracted radiomics features, raw 3D DICOM data, and key clinical factors (sex, age, thyroglobulin and thyrotropin levels). Model performance was evaluated using receiver operating characteristic (ROC) analysis, and Gradient-weighted Class Activation Mapping (Grad-CAM) was used for visualization.

Models incorporating imaging data significantly outperformed a clinical-only model (AUC = 0.811). Nomograms combining either a radiomics score (Rad-Score) or a deep learning score (AI-Score) with clinical data demonstrated the highest diagnostic accuracy. The nomogram based on Rad-Score and clinical data achieved a peak AUC of 0.885. Similarly, the AI-Score-based nomogram reached an AUC of 0.881. Both integrated approaches proved superior to models relying on a single data type.

AI models integrating multiphase CT radiomics or deep learning features with clinical data provide a robust and highly accurate approach for differentiating benign from malignant thyroid nodules. These integrated models show significant potential for improving clinical decision-making.

## Full-text entities

- **Genes:** TG (thyroglobulin) [NCBI Gene 7038] {aka AITD3, TGN}
- **Diseases:** thyroid nodules (MESH:D016606)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12855141/full.md

---
Source: https://tomesphere.com/paper/PMC12855141