# Ultrasound radiomics-based machine learning models for risk stratification of follicular thyroid tumors

**Authors:** Ya Yuan, Xinyue Wang, Hongyan Deng, Kunpeng Cao, Fei Yu

PMC · DOI: 10.3389/fonc.2025.1707586 · Frontiers in Oncology · 2026-01-05

## TL;DR

This study explores using ultrasound radiomics and machine learning to better distinguish between benign and malignant follicular thyroid tumors.

## Contribution

The novel contribution is integrating peritumoral radiomic features with clinical-ultrasound data to improve risk stratification of follicular thyroid tumors.

## Key findings

- Logistic regression with peritumoral radiomics showed stable performance and improved metrics like micro-AUC and NPV.
- All models struggled to accurately identify follicular tumors of uncertain malignant potential (FT-UMP).
- Radiomics-based models exhibited overfitting, with the best test accuracy of 0.643.

## Abstract

Follicular thyroid carcinoma (FTC) is the second most common malignant thyroid tumor. Preoperative differentiation among follicular thyroid adenoma (FA), follicular tumor of uncertain malignant potential (FT-UMP), and FTC remains challenging using conventional ultrasound and fine-needle aspiration. This study aims to develop a machine learning model utilizing ultrasound radiomic features to improve risk stratification of follicular thyroid tumors.

A total of 277 patients with histopathologically confirmed follicular tumors (163 FA, 63 FT-UMP, 51 FTC) were included. Clinical and ultrasound features, along with radiomic features from intratumoral and peritumoral regions, were extracted from preoperative ultrasound images. Three machine learning models—logistic regression (LR), support vector machine (SVM), and random forest (RF)—were trained to construct four models: clinical-ultrasound (U), clinical-ultrasound with intratumoral radiomics (UI), clinical-ultrasound with peritumoral radiomics (UP), and clinical-ultrasound with combined intratumoral and peritumoral radiomics (UIP).

The RF-based clinical-ultrasound model demonstrated the highest accuracy (test: 0.643) but exhibited significant overfitting in radiomics-based models. The SVM model showed moderate performance. The LR model in the UP and UIP models delivered stable performance, achieving the highest test accuracy of 0.643. Specifically, the UP model showed improved micro-AUC, specificity, negative predictive value (NPV), and F1 score. The LR model exhibited high sensitivity but low specificity for benign nodules, and high specificity but low sensitivity for malignant nodules. All models performed poorly in identifying FT-UMP nodules.

Integrating peritumoral radiomic features with clinical-ultrasound features using logistic regression enhances the differentiation between benign and malignant follicular thyroid tumors.

## Linked entities

- **Diseases:** follicular thyroid carcinoma (MONDO:0005034), follicular thyroid adenoma (MONDO:0005032)

## Full-text entities

- **Diseases:** follicular tumor of uncertain (MESH:D009369), UMP (MESH:C537136), FA (MESH:D000236), thyroid tumor (MESH:D013964), FTC (MESH:D018263)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12812758/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12812758/full.md

## References

32 references — full list in the complete paper: https://tomesphere.com/paper/PMC12812758/full.md

---
Source: https://tomesphere.com/paper/PMC12812758