# Predicting modelling for early diagnosis in early myeloproliferative neoplasms, not otherwise specified. Evidence from a machine learning study

**Authors:** Anna Scattone, Andrea Lupo, Concetta Saponaro, Margherita Sonnessa, Paolo Ditonno, Samantha Bove, Annarita Fanizzi, Giuseppe Accogli, Rossana Daprile, Francesco Alfredo Zito, Maria Colomba Comes, Raffaella Massafra

PMC · DOI: 10.3389/fonc.2025.1741610 · Frontiers in Oncology · 2026-01-26

## TL;DR

This study uses machine learning to help diagnose early myeloproliferative neoplasms by analyzing bone marrow biopsy images, showing promising results in supporting clinical decisions.

## Contribution

The study introduces a machine learning model for classifying early myeloproliferative neoplasms using morphological features from bone marrow biopsies.

## Key findings

- The model achieved a mean multiclass AUC of 0.70 and accuracy of 0.60 in classifying MPN subtypes.
- On MPN-NOS patients, the model agreed with pathologists 61.5% of the time and with clinical follow-up 50% of the time.

## Abstract

Recently, artificial intelligence (AI) has become a valuable tool for diagnosing and predicting outcomes in blood disorders. Whole Slide Imaging (WSI) of bone marrow biopsies (BMBs) offers detailed, high-resolution views of cells and tissues; its adoption may improve the resources dedicated to the interpretation of BMB with suspected early myeloproliferative neoplasms (MPNs).

We collected a retrospective dataset of H&E-stained BMBs from 88 patients diagnosed with MPN, divided into three groups: 19 with prefibrotic primary myelofibrosis (pre-PMF), 30 with polycythemia vera (PV), and 39 with essential thrombocythemia (ET). Using AI, we framed this as a three-class classification problem. For each whole slide image, we automatically calculated cellularity and cell density. We extracted morphological features related to megakaryocytes—area, perimeter, and circularity—and summarized them for each patient using statistics like mean, standard deviation, skewness, kurtosis, and entropy. This resulted in 17 features combining nuclear morphology, cell density statistics, and cellularity. After selecting significant features with the Kruskal-Wallis test, we trained a Support Vector Machine (SVM) classifier with 5-fold cross-validation to predict MPN subtypes. We then tested the model on 13 patients with a diagnosis of MPN not otherwise specified (MPN-NOS) to assess its capacity to correctly characterize the diagnosis.

The model achieved a mean multiclass Area Under the Curve (AUC) of 0.70 ± 0.01 and an accuracy of 0.60 ± 0.03. When tested on the 13 patients with MPN not otherwise specified (MPN-NOS), the model agreed with pathologists’ biopsy classifications 61.5% of the time and with clinical follow-up evaluations 50% of the time.

This study represents a first step toward the development of automated tools to support MPN diagnosis, providing potential assistance to haematologists and pathologists in the clinical management of patients.

## Linked entities

- **Diseases:** myeloproliferative neoplasms (MONDO:0020076), polycythemia vera (MONDO:0009891), essential thrombocythemia (MONDO:0005029)

## Full-text entities

- **Diseases:** primary myelofibrosis (MESH:D055728), MPN not otherwise specified (MESH:C536665), MPNs (MESH:D009369), PV (MESH:D011087), ET (MESH:D013920), blood disorders (MESH:D006402)
- **Chemicals:** H&amp;E (MESH:D006371)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12883421/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12883421/full.md

## References

28 references — full list in the complete paper: https://tomesphere.com/paper/PMC12883421/full.md

---
Source: https://tomesphere.com/paper/PMC12883421