# Pilot Exploratory Study of a CT Radiomics Model for the Classification of Small Cell Lung Cancer and Non-Small-Cell Lung Cancer in the Moscow Population: A Step Toward Virtual Biopsy

**Authors:** Maria D. Varyukhina, Alexandr A. Borisov, Rustam A. Erizhokov, Kirill M. Arzamasov, Alexander V. Solovev, Vadim V. Kirsanov, Olga V. Omelyanskaya, Anton V. Vladzymyrskyy, Yuriy A. Vasilev

PMC · DOI: 10.3390/jimaging11100331 · Journal of Imaging · 2025-09-25

## TL;DR

This study explores using CT scan radiomics and machine learning to non-invasively distinguish between two types of lung cancer in a Moscow population, aiming to reduce the need for biopsies.

## Contribution

The study introduces a radiomics-based machine learning model for differentiating SCLC and NSCLC using CT scans in a specific population.

## Key findings

- A gradient boosting model achieved 80.5% accuracy in differentiating SCLC and NSCLC.
- The model had a high ROC AUC of 0.888, indicating strong classification performance.
- Radiomics combined with machine learning shows potential for non-invasive lung cancer subtype classification.

## Abstract

Lung cancer is one of the most common and socially significant cancers worldwide and consists of two main subtypes: small cell lung cancer (SCLC) and non-small cell lung cancer (NSCLC), which require different treatments. Computed tomography (CT) scans cannot reliably differentiate these subtypes, often necessitating invasive biopsies that carry significant risks. Radiomics offers a promising non-invasive alternative by quantitatively analyzing imaging data to extract detailed tissue characteristics beyond visual assessment. This pilot retrospective study analyzed 200 Moscow patients with histologically confirmed SCLC or NSCLC. Manual tumor segmentation on pretreatment CT scans allowed extraction of 107 radiomic features, from which 16 key features were selected to train four machine learning models. Models were evaluated using stratified 5-fold cross-validation, focusing on ROC AUC, accuracy, precision, and recall. All models demonstrated strong performance in distinguishing SCLC from NSCLC, with the gradient boosting model achieving the highest accuracy of 80.5% and ROC AUC of 0.888. These results highlight the potential of radiomics combined with machine learning to enable accurate, non-invasive differentiation of lung cancer subtypes. Further research is needed to expand feature sets, develop automated segmentation tools, and enhance clinical application of this approach.

## Linked entities

- **Diseases:** small cell lung cancer (MONDO:0008433), non-small cell lung cancer (MONDO:0005233)

## Full-text entities

- **Diseases:** Lung cancer (MESH:D008175), cancers (MESH:D009369), SCLC (MESH:D055752), NSCLC (MESH:D002289)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12565470/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12565470/full.md

## References

31 references — full list in the complete paper: https://tomesphere.com/paper/PMC12565470/full.md

---
Source: https://tomesphere.com/paper/PMC12565470