# Demographic, Morphological, and Histopathological Characteristics of Melanoma and Nevi: Insights from Statistical Analysis and Machine Learning Models

**Authors:** Blagjica Lazarova, Gordana Petrushevska, Zdenka Stojanovska, Stephen C. Mullins

PMC · DOI: 10.3390/diagnostics15192499 · Diagnostics · 2025-10-01

## TL;DR

This study uses statistical and machine learning methods to identify key factors that help distinguish melanomas from benign nevi, improving diagnostic accuracy in dermatopathology.

## Contribution

The study introduces a robust machine learning framework using glmnet to classify melanocytic lesions with high accuracy and discrimination.

## Key findings

- Age, lesion size, and histopathological features like cytological and extracellular matrix changes were key predictors of melanoma.
- The glmnet model achieved high accuracy (95.3% training, 92.6% test) and strong discrimination (AUC = 0.97).
- Collinearity and overlapping effects reduced the significance of some variables in multivariable analysis.

## Abstract

Background: Early and accurate differentiation between melanomas and benign nevi is essential for making proper clinical decisions. This study aimed to identify clinical, morphological, and histopathological variables most strongly associated with melanoma, using both statistical and machine learning approaches. Methods: This study evaluated 184 melanocytic lesions using clinical, morphological, and histopathological parameters. Univariable analyses were performed in XLStat statistical software, version 2014.5.03, while multivariable machine learning models were developed in Jamovi (version 2.4). Five supervised algorithms (random forest, partial least squares, elastic net regression, conditional inference trees, and k-nearest neighbors) were compared using repeated cross-validation, with performance evaluated by accuracy, Kappa, sensitivity, specificity, F1 score, and calibration. Results: Univariable analysis identified significant differences between melanomas and nevi in age, horizontal diameter, gender, lesion location, and selected histopathological features (cytological and extracellular matrix changes, epidermal interactions). However, several associations weakened in multivariable analysis due to collinearity and overlapping effects. Using glmnet, the most influential independent predictors were cytological changes, horizontal diameter, epidermal interactions, and extracellular matrix features, alongside age, gender, and lesion location. The model achieved high discrimination (AUC = 0.97, 95% CI: 0.93–0.99) and accuracy (training: 95.3%; test: 92.6%), confirming robustness. Conclusions: Structured demographic, morphological, and histopathological data—particularly age, lesion size, cytological and extracellular matrix changes, and epidermal interactions—can effectively support classification of melanocytic lesions. Machine learning approaches (the glmnet model in our study) provide a reliable framework to evaluate such predictors and offer practical diagnostic support in dermatopathology.

## Linked entities

- **Diseases:** melanoma (MONDO:0005105)

## Full-text entities

- **Diseases:** Melanoma (MESH:D008545), Nevi (MESH:D009506), melanocytic lesions (MESH:D009508)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12524229/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12524229/full.md

## References

26 references — full list in the complete paper: https://tomesphere.com/paper/PMC12524229/full.md

---
Source: https://tomesphere.com/paper/PMC12524229