# Multimodal skin lesion classification for early cancer diagnosis using deep learning

**Authors:** Vandit Gabani, T. M. Navamani, K. Shyamala, Vinita Kishore Vaswani Rajpal

PMC · DOI: 10.3389/fphys.2026.1717517 · Frontiers in Physiology · 2026-02-09

## TL;DR

This paper presents a deep learning model that improves skin cancer detection using multiple neural network architectures and achieves high accuracy.

## Contribution

The novel contribution is an ensemble model combining three DCNNs with fine-tuning and hyperparameter optimization for skin lesion classification.

## Key findings

- The ensemble model achieved 97.9% testing accuracy in skin lesion classification.
- The model showed 99.2% precision, recall, and F1-score, outperforming individual models.

## Abstract

Skin cancer, particularly melanoma, is a rapidly spreading and potentially life-threatening disease affecting humans. Melanoma typically begins on the skin’s surface before penetrating deeper layers. Early detection significantly improves survival rates, with simple and cost-effective treatments yielding a 96% success rate. Traditional diagnosis methods rely on expert dermatologists, specialized equipment, and invasive biopsies. Deep learning offers advanced solutions for detecting skin cancer earlier and with high accuracy to mitigate costs and assist dermatologists. Deep Convolutional Neural Networks have shown promise in several computer vision tasks, including image classification, prompting their application in dermatology.

This work focuses on leveraging three prominent DCNN architectures, DenseNet 201, VGG16, and InceptionV3, to classify skin lesions using dermoscopic images. The HAM10000 dataset was taken and divided into training and testing sets. The preprocessing methods include image normalization, scaling, and Otsu’s binary thresholding segmentation and augmentation techniques were applied. We introduced two fine-tuning approaches. Firstly, the top layers of the base model are retrained. Secondly, retraining the half layers of the base models and additional layers are added to form customized CNN models. We merge these underlying models into an ensemble and hyperparameter tuning to enhance performance. The transparency and interpretability of the model are enhanced by Grad-CAM, which raises the model’s dependability for clinical applications.

Combining DenseNet-201, InceptionV3, and VGG16, the proposed ensemble model outperforms the individual models with a testing accuracy of 97.9%. Additionally, it exhibits a better F1-score, recall, and precision of 99.2%, demonstrating its efficacy in automated skin lesion detection.

## Linked entities

- **Diseases:** melanoma (MONDO:0005105), skin cancer (MONDO:0002898)
- **Species:** Homo sapiens (taxon 9606)

## Full-text entities

- **Diseases:** carcinogenic (MESH:D011230), Dermatofibroma (MESH:D018219), nevus (MESH:D009506), Benign keratosis-like lesions (MESH:D007642), Basal cell carcinoma (MESH:D002280), skin abnormalities (MESH:D012868), Malignant melanoma (MESH:D008545), Melanocytic nevi (MESH:D009508), Skin Lesions (MESH:D012871), benign tumors (MESH:D009369), Vascular lesions (MESH:D014652), Actinic keratoses (MESH:D055623), vitiligo (MESH:D014820), Skin cancer (MESH:D012878), Lesion (MESH:D009059), Tumorigenesis (MESH:D063646), DL (MESH:D007859)
- **Chemicals:** AlexNet (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12926115/full.md

## Figures

15 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12926115/full.md

## References

40 references — full list in the complete paper: https://tomesphere.com/paper/PMC12926115/full.md

---
Source: https://tomesphere.com/paper/PMC12926115