# Explainable depth-wise and channel-wise fusion models for multi-class skin lesion classification

**Authors:** Humam AbuAlkebash, Radhwan A. A. Saleh, H. Metin Ertunç, Lin Xu, Lin Xu, Lin Xu, Lin Xu

PMC · DOI: 10.1371/journal.pone.0340901 · PLOS One · 2026-01-22

## TL;DR

This paper introduces explainable AI models for classifying skin lesions, combining CNNs and Vision Transformers to achieve high accuracy and transparency.

## Contribution

A novel fusion framework combining depth-wise and channel-wise strategies with explainability for multi-class skin lesion classification.

## Key findings

- The optimized fusion model achieves 90% weighted average Precision, Recall, and F1 score on the HAM10000 dataset.
- Explainable AI analysis shows the model's predictions align with key dermatological features like border irregularity and color variegation.

## Abstract

The clinical adoption of deep learning in dermatology requires models that are not only highly accurate but also transparent and trustworthy. To address this dual challenge, this study presents a systematic investigation into deep feature fusion, exploring how to effectively combine complementary representations from diverse neural network architectures. We design and rigorously evaluate six distinct fusion models, first investigating depth-wise and channel-wise strategies for integrating features from powerful Convolutional Neural Network (CNN) backbones, and subsequently incorporating the global contextual awareness of Vision Transformers (ViTs). Evaluated on the challenging 7-class HAM10000 dataset, our optimized architecture achieves a weighted average Precision, Recall, and F1 score of 90%, demonstrating superior diagnostic performance. Crucially, our comprehensive explainable AI (XAI) analysis using Grad-CAM and SHAP reveals that the fusion strategy directly dictates the model’s clinical interpretability; our most effective models learn to base their predictions on salient dermatological features, such as border irregularity and color variegation, in a manner that aligns with expert reasoning. This work provides a robust framework and valuable architectural insights for developing the next generation of high-performing, clinically reliable, and transparent AI-powered diagnostic tools.

## Full-text entities

- **Genes:** KLB (klotho beta) [NCBI Gene 152831] {aka BKL}, VIT (vitrin) [NCBI Gene 5212] {aka VIT1}, NMUR1 (neuromedin U receptor 1) [NCBI Gene 10316] {aka (FM-3), FM-3, FM3, GPC-R, GPR66, NMU1R}, NMUR2 (neuromedin U receptor 2) [NCBI Gene 56923] {aka FM-4, FM4, NMU-R2, NMU2R, TGR-1, TGR1}, SHROOM4 (shroom family member 4) [NCBI Gene 57477] {aka MRXSSDS, SHAP, shrm4}
- **Diseases:** VASC (MESH:D014652), Keratosis (MESH:D007642), lesion (MESH:D009059), DL (MESH:D007859), Skin Lesion (MESH:D012871), AKIEC (MESH:D055623), BCC (MESH:D002280), multi- (MESH:D015161), intraepithelial carcinoma/Bowen's disease (MESH:D001913), Melanocytic Nevus (MESH:D009508), DF (MESH:D018219), cancer (MESH:D009369), XAI (MESH:C538243), MEL (MESH:D008545), Kaggle skin cancer (MESH:D012878), papule (MESH:D000169), SCC (MESH:D002294), NV (MESH:D009506)
- **Chemicals:** FM5 (-), melanin (MESH:D008543)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** FM5 — Homo sapiens (Human), Plasma cell myeloma, Cancer cell line (CVCL_6257)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12826527/full.md

## Figures

13 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12826527/full.md

## References

53 references — full list in the complete paper: https://tomesphere.com/paper/PMC12826527/full.md

---
Source: https://tomesphere.com/paper/PMC12826527