Towards Explainable Skin Cancer Classification: A Dual-Network Attention Model with Lesion Segmentation and Clinical Metadata Fusion
Md. Enamul Atiq, Shaikh Anowarul Fattah

TL;DR
This paper introduces a dual-network attention model that combines lesion segmentation and clinical metadata to improve the accuracy and interpretability of skin cancer classification from dermoscopic images.
Contribution
It presents a novel dual-encoder architecture with attention mechanisms and metadata fusion, enhancing both classification performance and model interpretability.
Findings
Achieves state-of-the-art segmentation performance.
Significantly improves classification accuracy and AUC.
Provides interpretable heatmaps focusing on lesion areas.
Abstract
Skin cancer is a life-threatening disease where early detection significantly improves patient outcomes. Automated diagnosis from dermoscopic images is challenging due to high intra-class variability and subtle inter-class differences. Many deep learning models operate as "black boxes," limiting clinical trust. In this work, we propose a dual-encoder attention-based framework that leverages both segmented lesions and clinical metadata to enhance skin lesion classification in terms of both accuracy and interpretability. A novel Deep-UNet architecture with Dual Attention Gates (DAG) and Atrous Spatial Pyramid Pooling (ASPP) is first employed to segment lesions. The classification stage uses two DenseNet201 encoders-one on the original image and another on the segmented lesion whose features are fused via multi-head cross-attention. This dual-input design guides the model to focus on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCutaneous Melanoma Detection and Management · Nonmelanoma Skin Cancer Studies · AI in cancer detection
