DSVTLA: Deep Swin Vision Transformer-Based Transfer Learning Architecture for Multi-Type Cancer Histopathological Cancer Image Classification

Muazzem Hussain Khan; Tasdid Hasnain; Md. Jamil khan; Ruhul Amin; Md. Shamim Reza; Md. Al Mehedi Hasan; Md Ashad Alam

arXiv:2604.09468·eess.IV·April 13, 2026

DSVTLA: Deep Swin Vision Transformer-Based Transfer Learning Architecture for Multi-Type Cancer Histopathological Cancer Image Classification

Muazzem Hussain Khan, Tasdid Hasnain, Md. Jamil khan, Ruhul Amin, Md. Shamim Reza, Md. Al Mehedi Hasan, Md Ashad Alam

PDF

TL;DR

This paper introduces a novel deep Swin Vision Transformer-based transfer learning architecture that significantly improves multi-cancer histopathological image classification accuracy across diverse datasets.

Contribution

The study presents a new hierarchical Swin Transformer combined with ResNet50 features, outperforming existing CNN and transformer models in multi-cancer classification tasks.

Findings

01

Achieved up to 100% accuracy on lung-colon cancer datasets.

02

Demonstrated superior performance over several state-of-the-art models.

03

Provided a robust, interpretable system for multi-cancer diagnosis.

Abstract

In this study, we proposed a deep Swin-Vision Transformer-based transfer learning architecture for robust multi-cancer histopathological image classification. The proposed framework integrates a hierarchical Swin Transformer with ResNet50-based convolution features extraction, enabling the model to capture both long-range contextual dependencies and fine-grained local morphological patterns within histopathological images. To validate the efficiency of the proposed architecture, an extensive experiment was executed on a comprehensive multi-cancer dataset including Breast Cancer, Oral Cancer, Lung and Colon Cancer, Kidney Cancer, and Acute Lymphocytic Leukemia (ALL), including both original and segmented images were analyzed to assess model robustness across heterogeneous clinical imaging conditions. Our approach is benchmarked alongside several state-of-the-art CNN and transfer models,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.