An Exploratory Approach Towards Investigating and Explaining Vision Transformer and Transfer Learning for Brain Disease Detection

Shuvashis Sarker; Shamim Rahim Refat; Faika Fairuj Preotee; Shifat Islam; Tashreef Muhammad; Mohammad Ashraful Hoque

arXiv:2505.16039·cs.CV·June 24, 2025

An Exploratory Approach Towards Investigating and Explaining Vision Transformer and Transfer Learning for Brain Disease Detection

Shuvashis Sarker, Shamim Rahim Refat, Faika Fairuj Preotee, Shifat Islam, Tashreef Muhammad, Mohammad Ashraful Hoque

PDF

TL;DR

This study compares Vision Transformer and transfer learning models for brain disease classification using MRI, demonstrating ViT's superior accuracy and employing explainable AI techniques to improve interpretability for medical diagnosis.

Contribution

It provides a comparative analysis of ViT and transfer learning models for brain MRI classification, highlighting ViT's effectiveness and integrating XAI methods for better model transparency.

Findings

01

ViT achieved 94.39% accuracy in classifying brain diseases.

02

ViT outperformed traditional transfer learning models.

03

XAI methods improved interpretability of model predictions.

Abstract

The brain is a highly complex organ that manages many important tasks, including movement, memory and thinking. Brain-related conditions, like tumors and degenerative disorders, can be hard to diagnose and treat. Magnetic Resonance Imaging (MRI) serves as a key tool for identifying these conditions, offering high-resolution images of brain structures. Despite this, interpreting MRI scans can be complicated. This study tackles this challenge by conducting a comparative analysis of Vision Transformer (ViT) and Transfer Learning (TL) models such as VGG16, VGG19, Resnet50V2, MobilenetV2 for classifying brain diseases using MRI data from Bangladesh based dataset. ViT, known for their ability to capture global relationships in images, are particularly effective for medical imaging tasks. Transfer learning helps to mitigate data constraints by fine-tuning pre-trained models. Furthermore,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAttention Is All You Need · Pointwise Convolution · Batch Normalization · Depthwise Convolution · Depthwise Separable Convolution · Average Pooling · Inverted Residual Block · Convolution · 1x1 Convolution · Linear Layer