Brain Stroke Detection and Classification Using CT Imaging with Transformer Models and Explainable AI
Shomukh Qari, Maha A. Thafar

TL;DR
This study developed a high-accuracy, explainable AI framework using transformer models for multiclass stroke detection in CT scans, aiming to improve early diagnosis and clinical trust.
Contribution
It introduces a novel transformer-based AI model with explainability for stroke classification, addressing accuracy, transparency, and clinical applicability.
Findings
MaxViT achieved 98% accuracy and F1-score.
Data augmentation improved model performance.
Grad-CAM++ provided interpretable visual explanations.
Abstract
Stroke is one of the leading causes of death globally, making early and accurate diagnosis essential for improving patient outcomes, particularly in emergency settings where timely intervention is critical. CT scans are the key imaging modality because of their speed, accessibility, and cost-effectiveness. This study proposed an artificial intelligence framework for multiclass stroke classification (ischemic, hemorrhagic, and no stroke) using CT scan images from a dataset provided by the Republic of Turkey's Ministry of Health. The proposed method adopted MaxViT, a state-of-the-art Vision Transformer, as the primary deep learning model for image-based stroke classification, with additional transformer variants (vision transformer, transformer-in-transformer, and ConvNext). To enhance model generalization and address class imbalance, we applied data augmentation techniques, including…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Imaging and Analysis · Radiomics and Machine Learning in Medical Imaging · Brain Tumor Detection and Classification
