Classification of Gleason Grading in Prostate Cancer Histopathology Images Using Deep Learning Techniques: YOLO, Vision Transformers, and Vision Mamba
Amin Malekmohammadi, Ali Badiezadeh, Seyed Mostafa Mirhassani, Parisa, Gifani, Majid Vafaeezadeh

TL;DR
This study compares deep learning models YOLO, Vision Transformers, and Vision Mamba for automating Gleason grading in prostate cancer histopathology images, aiming to improve diagnostic accuracy and efficiency.
Contribution
It introduces a comparative evaluation of three advanced deep learning models for Gleason grading, highlighting Vision Mamba as the most effective.
Findings
Vision Mamba achieved the highest accuracy and lowest false rates.
YOLO offered the best speed for real-time analysis.
Vision Transformers captured image dependencies but with higher computational cost.
Abstract
Prostate cancer ranks among the leading health issues impacting men, with the Gleason scoring system serving as the primary method for diagnosis and prognosis. This system relies on expert pathologists to evaluate samples of prostate tissue and assign a Gleason grade, a task that requires significant time and manual effort. To address this challenge, artificial intelligence (AI) solutions have been explored to automate the grading process. In light of these challenges, this study evaluates and compares the effectiveness of three deep learning methodologies, YOLO, Vision Transformers, and Vision Mamba, in accurately classifying Gleason grades from histopathology images. The goal is to enhance diagnostic precision and efficiency in prostate cancer management. This study utilized two publicly available datasets, Gleason2019 and SICAPv2, to train and test the performance of YOLO, Vision…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Medical Imaging and Analysis
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Mamba: Linear-Time Sequence Modeling with Selective State Spaces
