Deep Modeling and Optimization of Medical Image Classification
Yihang Wu, Muhammad Owais, Reem Kateb, Ahmad Chaddad

TL;DR
This paper explores advanced deep learning models, including CLIP variants, federated learning, and traditional ML, to improve medical image classification accuracy while addressing data privacy and generalization challenges.
Contribution
It introduces a novel CLIP variant with multiple CNNs and ViTs, combines deep models with federated learning for privacy, and integrates traditional ML to enhance generalization in medical image classification.
Findings
MaxViT achieves 87.03% accuracy on HAM10000 with multimodal learning.
ConvNeXt-L outperforms Swin-B with an F1-score of 83.98% in federated learning.
SVM improves metrics by approximately 2% for Swin transformer models.
Abstract
Deep models, such as convolutional neural networks (CNNs) and vision transformer (ViT), demonstrate remarkable performance in image classification. However, those deep models require large data to fine-tune, which is impractical in the medical domain due to the data privacy issue. Furthermore, despite the feasible performance of contrastive language image pre-training (CLIP) in the natural domain, the potential of CLIP has not been fully investigated in the medical field. To face these challenges, we considered three scenarios: 1) we introduce a novel CLIP variant using four CNNs and eight ViTs as image encoders for the classification of brain cancer and skin cancer, 2) we combine 12 deep models with two federated learning techniques to protect data privacy, and 3) we involve traditional machine learning (ML) methods to improve the generalization ability of those deep models in unseen…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCutaneous Melanoma Detection and Management · Brain Tumor Detection and Classification · COVID-19 diagnosis using AI
MethodsLinear Layer · Softmax · Multi-Head Attention · Stochastic Depth · Attention Is All You Need · Residual Connection · Layer Normalization · Dense Connections · Contrastive Language-Image Pre-training · Vision Transformer
