Magnification Invariant Medical Image Analysis: A Comparison of   Convolutional Networks, Vision Transformers, and Token Mixers

Pranav Jeevan; Nikhil Cherian Kurian; Amit Sethi

arXiv:2302.11488·eess.IV·February 23, 2023

Magnification Invariant Medical Image Analysis: A Comparison of Convolutional Networks, Vision Transformers, and Token Mixers

Pranav Jeevan, Nikhil Cherian Kurian, Amit Sethi

PDF

Open Access

TL;DR

This paper compares various deep learning architectures for medical image analysis, emphasizing their robustness to magnification scale changes, and finds WaveMix to be invariant and stable across different magnifications.

Contribution

The study provides a comprehensive comparison of CNNs, Vision Transformers, and token mixers, highlighting WaveMix's invariance to magnification changes in histopathological image classification.

Findings

01

WaveMix maintains stable performance across magnification scales.

02

CNNs like ResNet and MobileNet show degraded performance with scale variation.

03

Vision Transformers exhibit some robustness but less than WaveMix.

Abstract

Convolution Neural Networks (CNNs) are widely used in medical image analysis, but their performance degrade when the magnification of testing images differ from the training images. The inability of CNNs to generalize across magnification scales can result in sub-optimal performance on external datasets. This study aims to evaluate the robustness of various deep learning architectures in the analysis of breast cancer histopathological images with varying magnification scales at training and testing stages. Here we explore and compare the performance of multiple deep learning architectures, including CNN-based ResNet and MobileNet, self-attention-based Vision Transformers and Swin Transformers, and token-mixing models, such as FNet, ConvMixer, MLP-Mixer, and WaveMix. The experiments are conducted using the BreakHis dataset, which contains breast cancer histopathological images at varying…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI in cancer detection · Radiomics and Machine Learning in Medical Imaging · Medical Imaging and Analysis

MethodsBatch Normalization · 1x1 Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Convolution · Kaiming Initialization · Max Pooling · Average Pooling · Residual Block · Residual Connection · Layer Normalization