Verifiable and Energy Efficient Medical Image Analysis with Quantised Self-attentive Deep Neural Networks
Rakshith Sathish, Swanand Khare, Debdoot Sheet

TL;DR
This paper introduces quantised self-attentive neural networks as an energy-efficient and hardware-friendly alternative to CNNs for medical image analysis, achieving significant reductions in model size, parameters, and energy consumption.
Contribution
It proposes replacing convolutional layers with stand-alone self-attention layers and applying quantisation, demonstrating improved efficiency in medical image classification and segmentation tasks.
Findings
50-80% reduction in model size
60-80% fewer parameters
40-85% fewer FLOPs
Abstract
Convolutional Neural Networks have played a significant role in various medical imaging tasks like classification and segmentation. They provide state-of-the-art performance compared to classical image processing algorithms. However, the major downside of these methods is the high computational complexity, reliance on high-performance hardware like GPUs and the inherent black-box nature of the model. In this paper, we propose quantised stand-alone self-attention based models as an alternative to traditional CNNs. In the proposed class of networks, convolutional layers are replaced with stand-alone self-attention layers, and the network parameters are quantised after training. We experimentally validate the performance of our method on classification and segmentation tasks. We observe a reduction in model size, lesser number of parameters, fewer FLOPs and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Brain Tumor Detection and Classification · Radiomics and Machine Learning in Medical Imaging
