Is attention all you need in medical image analysis? A review
Giorgos Papanastasiou, Nikolaos Dikaios, Jiahao Huang, Chengjia Wang,, Guang Yang

TL;DR
This review discusses the evolution of hybrid CNN-Transformer models in medical image analysis, highlighting their architectural innovations, advantages, challenges, and future research opportunities in improving global and local feature modeling.
Contribution
It provides a comprehensive survey of hybrid CNN-Transformer models in medical imaging, analyzing their design, performance, and potential for clinical impact.
Findings
Hybrid models combine local and global feature modeling effectively.
Hybrid architectures show promising generalization in medical imaging tasks.
The review identifies key challenges and future directions for hybrid models.
Abstract
Medical imaging is a key component in clinical diagnosis, treatment planning and clinical trial design, accounting for almost 90% of all healthcare data. CNNs achieved performance gains in medical image analysis (MIA) over the last years. CNNs can efficiently model local pixel interactions and be trained on small-scale MI data. The main disadvantage of typical CNN models is that they ignore global pixel relationships within images, which limits their generalisation ability to understand out-of-distribution data with different 'global' information. The recent progress of Artificial Intelligence gave rise to Transformers, which can learn global relationships from data. However, full Transformer models need to be trained on large-scale data and involve tremendous computational complexity. Attention and Transformer compartments (Transf/Attention) which can well maintain properties for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · AI in cancer detection · COVID-19 diagnosis using AI
MethodsAttention Is All You Need · Adam · Label Smoothing · Layer Normalization · Absolute Position Encodings · Linear Layer · Softmax · Dense Connections · Multi-Head Attention · Dropout
