Transformers Improve Breast Cancer Diagnosis from Unregistered Multi-View Mammograms
Xuxin Chen, Ke Zhang, Neman Abdoli, Patrik W. Gilley, Ximin Wang, Hong, Liu, Bin Zheng, Yuchen Qiu

TL;DR
This paper introduces a Transformer-based model for breast cancer diagnosis from unregistered multi-view mammograms, capturing long-range dependencies across views and sides, leading to improved classification performance over CNNs.
Contribution
The study presents a novel multi-view Transformer architecture that effectively models relationships among unregistered mammograms, outperforming existing CNN-based methods in breast cancer detection.
Findings
Transformer model achieves AUC of 0.818, surpassing CNNs.
Model performs well without complex preprocessing.
Outperforms single-view models significantly.
Abstract
Deep convolutional neural networks (CNNs) have been widely used in various medical imaging tasks. However, due to the intrinsic locality of convolution operation, CNNs generally cannot model long-range dependencies well, which are important for accurately identifying or mapping corresponding breast lesion features computed from unregistered multiple mammograms. This motivates us to leverage the architecture of Multi-view Vision Transformers to capture long-range relationships of multiple mammograms from the same patient in one examination. For this purpose, we employ local Transformer blocks to separately learn patch relationships within four mammograms acquired from two-view (CC/MLO) of two-side (right/left) breasts. The outputs from different views and sides are concatenated and fed into global Transformer blocks, to jointly learn patch relationships between four images representing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · COVID-19 diagnosis using AI · Radiomics and Machine Learning in Medical Imaging
MethodsAttention Is All You Need · Linear Layer · Softmax · Dropout · Dense Connections · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Multi-Head Attention · Byte Pair Encoding · Label Smoothing
