Cross-modal Attention for MRI and Ultrasound Volume Registration
Xinrui Song, Hengtao Guo, Xuanang Xu, Hanqing Chao, Sheng Xu, Baris, Turkbey, Bradford J. Wood, Ge Wang, Pingkun Yan

TL;DR
This paper introduces a novel cross-modal attention mechanism for MRI and ultrasound volume registration, significantly enhancing registration accuracy and interpretability over traditional CNN approaches.
Contribution
It develops a specialized self-attention block for cross-modal image registration, outperforming larger CNN models and improving interpretability.
Findings
Cross-modal attention block improves registration accuracy
Model outperforms larger CNN networks by 10 times in size
Visualization techniques enhance interpretability
Abstract
Prostate cancer biopsy benefits from accurate fusion of transrectal ultrasound (TRUS) and magnetic resonance (MR) images. In the past few years, convolutional neural networks (CNNs) have been proved powerful in extracting image features crucial for image registration. However, challenging applications and recent advances in computer vision suggest that CNNs are quite limited in its ability to understand spatial correspondence between features, a task in which the self-attention mechanism excels. This paper aims to develop a self-attention mechanism specifically for cross-modal image registration. Our proposed cross-modal attention block effectively maps each of the features in one volume to all features in the corresponding volume. Our experimental results demonstrate that a CNN network designed with the cross-modal attention block embedded outperforms an advanced CNN network 10 times…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Medical Image Segmentation Techniques · AI in cancer detection
