Symmetric Transformer-based Network for Unsupervised Image Registration
Mingrui Ma, Lei Song, Yuanbo Xu, Guixia Liu

TL;DR
This paper introduces SymTrans, a symmetric transformer-based neural network with a novel efficient attention mechanism for unsupervised medical image registration, achieving state-of-the-art results.
Contribution
It proposes a convolution-based efficient multi-head self-attention (CEMSA) and a symmetric transformer architecture for improved registration performance.
Findings
Achieves state-of-the-art registration accuracy
Reduces model parameters and computational complexity
Effective in displacement field and diffeomorphic registration
Abstract
Medical image registration is a fundamental and critical task in medical image analysis. With the rapid development of deep learning, convolutional neural networks (CNN) have dominated the medical image registration field. Due to the disadvantage of the local receptive field of CNN, some recent registration methods have focused on using transformers for non-local registration. However, the standard Transformer has a vast number of parameters and high computational complexity, which causes Transformer can only be applied at the bottom of the registration models. As a result, only coarse information is available at the lowest resolution, limiting the contribution of Transformer in their models. To address these challenges, we propose a convolution-based efficient multi-head self-attention (CEMSA) block, which reduces the parameters of the traditional Transformer and captures local spatial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Imaging and Analysis · Medical Image Segmentation Techniques · Advanced Neural Network Applications
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Byte Pair Encoding · Dense Connections · Residual Connection · Dropout · Adam · Position-Wise Feed-Forward Layer
