Unsupervised Echocardiography Registration through Patch-based MLPs and Transformers
Zihao Wang, Yingyu Yang, Maxime Sermesant, Herve Delingette

TL;DR
This paper introduces patch-based MLP and Transformer frameworks for unsupervised echocardiography image registration, demonstrating their effectiveness and efficiency compared to CNN-based methods, with improved preservation of volume changes.
Contribution
The work provides a novel benchmark and demonstrates that patch-based MLP and Transformer models can achieve superior or comparable registration performance in echocardiography.
Findings
Patch-based models outperform CNN in preserving volume changes.
Models achieve high registration accuracy on large datasets.
Patch-based methods are efficient in time and space complexity.
Abstract
Image registration is an essential but challenging task in medical image computing, especially for echocardiography, where the anatomical structures are relatively noisy compared to other imaging modalities. Traditional (non-learning) registration approaches rely on the iterative optimization of a similarity metric which is usually costly in time complexity. In recent years, convolutional neural network (CNN) based image registration methods have shown good effectiveness. In the meantime, recent studies show that the attention-based model (e.g., Transformer) can bring superior performance in pattern recognition tasks. In contrast, whether the superior performance of the Transformer comes from the long-winded architecture or is attributed to the use of patches for dividing the inputs is unclear yet. This work introduces three patch-based frameworks for image registration using MLPs and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMulti-Head Attention · Softmax · Layer Normalization · Adam · Linear Layer · Dense Connections · Residual Connection · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Label Smoothing
