Vector Field Attention for Deformable Image Registration
Yihao Liu, Junyu Chen, Lianrui Zuo, Aaron Carass, Jerry L. Prince

TL;DR
This paper introduces Vector Field Attention (VFA), a novel deep learning framework for deformable image registration that directly retrieves pixel correspondences using an attention mechanism, improving efficiency and accuracy.
Contribution
VFA enables direct pixel correspondence retrieval in image registration without learnable parameters, enhancing efficiency over traditional neural network approaches.
Findings
VFA outperforms existing methods on public datasets.
VFA achieves superior accuracy in intra- and inter-modality registration.
VFA is effective in supervised, unsupervised, and semi-supervised settings.
Abstract
Deformable image registration establishes non-linear spatial correspondences between fixed and moving images. Deep learning-based deformable registration methods have been widely studied in recent years due to their speed advantage over traditional algorithms as well as their better accuracy. Most existing deep learning-based methods require neural networks to encode location information in their feature maps and predict displacement or deformation fields though convolutional or fully connected layers from these high-dimensional feature maps. In this work, we present Vector Field Attention (VFA), a novel framework that enhances the efficiency of the existing network design by enabling direct retrieval of location correspondences. VFA uses neural networks to extract multi-resolution feature maps from the fixed and moving images and then retrieves pixel-level correspondences based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Image and Object Detection Techniques · Image Processing and 3D Reconstruction
MethodsSoftmax · Attention Is All You Need · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
