Multi-scale Transformer Network with Edge-aware Pre-training for Cross-Modality MR Image Synthesis
Yonghao Li, Tao Zhou, Kelei He, Yi Zhou, Dinggang Shen

TL;DR
This paper introduces a multi-scale transformer network with edge-aware pre-training to improve cross-modality MR image synthesis, effectively leveraging limited paired data and abundant unpaired data for better image generation.
Contribution
The paper proposes a novel pre-training approach with Edge-MAE and a dual-scale fusion network, enhancing synthesis performance with less paired data.
Findings
Achieves comparable results with only 30% of paired data.
Effective use of unpaired data through self-supervised pre-training.
Improves structural preservation in synthesized images.
Abstract
Cross-modality magnetic resonance (MR) image synthesis can be used to generate missing modalities from given ones. Existing (supervised learning) methods often require a large number of paired multi-modal data to train an effective synthesis model. However, it is often challenging to obtain sufficient paired data for supervised training. In reality, we often have a small number of paired data while a large number of unpaired data. To take advantage of both paired and unpaired data, in this paper, we propose a Multi-scale Transformer Network (MT-Net) with edge-aware pre-training for cross-modality MR image synthesis. Specifically, an Edge-preserving Masked AutoEncoder (Edge-MAE) is first pre-trained in a self-supervised manner to simultaneously perform 1) image imputation for randomly masked patches in each image and 2) whole edge map estimation, which effectively learns both contextual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Medical Image Segmentation Techniques
MethodsAttention Is All You Need · *Communicated@Fast*How Do I Communicate to Expedia? · Convolution · Absolute Position Encodings · Layer Normalization · Softmax · Dropout · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Concatenated Skip Connection
