Multi-scale Transformer Network with Edge-aware Pre-training for   Cross-Modality MR Image Synthesis

Yonghao Li; Tao Zhou; Kelei He; Yi Zhou; Dinggang Shen

arXiv:2212.01108·eess.IV·June 21, 2023

Multi-scale Transformer Network with Edge-aware Pre-training for Cross-Modality MR Image Synthesis

Yonghao Li, Tao Zhou, Kelei He, Yi Zhou, Dinggang Shen

PDF

Open Access 2 Repos

TL;DR

This paper introduces a multi-scale transformer network with edge-aware pre-training to improve cross-modality MR image synthesis, effectively leveraging limited paired data and abundant unpaired data for better image generation.

Contribution

The paper proposes a novel pre-training approach with Edge-MAE and a dual-scale fusion network, enhancing synthesis performance with less paired data.

Findings

01

Achieves comparable results with only 30% of paired data.

02

Effective use of unpaired data through self-supervised pre-training.

03

Improves structural preservation in synthesized images.

Abstract

Cross-modality magnetic resonance (MR) image synthesis can be used to generate missing modalities from given ones. Existing (supervised learning) methods often require a large number of paired multi-modal data to train an effective synthesis model. However, it is often challenging to obtain sufficient paired data for supervised training. In reality, we often have a small number of paired data while a large number of unpaired data. To take advantage of both paired and unpaired data, in this paper, we propose a Multi-scale Transformer Network (MT-Net) with edge-aware pre-training for cross-modality MR image synthesis. Specifically, an Edge-preserving Masked AutoEncoder (Edge-MAE) is first pre-trained in a self-supervised manner to simultaneously perform 1) image imputation for randomly masked patches in each image and 2) whole edge map estimation, which effectively learns both contextual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Medical Image Segmentation Techniques

MethodsAttention Is All You Need · *Communicated@Fast*How Do I Communicate to Expedia? · Convolution · Absolute Position Encodings · Layer Normalization · Softmax · Dropout · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Concatenated Skip Connection