Structural Attention: Rethinking Transformer for Unpaired Medical Image Synthesis
Vu Minh Hieu Phan, Yutong Xie, Bowen Zhang, Yuankai Qi, Zhibin Liao,, Antonios Perperidis, Son Lam Phung, Johan W. Verjans, Minh-Son To

TL;DR
This paper introduces UNest, a structural attention-based transformer architecture that leverages anatomical priors to improve unpaired medical image synthesis across multiple modalities, outperforming existing methods.
Contribution
We propose UNest, a novel transformer architecture with structural inductive biases and anatomical guidance for unpaired medical image synthesis.
Findings
UNest improves synthesis quality by up to 19.30% over recent methods.
Incorporating structural biases enhances anatomical detail preservation.
The method is effective across MR, CT, and PET datasets.
Abstract
Unpaired medical image synthesis aims to provide complementary information for an accurate clinical diagnostics, and address challenges in obtaining aligned multi-modal medical scans. Transformer-based models excel in imaging translation tasks thanks to their ability to capture long-range dependencies. Although effective in supervised training settings, their performance falters in unpaired image synthesis, particularly in synthesizing structural details. This paper empirically demonstrates that, lacking strong inductive biases, Transformer can converge to non-optimal solutions in the absence of paired data. To address this, we introduce UNet Structured Transformer (UNest), a novel architecture incorporating structural inductive biases for unpaired medical image synthesis. We leverage the foundational Segment-Anything Model to precisely extract the foreground structure and perform…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Cell Image Analysis Techniques · AI in cancer detection
MethodsAttention Is All You Need · Softmax · Layer Normalization · Byte Pair Encoding · Label Smoothing · Position-Wise Feed-Forward Layer · Dropout · Adam · Linear Layer · Absolute Position Encodings
