Pyramid Hierarchical Masked Diffusion Model for Imaging Synthesis
Xiaojiao Xiao, Qinmin Vivian Hu, Guanghui Wang

TL;DR
This paper introduces PHMDiff, a multi-scale hierarchical diffusion model with masked training and Transformer integration, achieving state-of-the-art medical image synthesis quality across resolutions and modalities.
Contribution
The novel PHMDiff model combines multi-scale hierarchical diffusion with masking and Transformer-based regularization for improved medical image synthesis.
Findings
Outperforms existing methods in PSNR and SSIM
Produces high-quality images with structural integrity
Effective across different resolutions and modalities
Abstract
Medical image synthesis plays a crucial role in clinical workflows, addressing the common issue of missing imaging modalities due to factors such as extended scan times, scan corruption, artifacts, patient motion, and intolerance to contrast agents. The paper presents a novel image synthesis network, the Pyramid Hierarchical Masked Diffusion Model (PHMDiff), which employs a multi-scale hierarchical approach for more detailed control over synthesizing high-quality images across different resolutions and layers. Specifically, this model utilizes randomly multi-scale high-proportion masks to speed up diffusion model training, and balances detail fidelity and overall structure. The integration of a Transformer-based Diffusion model process incorporates cross-granularity regularization, modeling the mutual information consistency across each granularity's latent spaces, thereby enhancing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Image Enhancement Techniques · Cell Image Analysis Techniques
