ASGDiffusion: Parallel High-Resolution Generation with Asynchronous Structure Guidance
Yuming Li, Peidong Jia, Daiwei Hong, Yueru Jia, Qi She, Rui Zhao, Ming, Lu, Shanghang Zhang

TL;DR
ASGDiffusion introduces a parallel high-resolution image generation method that uses asynchronous structure guidance to improve semantic consistency, reduce pattern repetition, and accelerate the process with multi-GPU support.
Contribution
The paper presents a novel parallel HR image generation approach with asynchronous structure guidance, addressing pattern repetition and computational efficiency issues in diffusion models.
Findings
Reduces pattern repetition in HR images
Significantly accelerates generation speed with multi-GPU parallelism
Achieves state-of-the-art results in high-resolution image generation
Abstract
Training-free high-resolution (HR) image generation has garnered significant attention due to the high costs of training large diffusion models. Most existing methods begin by reconstructing the overall structure and then proceed to refine the local details. Despite their advancements, they still face issues with repetitive patterns in HR image generation. Besides, HR generation with diffusion models incurs significant computational costs. Thus, parallel generation is essential for interactive applications. To solve the above limitations, we introduce a novel method named ASGDiffusion for parallel HR generation with Asynchronous Structure Guidance (ASG) using pre-trained diffusion models. To solve the pattern repetition problem of HR image generation, ASGDiffusion leverages the low-resolution (LR) noise weighted by the attention mask as the structure guidance for the denoising step to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptical measurement and interference techniques · Optical Systems and Laser Technology · Image Processing Techniques and Applications
MethodsSoftmax · Attention Is All You Need · Diffusion · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
