AccDiffusion v2: Towards More Accurate Higher-Resolution Diffusion Extrapolation
Zhihang Lin, Mingbao Lin, Wengyi Zhan, Rongrong Ji

TL;DR
AccDiffusion v2 introduces a novel patch-wise higher-resolution diffusion extrapolation method that decouples prompts, incorporates local structural information, and uses dilated sampling to improve image quality without additional training.
Contribution
The paper presents a new approach for higher-resolution diffusion extrapolation that decouples prompts, integrates local structural cues via ControlNet, and employs dilated sampling for enhanced global semantics.
Findings
Achieves state-of-the-art performance in image extrapolation without training.
Effectively suppresses repetitive generation and local distortion.
Demonstrates superior qualitative and quantitative results.
Abstract
Diffusion models suffer severe object repetition and local distortion when the inference resolution differs from its pre-trained resolution. We propose AccDiffusion v2, an accurate method for patch-wise higher-resolution diffusion extrapolation without training. Our in-depth analysis in this paper shows that using an identical text prompt for different patches leads to repetitive generation, while the absence of a prompt undermines image details. In response, our AccDiffusion v2 novelly decouples the vanilla image-content-aware prompt into a set of patch-content-aware prompts, each of which serves as a more precise description of a patch. Further analysis reveals that local distortion arises from inaccurate descriptions in prompts about the local structure of higher-resolution images. To address this issue, AccDiffusion v2, for the first time, introduces an auxiliary local structural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSeismic Imaging and Inversion Techniques · Advanced Image Processing Techniques
MethodsSparse Evolutionary Training · Diffusion
