Refine-by-Align: Reference-Guided Artifacts Refinement through Semantic Alignment
Yizhi Song, Liu He, Zhifei Zhang, Soo Ye Kim, He Zhang, Wei Xiong, Zhe, Lin, Brian Price, Scott Cohen, Jianming Zhang, Daniel Aliaga

TL;DR
Refine-by-Align is a diffusion-based model that automatically refines localized artifacts in personalized images using reference images, improving fidelity and identity details without test-time tuning.
Contribution
This work introduces the first reference-guided artifacts refinement method employing a diffusion framework with a two-stage alignment and refinement process.
Findings
Significantly improves image fidelity and identity details.
Generalizes across various image synthesis tasks.
No test-time tuning required.
Abstract
Personalized image generation has emerged from the recent advancements in generative models. However, these generated personalized images often suffer from localized artifacts such as incorrect logos, reducing fidelity and fine-grained identity details of the generated results. Furthermore, there is little prior work tackling this problem. To help improve these identity details in the personalized image generation, we introduce a new task: reference-guided artifacts refinement. We present Refine-by-Align, a first-of-its-kind model that employs a diffusion-based framework to address this challenge. Our model consists of two stages: Alignment Stage and Refinement Stage, which share weights of a unified neural network model. Given a generated image, a masked artifact region, and a reference image, the alignment stage identifies and extracts the corresponding regional features in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing and 3D Reconstruction · Natural Language Processing Techniques · Handwritten Text Recognition Techniques
