SINE: SINgle Image Editing with Text-to-Image Diffusion Models

Zhixing Zhang; Ligong Han; Arnab Ghosh; Dimitris Metaxas; Jian Ren

arXiv:2212.04489·cs.CV·April 1, 2025·6 cites

SINE: SINgle Image Editing with Text-to-Image Diffusion Models

Zhixing Zhang, Ligong Han, Arnab Ghosh, Dimitris Metaxas, Jian Ren

PDF

Open Access 1 Repo

TL;DR

SINE introduces a novel single-image editing method using diffusion models, leveraging model-based guidance and patch-based fine-tuning to enable diverse and high-resolution edits without overfitting.

Contribution

It proposes a new guidance technique and patch-based fine-tuning for effective single-image editing with diffusion models, overcoming overfitting and content leakage issues.

Findings

01

Effective style and content editing demonstrated

02

High-resolution image generation achieved

03

Model-guided approach outperforms fine-tuning methods

Abstract

Recent works on diffusion models have demonstrated a strong capability for conditioning image generation, e.g., text-guided image synthesis. Such success inspires many efforts trying to use large-scale pre-trained diffusion models for tackling a challenging problem--real image editing. Works conducted in this area learn a unique textual token corresponding to several images containing the same object. However, under many circumstances, only one image is available, such as the painting of the Girl with a Pearl Earring. Using existing works on fine-tuning the pre-trained diffusion models with a single image causes severe overfitting issues. The information leakage from the pre-trained diffusion models makes editing can not keep the same content as the given image while creating new features depicted by the language guidance. This work aims to address the problem of single-image editing.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhang-zx/sine
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications

MethodsDiffusion