PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data

Shijie Huang; Yiren Song; Yuxuan Zhang; Hailong Guo; Xueyin Wang; Mike; Zheng Shou; Jiaming Liu

arXiv:2502.14397·cs.CV·February 25, 2025

PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data

Shijie Huang, Yiren Song, Yuxuan Zhang, Hailong Guo, Xueyin Wang, Mike, Zheng Shou, Jiaming Liu

PDF

Open Access 1 Repo

TL;DR

PhotoDoodle is a new image editing framework that enables artists to seamlessly overlay decorative elements onto photos by learning from limited paired data, capturing artistic styles efficiently.

Contribution

It introduces a two-stage training approach combining large-scale pretraining with fine-tuning on small datasets, and includes a new dataset for artistic image editing.

Findings

01

Outperforms existing methods in artistic image editing tasks

02

Effectively captures and reproduces artist-specific styles with limited data

03

Demonstrates robustness and versatility across multiple artistic styles

Abstract

We introduce PhotoDoodle, a novel image editing framework designed to facilitate photo doodling by enabling artists to overlay decorative elements onto photographs. Photo doodling is challenging because the inserted elements must appear seamlessly integrated with the background, requiring realistic blending, perspective alignment, and contextual coherence. Additionally, the background must be preserved without distortion, and the artist's unique style must be captured efficiently from limited training data. These requirements are not addressed by previous methods that primarily focus on global style transfer or regional inpainting. The proposed method, PhotoDoodle, employs a two-stage training strategy. Initially, we train a general-purpose image editing model, OmniEditor, using large-scale data. Subsequently, we fine-tune this model with EditLoRA using a small, artist-curated dataset…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

showlab/PhotoDoodle
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis

MethodsFocus