SOEDiff: Efficient Distillation for Small Object Editing
Yiming Wu, Qihe Pan, Zhen Zhao, Zicheng Wang, Sifan Long, Ronghua, Liang

TL;DR
This paper introduces SOEDiff, a training-based method that enhances small object editing in images by fine-tuning diffusion models with minimal training costs, significantly improving quality and accuracy.
Contribution
We propose SOEDiff, a novel approach combining SO-LoRA and Cross-Scale Score Distillation to improve small object editing in diffusion models with efficient training.
Findings
Improved CLIP-Score by 0.99 on OpenImage-f dataset.
Reduced FID by 2.87, indicating higher image quality.
Validated effectiveness on MSCOCO and OpenImage datasets.
Abstract
In this paper, we delve into a new task known as small object editing (SOE), which focuses on text-based image inpainting within a constrained, small-sized area. Despite the remarkable success have been achieved by current image inpainting approaches, their application to the SOE task generally results in failure cases such as Object Missing, Text-Image Mismatch, and Distortion. These failures stem from the limited use of small-sized objects in training datasets and the downsampling operations employed by U-Net models, which hinders accurate generation. To overcome these challenges, we introduce a novel training-based approach, SOEDiff, aimed at enhancing the capability of baseline models like StableDiffusion in editing small-sized objects while minimizing training costs. Specifically, our method involves two key components: SO-LoRA, which efficiently fine-tunes low-rank matrices, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModular Robots and Swarm Intelligence · Robotics and Automated Systems · Advanced Data Storage Technologies
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Concatenated Skip Connection · Max Pooling · Inpainting · Convolution · U-Net · Diffusion
