SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based   Image Editing

Ming Li; Xin Gu; Fan Chen; Xiaoying Xing; Longyin Wen; Chen Chen,; Sijie Zhu

arXiv:2505.02370·cs.CV·May 6, 2025

SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing

Ming Li, Xin Gu, Fan Chen, Xiaoying Xing, Longyin Wen, Chen Chen,, Sijie Zhu

PDF

Open Access 1 Repo 1 Datasets

TL;DR

SuperEdit introduces a novel approach to improve instruction-based image editing by rectifying and contrastively supervising editing instructions, significantly enhancing performance without relying on vision-language model pre-training.

Contribution

The paper proposes a new method for constructing effective editing instructions through rectification and contrastive supervision, eliminating the need for VLM pre-training and achieving superior results.

Findings

01

Achieves 9.19% improvement on Real-Edit benchmark

02

Uses 30x less training data than previous methods

03

Employs a smaller model size while outperforming SOTA

Abstract

Due to the challenges of manually collecting accurate editing data, existing datasets are typically constructed using various automated methods, leading to noisy supervision signals caused by the mismatch between editing instructions and original-edited image pairs. Recent efforts attempt to improve editing models through generating higher-quality edited images, pre-training on recognition tasks, or introducing vision-language models (VLMs) but fail to resolve this fundamental issue. In this paper, we offer a novel solution by constructing more effective editing instructions for given image pairs. This includes rectifying the editing instructions to better align with the original-edited image pairs and using contrastive editing instructions to further enhance their effectiveness. Specifically, we find that editing models exhibit specific generation attributes at different inference…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bytedance/superedit
pytorchOfficial

Datasets

limingcv/SuperEdit-40K
dataset· 317 dl
317 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReflective Practices in Education

MethodsALIGN