SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing
Ming Li, Xin Gu, Fan Chen, Xiaoying Xing, Longyin Wen, Chen Chen,, Sijie Zhu

TL;DR
SuperEdit introduces a novel approach to improve instruction-based image editing by rectifying and contrastively supervising editing instructions, significantly enhancing performance without relying on vision-language model pre-training.
Contribution
The paper proposes a new method for constructing effective editing instructions through rectification and contrastive supervision, eliminating the need for VLM pre-training and achieving superior results.
Findings
Achieves 9.19% improvement on Real-Edit benchmark
Uses 30x less training data than previous methods
Employs a smaller model size while outperforming SOTA
Abstract
Due to the challenges of manually collecting accurate editing data, existing datasets are typically constructed using various automated methods, leading to noisy supervision signals caused by the mismatch between editing instructions and original-edited image pairs. Recent efforts attempt to improve editing models through generating higher-quality edited images, pre-training on recognition tasks, or introducing vision-language models (VLMs) but fail to resolve this fundamental issue. In this paper, we offer a novel solution by constructing more effective editing instructions for given image pairs. This includes rectifying the editing instructions to better align with the original-edited image pairs and using contrastive editing instructions to further enhance their effectiveness. Specifically, we find that editing models exhibit specific generation attributes at different inference…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReflective Practices in Education
MethodsALIGN
