ImgEdit: A Unified Image Editing Dataset and Benchmark
Yang Ye, Xianyi He, Zongjian Li, Bin Lin, Shenghai Yuan, Zhiyuan Yan, Bohan Hou, Li Yuan

TL;DR
ImgEdit introduces a large, high-quality dataset and benchmark for image editing, enabling significant improvements in open-source image editing models through comprehensive data and evaluation tools.
Contribution
The paper presents ImgEdit, a new extensive dataset and benchmark for image editing, along with a novel editing model, ImgEdit-E1, and a comprehensive evaluation suite.
Findings
ImgEdit dataset surpasses existing datasets in quality and task diversity.
ImgEdit-E1 outperforms existing open-source models on multiple editing tasks.
The benchmark provides detailed insights into current image editing model capabilities.
Abstract
Recent advancements in generative models have enabled high-fidelity text-to-image generation. However, open-source image-editing models still lag behind their proprietary counterparts, primarily due to limited high-quality data and insufficient benchmarks. To overcome these limitations, we introduce ImgEdit, a large-scale, high-quality image-editing dataset comprising 1.2 million carefully curated edit pairs, which contain both novel and complex single-turn edits, as well as challenging multi-turn tasks. To ensure the data quality, we employ a multi-stage pipeline that integrates a cutting-edge vision-language model, a detection model, a segmentation model, alongside task-specific in-painting procedures and strict post-processing. ImgEdit surpasses existing datasets in both task novelty and data quality. Using ImgEdit, we train ImgEdit-E1, an editing model using Vision Language Model to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Image Retrieval and Classification Techniques · Single-cell and spatial transcriptomics
