UnicEdit-10M: A Dataset and Benchmark Breaking the Scale-Quality Barrier via Unified Verification for Reasoning-Enriched Edits

Keming Ye; Zhipeng Huang; Canmiao Fu; Qingyang Liu; Jiani Cai; Zheqi Lv; Chen Li; Jing Lyu; Zhou Zhao; Shengyu Zhang

arXiv:2512.02790·cs.CV·December 3, 2025

UnicEdit-10M: A Dataset and Benchmark Breaking the Scale-Quality Barrier via Unified Verification for Reasoning-Enriched Edits

Keming Ye, Zhipeng Huang, Canmiao Fu, Qingyang Liu, Jiani Cai, Zheqi Lv, Chen Li, Jing Lyu, Zhou Zhao, Shengyu Zhang

PDF

Open Access 2 Datasets

TL;DR

This paper introduces UnicEdit-10M, a large-scale high-quality dataset and UnicBench, a benchmark for reasoning-enriched image editing, addressing the scale-quality trade-off with a unified verification pipeline and a specialized expert model.

Contribution

It presents a scalable data construction pipeline, a 7B expert verification model, and a comprehensive benchmark with new metrics for reasoning in image editing tasks.

Findings

01

Mainstream models show significant limitations on UnicBench.

02

The verification pipeline effectively filters high-quality data.

03

New metrics reveal detailed reasoning capabilities of models.

Abstract

With the rapid advances of powerful multimodal models such as GPT-4o, Nano Banana, and Seedream 4.0 in Image Editing, the performance gap between closed-source and open-source models is widening, primarily due to the scarcity of large-scale, high-quality training data and comprehensive benchmarks capable of diagnosing model weaknesses across diverse editing behaviors. Existing data construction methods face a scale-quality trade-off: human annotations are high-quality but not scalable, while automated pipelines suffer from error propagation and noise. To address this, we introduce a lightweight data pipeline that replaces multi-toolchains with an end-to-end model and a unified post-verification stage. For scalable quality control, we train a 7B dual-task expert model, \textbf{Qwen-Verify}, for efficient failure detection and instruction recaptioning. This pipeline yields…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI) · Cell Image Analysis Techniques