IE-Bench: Advancing the Measurement of Text-Driven Image Editing for Human Perception Alignment
Shangkun Sun, Bowen Qu, Xiaoyu Liang, Songlin Fan, Wei Gao

TL;DR
This paper introduces IE-Bench, a new benchmark and assessment method for evaluating text-driven image editing, aligning better with human perception and providing a standardized way to measure editing quality.
Contribution
The paper presents the first IQA dataset and model specifically designed for text-driven image editing, improving evaluation accuracy and human perception alignment.
Findings
IE-QA outperforms previous metrics in subjective alignment
IE-Bench includes 3,010 MOS scores from 25 human subjects
The dataset covers diverse images and editing prompts
Abstract
Recent advances in text-driven image editing have been significant, yet the task of accurately evaluating these edited images continues to pose a considerable challenge. Different from the assessment of text-driven image generation, text-driven image editing is characterized by simultaneously conditioning on both text and a source image. The edited images often retain an intrinsic connection to the original image, which dynamically change with the semantics of the text. However, previous methods tend to solely focus on text-image alignment or have not aligned with human perception. In this work, we introduce the Text-driven Image Editing Benchmark suite (IE-Bench) to enhance the assessment of text-driven edited images. IE-Bench includes a database contains diverse source images, various editing prompts and the corresponding results different editing methods, and total 3,010 Mean Opinion…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques
MethodsFocus
