IE-Bench: Advancing the Measurement of Text-Driven Image Editing for   Human Perception Alignment

Shangkun Sun; Bowen Qu; Xiaoyu Liang; Songlin Fan; Wei Gao

arXiv:2501.09927·cs.CV·January 20, 2025

IE-Bench: Advancing the Measurement of Text-Driven Image Editing for Human Perception Alignment

Shangkun Sun, Bowen Qu, Xiaoyu Liang, Songlin Fan, Wei Gao

PDF

Open Access

TL;DR

This paper introduces IE-Bench, a new benchmark and assessment method for evaluating text-driven image editing, aligning better with human perception and providing a standardized way to measure editing quality.

Contribution

The paper presents the first IQA dataset and model specifically designed for text-driven image editing, improving evaluation accuracy and human perception alignment.

Findings

01

IE-QA outperforms previous metrics in subjective alignment

02

IE-Bench includes 3,010 MOS scores from 25 human subjects

03

The dataset covers diverse images and editing prompts

Abstract

Recent advances in text-driven image editing have been significant, yet the task of accurately evaluating these edited images continues to pose a considerable challenge. Different from the assessment of text-driven image generation, text-driven image editing is characterized by simultaneously conditioning on both text and a source image. The edited images often retain an intrinsic connection to the original image, which dynamically change with the semantics of the text. However, previous methods tend to solely focus on text-image alignment or have not aligned with human perception. In this work, we introduce the Text-driven Image Editing Benchmark suite (IE-Bench) to enhance the assessment of text-driven edited images. IE-Bench includes a database contains diverse source images, various editing prompts and the corresponding results different editing methods, and total 3,010 Mean Opinion…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques

MethodsFocus