EditVal: Benchmarking Diffusion Based Text-Guided Image Editing Methods

Samyadeep Basu; Mehrdad Saberi; Shweta Bhardwaj; Atoosa Malemir; Chegini; Daniela Massiceti; Maziar Sanjabi; Shell Xu Hu; Soheil Feizi

arXiv:2310.02426·cs.CV·October 5, 2023·5 cites

EditVal: Benchmarking Diffusion Based Text-Guided Image Editing Methods

Samyadeep Basu, Mehrdad Saberi, Shweta Bhardwaj, Atoosa Malemir, Chegini, Daniela Massiceti, Maziar Sanjabi, Shell Xu Hu, Soheil Feizi

PDF

Open Access

TL;DR

EditVal introduces a standardized benchmark for evaluating diffusion-based text-guided image editing methods, enabling fair comparison and analysis of their fidelity, robustness, and ability to preserve original image properties across diverse edit types.

Contribution

The paper presents EditVal, a comprehensive benchmark with an evaluation pipeline and dataset, to systematically assess and compare state-of-the-art image editing methods.

Findings

01

Instruct-Pix2Pix and Null-Text best preserve original image properties.

02

Most methods struggle with spatial edits like object repositioning.

03

No single method outperforms others across all edit types.

Abstract

A plethora of text-guided image editing methods have recently been developed by leveraging the impressive capabilities of large-scale diffusion-based generative models such as Imagen and Stable Diffusion. A standardized evaluation protocol, however, does not exist to compare methods across different types of fine-grained edits. To address this gap, we introduce EditVal, a standardized benchmark for quantitatively evaluating text-guided image editing methods. EditVal consists of a curated dataset of images, a set of editable attributes for each image drawn from 13 possible edit types, and an automated evaluation pipeline that uses pre-trained vision-language models to assess the fidelity of generated images for each edit type. We use EditVal to benchmark 8 cutting-edge diffusion-based editing methods including SINE, Imagic and Instruct-Pix2Pix. We complement this with a large-scale human…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCell Image Analysis Techniques

MethodsDiffusion