EditVal: Benchmarking Diffusion Based Text-Guided Image Editing Methods
Samyadeep Basu, Mehrdad Saberi, Shweta Bhardwaj, Atoosa Malemir, Chegini, Daniela Massiceti, Maziar Sanjabi, Shell Xu Hu, Soheil Feizi

TL;DR
EditVal introduces a standardized benchmark for evaluating diffusion-based text-guided image editing methods, enabling fair comparison and analysis of their fidelity, robustness, and ability to preserve original image properties across diverse edit types.
Contribution
The paper presents EditVal, a comprehensive benchmark with an evaluation pipeline and dataset, to systematically assess and compare state-of-the-art image editing methods.
Findings
Instruct-Pix2Pix and Null-Text best preserve original image properties.
Most methods struggle with spatial edits like object repositioning.
No single method outperforms others across all edit types.
Abstract
A plethora of text-guided image editing methods have recently been developed by leveraging the impressive capabilities of large-scale diffusion-based generative models such as Imagen and Stable Diffusion. A standardized evaluation protocol, however, does not exist to compare methods across different types of fine-grained edits. To address this gap, we introduce EditVal, a standardized benchmark for quantitatively evaluating text-guided image editing methods. EditVal consists of a curated dataset of images, a set of editable attributes for each image drawn from 13 possible edit types, and an automated evaluation pipeline that uses pre-trained vision-language models to assess the fidelity of generated images for each edit type. We use EditVal to benchmark 8 cutting-edge diffusion-based editing methods including SINE, Imagic and Instruct-Pix2Pix. We complement this with a large-scale human…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCell Image Analysis Techniques
MethodsDiffusion
