DEAL-300K: Diffusion-based Editing Area Localization with a 300K-Scale Dataset and Frequency-Prompted Baseline
Rui Zhang, Hongxia Wang, Hangqing Liu, Yang Zhou, Qiang Zeng

TL;DR
This paper introduces DEAL-300K, a large-scale dataset for diffusion-based image manipulation localization, and proposes a frequency-aware localization method that achieves high accuracy, facilitating detection of realistic local forgeries.
Contribution
The paper presents the first extensive dataset for diffusion-based image editing localization and a novel frequency-prompted baseline method leveraging a frozen visual foundation model.
Findings
Achieved 82.56% pixel-level F1 score on DEAL-300K test set.
Outperformed existing methods on external CoCoGlide benchmark.
Provided a practical foundation for future diffusion-based image manipulation localization research.
Abstract
Diffusion-based image editing has made semantic level image manipulation easy for general users, but it also enables realistic local forgeries that are hard to localize. Existing benchmarks mainly focus on the binary detection of generated images or the localization of manually edited regions and do not reflect the properties of diffusion-based edits, which often blend smoothly into the original content. We present Diffusion-Based Image Editing Area Localization Dataset (DEAL-300K), a large scale dataset for diffusion-based image manipulation localization (DIML) with more than 300,000 annotated images. We build DEAL-300K by using a multi-modal large language model to generate editing instructions, a mask-free diffusion editor to produce manipulated images, and an active-learning change detection pipeline to obtain pixel-level annotations. On top of this dataset, we propose a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Cell Image Analysis Techniques
