EditSleuth: A Dataset of Grounded Reasoning Chains for Image-Edit Forensics

Van-Loc Nguyen; AprilPyone MaungMaung; Minh-Triet Tran; Isao Echizen

arXiv:2605.08695·cs.CV·May 12, 2026

EditSleuth: A Dataset of Grounded Reasoning Chains for Image-Edit Forensics

Van-Loc Nguyen, AprilPyone MaungMaung, Minh-Triet Tran, Isao Echizen

PDF

TL;DR

EditSleuth is a comprehensive dataset for grounded reasoning in AI image forensics, enabling detection, localization, and explanation of image edits with a focus on verifiable visual evidence.

Contribution

The paper introduces EditSleuth, a large dataset with deterministic reasoning chains for grounded image-edit forensics, and demonstrates its utility in training explainable forensic models.

Findings

01

Difficulty scores vary meaningfully within edit categories.

02

Chain supervision matches label-only accuracy but provides explanations.

03

Deterministic construction improves evidence grounding.

Abstract

Forensic analysis of AI-edited images requires more than binary real-versus-fake prediction: a useful system should localize the edit, identify its semantic type, and ground its decisions in visual evidence. Existing image-forensics datasets typically emphasize detection or localization, while reasoning-supervised vision-language datasets rarely target image manipulation and often rely on LLM-generated rationales whose faithfulness is difficult to verify. We introduce EditSleuth, a dataset of 257,725 image-edit triplets constructed from existing image-editing corpora for grounded image-edit forensic reasoning. Each example includes an edited image, its source image, a binary edit mask, a 12-class edit taxonomy label, a difficulty score, and a six-step reasoning chain. EditSleuth chains are generated deterministically from triplet-grounded upstream artifacts, with each statement tied to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.