EditSleuth: A Dataset of Grounded Reasoning Chains for Image-Edit Forensics
Van-Loc Nguyen, AprilPyone MaungMaung, Minh-Triet Tran, Isao Echizen

TL;DR
EditSleuth is a comprehensive dataset for grounded reasoning in AI image forensics, enabling detection, localization, and explanation of image edits with a focus on verifiable visual evidence.
Contribution
The paper introduces EditSleuth, a large dataset with deterministic reasoning chains for grounded image-edit forensics, and demonstrates its utility in training explainable forensic models.
Findings
Difficulty scores vary meaningfully within edit categories.
Chain supervision matches label-only accuracy but provides explanations.
Deterministic construction improves evidence grounding.
Abstract
Forensic analysis of AI-edited images requires more than binary real-versus-fake prediction: a useful system should localize the edit, identify its semantic type, and ground its decisions in visual evidence. Existing image-forensics datasets typically emphasize detection or localization, while reasoning-supervised vision-language datasets rarely target image manipulation and often rely on LLM-generated rationales whose faithfulness is difficult to verify. We introduce EditSleuth, a dataset of 257,725 image-edit triplets constructed from existing image-editing corpora for grounded image-edit forensic reasoning. Each example includes an edited image, its source image, a binary edit mask, a 12-class edit taxonomy label, a difficulty score, and a six-step reasoning chain. EditSleuth chains are generated deterministically from triplet-grounded upstream artifacts, with each statement tied to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
