MedForge: Interpretable Medical Deepfake Detection via Forgery-aware Reasoning

Zhihui Chen; Kai He; Qingyuan Lei; Bin Pu; Jian Zhang; Yuling Xu; Mengling Feng

arXiv:2603.18577·cs.AI·March 20, 2026

MedForge: Interpretable Medical Deepfake Detection via Forgery-aware Reasoning

Zhihui Chen, Kai He, Qingyuan Lei, Bin Pu, Jian Zhang, Yuling Xu, Mengling Feng

PDF

Open Access

TL;DR

MedForge introduces a new framework for detecting medical image forgeries that combines large-scale realistic lesion editing, expert-guided reasoning, and a localized analysis approach to improve accuracy and trustworthiness.

Contribution

The paper presents MedForge, a novel pre-hoc detection method with a large benchmark and a reasoning model that localizes suspicious regions before classifying, enhancing interpretability and reliability in medical forgery detection.

Findings

01

Achieves state-of-the-art detection accuracy.

02

Provides trustworthy, expert-aligned explanations.

03

Reduces hallucinations in forgery reasoning.

Abstract

Text-guided image editors can now manipulate authentic medical scans with high fidelity, enabling lesion implantation/removal that threatens clinical trust and safety. Existing defenses are inadequate for healthcare. Medical detectors are largely black-box, while MLLM-based explainers are typically post-hoc, lack medical expertise, and may hallucinate evidence on ambiguous cases. We present MedForge, a data-and-method solution for pre-hoc, evidence-grounded medical forgery detection. We introduce MedForge-90K, a large-scale benchmark of realistic lesion edits across 19 pathologies with expert-guided reasoning supervision via doctor inspection guidelines and gold edit locations. Building on it, MedForge-Reasoner performs localize-then-analyze reasoning, predicting suspicious regions before producing a verdict, and is further aligned with Forgery-aware GSPO to strengthen grounding and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning