Rescind: Countering Image Misconduct in Biomedical Publications with Vision-Language and State-Space Modeling
Soumyaroop Nandi, Prem Natarajan

TL;DR
This paper introduces Rescind, a novel framework combining vision-language models and state-space techniques to generate and detect biomedical image forgeries, enhancing research integrity.
Contribution
It presents the first vision-language guided approach for biomedical image forgery detection and introduces Rescind, a comprehensive benchmark with a new structured localization framework.
Findings
Integscan achieves state-of-the-art detection accuracy.
Rescind provides detailed annotations for biomedical images.
The method effectively localizes diverse forgery types.
Abstract
Scientific image manipulation in biomedical publications poses a growing threat to research integrity and reproducibility. Unlike natural image forensics, biomedical forgery detection is uniquely challenging due to domain-specific artifacts, complex textures, and unstructured figure layouts. We present the first vision-language guided framework for both generating and detecting biomedical image forgeries. By combining diffusion-based synthesis with vision-language prompting, our method enables realistic and semantically controlled manipulations, including duplication, splicing, and region removal, across diverse biomedical modalities. We introduce Rescind, a large-scale benchmark featuring fine-grained annotations and modality-specific splits, and propose Integscan, a structured state space modeling framework that integrates attention-enhanced visual encoding with prompt-conditioned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsDigital Media Forensic Detection · Cell Image Analysis Techniques · Generative Adversarial Networks and Image Synthesis
