Learning to identify image manipulations in scientific publications
Ghazal Mazaheri, Kevin Urrutia Avila, Amit K. Roy-Chowdhury

TL;DR
This paper introduces a deep learning-based framework for detecting duplicated images in scientific publications, achieving high accuracy and improving over existing methods to support scientific integrity.
Contribution
It presents a novel combination of image processing and deep learning techniques specifically designed for identifying image duplications in scientific papers.
Findings
90% accuracy in detecting duplicated images
13% improvement over existing manipulation detection methods
Pre-processing steps significantly enhance detection performance
Abstract
Adherence to scientific community standards ensures objectivity, clarity, reproducibility, and helps prevent bias, fabrication, falsification, and plagiarism. To help scientific integrity officers and journal/publisher reviewers monitor if researchers stick with these standards, it is important to have a solid procedure to detect duplication as one of the most frequent types of manipulation in scientific papers. Images in scientific papers are used to support the experimental description and the discussion of the findings. Therefore, in this work we focus on detecting the duplications in images as one of the most important parts of a scientific paper. We propose a framework that combines image processing and deep learning methods to classify images in the articles as duplicated or unduplicated ones. We show that our method leads to a 90% accuracy rate of detecting duplicated images, a ~…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCell Image Analysis Techniques · Digital Media Forensic Detection · Academic integrity and plagiarism
