AI Detectors are Poor Western Blot Classifiers: A Study of Accuracy and Predictive Values
Romain-Daniel Gosselin

TL;DR
This study evaluates three free AI detectors for identifying AI-generated Western blot images, revealing their poor accuracy and predictive values, and emphasizing the need for specialized detection tools in scientific image verification.
Contribution
It provides a systematic assessment of existing free AI detectors' effectiveness on scientific Western blot images, highlighting their limitations and the necessity for specialized detection methods.
Findings
Detectors show high sensitivity but low specificity.
Positive predictive values are very low at realistic AI prevalence.
Reducing image size slightly improves some detection metrics.
Abstract
The recent rise of generative artificial intelligence (GenAI) capable of creating scientific images presents a challenge in the fight against academic fraud. This study evaluates the efficacy of three free web-based AI detectors in identifying AI-generated images of Western blots, which is a very common technique in biology. We tested these detectors on a collection of artificial Western blot images (n=48) that were created using ChatGPT 4 DALLE 3 and on authentic Western blots (n=48) that were sampled from articles published within four biology journals in 2015; this was before the rise of generative AI based on large language models. The results reveal that the sensitivity (0.9583 for Is It AI, 0.1875 for Hive Moderation, and 0.7083 for Illuminarty) and specificity (0.5417 for Is It AI, 0.8750 for Hive Moderation, and 0.4167 for Illuminarty) are very different. Positive predictive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImpact of AI and Big Data on Business and Society · Machine Learning and Data Classification
