Pub-Guard-LLM: Detecting Retracted Biomedical Articles with Reliable Explanations

Lihu Chen; Shuojie Fu; Gabriel Freedman; Cemre Zor; Guy Martin; James Kinross; Uddhav Vaghela; Ovidiu Serban; Francesca Toni

arXiv:2502.15429·cs.CL·August 22, 2025

Pub-Guard-LLM: Detecting Retracted Biomedical Articles with Reliable Explanations

Lihu Chen, Shuojie Fu, Gabriel Freedman, Cemre Zor, Guy Martin, James Kinross, Uddhav Vaghela, Ovidiu Serban, Francesca Toni

PDF

Open Access 1 Repo 2 Models

TL;DR

Pub-Guard-LLM is a novel large language model-based system designed to detect retracted biomedical articles, offering multiple application modes with reliable explanations, and evaluated on a comprehensive open-source benchmark showing superior performance over baselines.

Contribution

The paper introduces Pub-Guard-LLM, the first LLM-based system for biomedical fraud detection with three deployment modes and an open benchmark, enhancing detection accuracy and explainability.

Findings

01

Pub-Guard-LLM outperforms baselines across all modes.

02

It provides more relevant and coherent explanations.

03

The system is open-source and applicable to real-world data.

Abstract

A significant and growing number of published scientific articles is found to involve fraudulent practices, posing a serious threat to the credibility and safety of research in fields such as medicine. We propose Pub-Guard-LLM, the first large language model-based system tailored to fraud detection of biomedical scientific articles. We provide three application modes for deploying Pub-Guard-LLM: vanilla reasoning, retrieval-augmented generation, and multi-agent debate. Each mode allows for textual explanations of predictions. To assess the performance of our system, we introduce an open-source benchmark, PubMed Retraction, comprising over 11K real-world biomedical articles, including metadata and retraction labels. We show that, across all modes, Pub-Guard-LLM consistently surpasses the performance of various baselines and provides more reliable explanations, namely explanations which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tigerchen52/pub_guard_llm
pytorchOfficial

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAcademic integrity and plagiarism · Biomedical Text Mining and Ontologies · Misinformation and Its Impacts