BibAgent: An Agentic Framework for Traceable Miscitation Detection in Scientific Literature
Peiran Li, Fangzhou Lin, Shuo Xing, Xiang Zheng, Xi Hong, Siyuan Yang, Jiashuo Sun, Zhengzhong Tu, Chaoqun Ni

TL;DR
BibAgent is a scalable, automated framework that verifies citations in scientific literature by integrating retrieval, reasoning, and evidence aggregation, effectively detecting miscitations across disciplines.
Contribution
It introduces BibAgent, a novel end-to-end agentic system with a new Evidence Committee mechanism and a comprehensive MisciteBench dataset for systematic citation verification.
Findings
Outperforms state-of-the-art LLM baselines in accuracy
Effective in both accessible and paywalled sources
Provides transparent detection of citation misalignments
Abstract
Citations are the bedrock of scientific authority, yet their integrity is compromised by widespread miscitations: ranging from nuanced distortions to fabricated references. Systematic citation verification is currently unfeasible; manual review cannot scale to modern publishing volumes, while existing automated tools are restricted by abstract-only analysis or small-scale, domain-specific datasets in part due to the "paywall barrier" of full-text access. We introduce BibAgent, a scalable, end-to-end agentic framework for automated citation verification. BibAgent integrates retrieval, reasoning, and adaptive evidence aggregation, applying distinct strategies for accessible and paywalled sources. For paywalled references, it leverages a novel Evidence Committee mechanism that infers citation validity via downstream citation consensus. To support systematic evaluation, we contribute a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · scientometrics and bibliometrics research · Scientific Computing and Data Management
