TL;DR
CiteGuard is a retrieval-aware framework that improves citation validation accuracy for LLM-generated scientific text, nearing human-level performance and generalizing across domains.
Contribution
The paper introduces CiteGuard, a novel retrieval-augmented method for more faithful citation attribution validation in LLMs, outperforming previous baselines.
Findings
CiteGuard improves citation validation accuracy by 10 percentage points.
Achieves up to 68.1% accuracy on the CiteME benchmark.
Demonstrates cross-domain citation attribution generalization.
Abstract
Large Language Models (LLMs) have emerged as powerful assistants for scientific writing. However, concerns remain about the quality and reliability of the generated text, including citation accuracy and faithfulness. While most recent work relies on methods such as LLM-as-a-Judge, the reliability of LLM-as-a-Judge alone is also in doubt. In this work, we reframe citation evaluation as a problem of citation attribution alignment, which assesses whether LLM-generated citations match those a human author would include for the same text. We propose CiteGuard, a retrieval-aware agent framework designed to provide more faithful grounding for citation validation. CiteGuard improves over the prior baseline by 10 percentage points and achieves up to 68.1% accuracy on the CiteME benchmark, approaching human performance (69.2%). It also identifies alternative valid citations and demonstrates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
