CiteGuard: Faithful Citation Attribution for LLMs via Retrieval-Augmented Validation

Yee Man Choi; Xuehang Guo; Yi R. Fung; Qingyun Wang

arXiv:2510.17853·cs.DL·April 14, 2026

CiteGuard: Faithful Citation Attribution for LLMs via Retrieval-Augmented Validation

Yee Man Choi, Xuehang Guo, Yi R. Fung, Qingyun Wang

PDF

1 Repo

TL;DR

CiteGuard is a retrieval-aware framework that improves citation validation accuracy for LLM-generated scientific text, nearing human-level performance and generalizing across domains.

Contribution

The paper introduces CiteGuard, a novel retrieval-augmented method for more faithful citation attribution validation in LLMs, outperforming previous baselines.

Findings

01

CiteGuard improves citation validation accuracy by 10 percentage points.

02

Achieves up to 68.1% accuracy on the CiteME benchmark.

03

Demonstrates cross-domain citation attribution generalization.

Abstract

Large Language Models (LLMs) have emerged as powerful assistants for scientific writing. However, concerns remain about the quality and reliability of the generated text, including citation accuracy and faithfulness. While most recent work relies on methods such as LLM-as-a-Judge, the reliability of LLM-as-a-Judge alone is also in doubt. In this work, we reframe citation evaluation as a problem of citation attribution alignment, which assesses whether LLM-generated citations match those a human author would include for the same text. We propose CiteGuard, a retrieval-aware agent framework designed to provide more faithful grounding for citation validation. CiteGuard improves over the prior baseline by 10 percentage points and achieves up to 68.1% accuracy on the CiteME benchmark, approaching human performance (69.2%). It also identifies alternative valid citations and demonstrates…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

KathCYM/CiteGuard
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.