Are Finer Citations Always Better? Rethinking Granularity for Attributed Generation
Hexuan Wang, Jingyu Zhang, Benjamin Van Durme, Daniel Khashabi

TL;DR
This paper investigates how citation granularity affects attributed generation, revealing that intermediate levels like paragraph citations optimize attribution quality and that fine-grained citations can impair model performance.
Contribution
It demonstrates that citation granularity significantly impacts attribution quality, with optimal levels varying by model scale, challenging the assumption that finer citations are always better.
Findings
Attribution quality peaks at intermediate granularities like paragraph-level.
Fine-grained (sentence-level) citations degrade attribution quality by 16-276%.
Larger models are more negatively affected by fine-grained citation constraints.
Abstract
Citation granularity - whether to cite individual sentences, paragraphs, or documents - is a critical design choice in attributed generation. While fine-grained citations are often preferred for precise human verification, their impact on model performance remains under-explored. We analyze four model scales (8B-120B) and demonstrate that enforcing fine-grained citations degrades attribution quality by 16-276% compared to the best-performing granularity. We observe a consistent performance pattern where attribution quality peaks at intermediate granularities (paragraph-level). Our analysis suggests that fine-grained (sentence-level) citations disrupt necessary semantic dependencies for attributing evidence to answer claims, while excessively coarse citations (multi-paragraph) introduce distracting noise. Importantly, the magnitude of this performance gap varies non-monotonically with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
