Learning Fine-Grained Grounded Citations for Attributed Large Language Models
Lei Huang, Xiaocheng Feng, Weitao Ma, Yuxuan Gu, Weihong Zhong,, Xiachong Feng, Weijiang Yu, Weihua Peng, Duyu Tang, Dandan Tu, Bing Qin

TL;DR
This paper introduces FRONT, a training framework that enables large language models to generate fine-grained, grounded citations, improving response accuracy and verifiability by grounding outputs in supporting quotes.
Contribution
FRONT is a novel training approach that enhances citation quality and enables fine-grained verification in attributed LLMs, surpassing existing methods.
Findings
Significantly improves citation quality by 14.21% on ALCE benchmark.
Outperforms all baselines, including ChatGPT, in generating grounded responses.
Enhances verifiability through fine-grained supporting quotes.
Abstract
Despite the impressive performance on information-seeking tasks, large language models (LLMs) still struggle with hallucinations. Attributed LLMs, which augment generated text with in-line citations, have shown potential in mitigating hallucinations and improving verifiability. However, current approaches suffer from suboptimal citation quality due to their reliance on in-context learning. Furthermore, the practice of citing only coarse document identifiers makes it challenging for users to perform fine-grained verification. In this work, we introduce FRONT, a training framework designed to teach LLMs to generate Fine-Grained Grounded Citations. By grounding model outputs in fine-grained supporting quotes, these quotes guide the generation of grounded and consistent responses, not only improving citation quality but also facilitating fine-grained verification. Experiments on the ALCE…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
