Learning Fine-Grained Grounded Citations for Attributed Large Language   Models

Lei Huang; Xiaocheng Feng; Weitao Ma; Yuxuan Gu; Weihong Zhong,; Xiachong Feng; Weijiang Yu; Weihua Peng; Duyu Tang; Dandan Tu; Bing Qin

arXiv:2408.04568·cs.CL·August 9, 2024

Learning Fine-Grained Grounded Citations for Attributed Large Language Models

Lei Huang, Xiaocheng Feng, Weitao Ma, Yuxuan Gu, Weihong Zhong,, Xiachong Feng, Weijiang Yu, Weihua Peng, Duyu Tang, Dandan Tu, Bing Qin

PDF

Open Access 1 Repo

TL;DR

This paper introduces FRONT, a training framework that enables large language models to generate fine-grained, grounded citations, improving response accuracy and verifiability by grounding outputs in supporting quotes.

Contribution

FRONT is a novel training approach that enhances citation quality and enables fine-grained verification in attributed LLMs, surpassing existing methods.

Findings

01

Significantly improves citation quality by 14.21% on ALCE benchmark.

02

Outperforms all baselines, including ChatGPT, in generating grounded responses.

03

Enhances verifiability through fine-grained supporting quotes.

Abstract

Despite the impressive performance on information-seeking tasks, large language models (LLMs) still struggle with hallucinations. Attributed LLMs, which augment generated text with in-line citations, have shown potential in mitigating hallucinations and improving verifiability. However, current approaches suffer from suboptimal citation quality due to their reliance on in-context learning. Furthermore, the practice of citing only coarse document identifiers makes it challenging for users to perform fine-grained verification. In this work, we introduce FRONT, a training framework designed to teach LLMs to generate Fine-Grained Grounded Citations. By grounding model outputs in fine-grained supporting quotes, these quotes guide the generation of grounded and consistent responses, not only improving citation quality but also facilitating fine-grained verification. Experiments on the ALCE…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

luckyyysta/fine-grained-attribution
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques