Effective Large Language Model Adaptation for Improved Grounding and   Citation Generation

Xi Ye; Ruoxi Sun; Sercan \"O. Arik; Tomas Pfister

arXiv:2311.09533·cs.CL·April 4, 2024·2 cites

Effective Large Language Model Adaptation for Improved Grounding and Citation Generation

Xi Ye, Ruoxi Sun, Sercan \"O. Arik, Tomas Pfister

PDF

Open Access 1 Video

TL;DR

This paper introduces AGREE, a framework that enhances large language models by grounding their responses in retrieved passages and providing accurate citations, thereby reducing hallucinations and improving factual accuracy.

Contribution

The paper presents a novel tuning-based framework for LLMs that improves grounding and citation accuracy, including a method for automatic data construction and test-time adaptation.

Findings

01

Outperforms prompting-based approaches in grounding accuracy.

02

Generates more accurate citations across multiple datasets.

03

Enables test-time adaptation for improved response quality.

Abstract

Large language models (LLMs) have achieved remarkable advancements in natural language understanding and generation. However, one major issue towards their widespread deployment in the real world is that they can generate "hallucinated" answers that are not factual. Towards this end, this paper focuses on improving LLMs by grounding their responses in retrieved passages and by providing citations. We propose a new framework, AGREE, Adaptation for GRounding EnhancEment, that improves the grounding from a holistic perspective. Our framework tunes LLMs to selfground the claims in their responses and provide accurate citations to retrieved documents. This tuning on top of the pre-trained LLMs requires well-grounded responses (with citations) for paired queries, for which we introduce a method that can automatically construct such data from unlabeled queries. The selfgrounding capability of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Effective Large Language Model Adaptation for Improved Grounding and Citation Generation· underline

Taxonomy

TopicsTopic Modeling