Learning to Generate Answers with Citations via Factual Consistency Models

Rami Aly; Zhiqiang Tang; Samson Tan; George Karypis

arXiv:2406.13124 · cs.CL · July 16, 2024

TL;DR

This paper introduces a weakly-supervised fine-tuning method using factual consistency models to improve citation accuracy in language models, significantly reducing factual errors and enhancing verifiability of generated answers.

Contribution

The paper presents a novel weakly-supervised fine-tuning approach that leverages factual consistency models to improve citation accuracy in language models.

Findings

01

Achieves a 34.1-point improvement in citation F1 on the ALCE benchmark.

02

Demonstrates robust transfer of citation generation to unseen datasets.

03

Reduces the factual error rate of generated answers.
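
For readers unfamiliar with the metric, the sketch below assumes citation F1 is the standard harmonic mean of citation precision and citation recall. The function name and the example values are illustrative and are not taken from the paper.

```python
def citation_f1(precision: float, recall: float) -> float:
    """Harmonic mean of citation precision and recall (standard F1 definition).

    Assumption: both inputs are on the same scale (e.g. 0-1 or 0-100).
    """
    if precision + recall == 0:
        return 0.0  # avoid division by zero when nothing is cited correctly
    return 2 * precision * recall / (precision + recall)


# Illustrative values only: perfect precision with half the recall.
example_score = citation_f1(1.0, 0.5)
```

Because the harmonic mean punishes imbalance, a model that cites sparingly but accurately still scores well below one that is strong on both axes.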

Abstract

Large Language Models (LLMs) frequently hallucinate, impeding their reliability in mission-critical situations. One approach to address this issue is to provide citations to relevant sources alongside generated content, enhancing the verifiability of generations. However, citing passages accurately in answers remains a substantial challenge. This paper proposes a weakly-supervised fine-tuning method leveraging factual consistency models (FCMs). Our approach alternates between generating texts with citations and supervised fine-tuning with FCM-filtered citation data. Focused learning is integrated into the objective, directing the fine-tuning process to emphasise the factual unit tokens, as measured by an FCM. Results on the ALCE few-shot citation benchmark with various instruction-tuned LLMs demonstrate superior performance compared to in-context learning, vanilla supervised…
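
The alternating procedure and the focused-learning objective described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the names `filter_with_fcm`, `focused_nll`, and `alternate`, the score threshold, the number of rounds, and the use of per-token weights normalized by their sum are all hypothetical choices, and `generate`, `fine_tune`, and `fcm_score` stand in for a real LLM and a real factual consistency model.

```python
import math
from typing import Callable, List, Sequence, Tuple

# Hypothetical example type: (question, retrieved passages, answer with citations).
Example = Tuple[str, List[str], str]


def filter_with_fcm(
    candidates: Sequence[Example],
    fcm_score: Callable[[Example], float],
    threshold: float = 0.5,  # assumed cut-off, not from the paper
) -> List[Example]:
    """Keep only generations the FCM judges consistent with their cited passages."""
    return [ex for ex in candidates if fcm_score(ex) >= threshold]


def focused_nll(token_log_probs: Sequence[float], token_weights: Sequence[float]) -> float:
    """Weighted negative log-likelihood.

    Assumption: each weight reflects how much the FCM attributes factual
    content to that token; the loss is normalized by the total weight.
    """
    total = sum(token_weights)
    if total == 0:
        return 0.0
    weighted = sum(w * lp for w, lp in zip(token_weights, token_log_probs))
    return -weighted / total


def alternate(
    generate: Callable[[str], Example],
    fine_tune: Callable[[List[Example]], None],
    fcm_score: Callable[[Example], float],
    questions: Sequence[str],
    rounds: int = 2,  # assumed number of alternations
    threshold: float = 0.5,
) -> List[int]:
    """Alternate between generation and fine-tuning on FCM-filtered data.

    Returns how many examples survived filtering in each round.
    """
    kept_per_round = []
    for _ in range(rounds):
        candidates = [generate(q) for q in questions]                # 1. generate with citations
        kept = filter_with_fcm(candidates, fcm_score, threshold)     # 2. filter with the FCM
        fine_tune(kept)                                              # 3. supervised fine-tuning
        kept_per_round.append(len(kept))
    return kept_per_round
```

In a real setup, `fine_tune` would minimize something like `focused_nll` over the kept examples, so that tokens carrying factual content contribute more to the gradient than connective or stylistic tokens.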

Taxonomy

Topics: Topic Modeling · Advanced Text Analysis Techniques · Software Engineering Research

Methods: Consistency Models