Improving Attributed Text Generation of Large Language Models via   Preference Learning

Dongfang Li; Zetian Sun; Baotian Hu; Zhenyu Liu; Xinshuo Hu; Xuebo; Liu; Min Zhang

arXiv:2403.18381·cs.CL·March 28, 2024·1 cites

Improving Attributed Text Generation of Large Language Models via Preference Learning

Dongfang Li, Zetian Sun, Baotian Hu, Zhenyu Liu, Xinshuo Hu, Xuebo, Liu, Min Zhang

PDF

Open Access

TL;DR

This paper introduces a novel preference learning framework called Automatic Preference Optimization (APO) to improve the credibility and quality of attributed text generation in large language models by modeling citation as a preference task.

Contribution

It proposes a new APO framework, creates a curated dataset, and synthesizes large-scale preference data to enhance attribution in language models.

Findings

01

APO achieves state-of-the-art citation F1 scores.

02

APO improves answer quality in attributed text generation.

03

Synthesized preference data effectively trains the model.

Abstract

Large language models have been widely adopted in natural language processing, yet they face the challenge of generating unreliable content. Recent works aim to reduce misinformation and hallucinations by resorting to attribution as a means to provide evidence (i.e., citations). However, current attribution methods usually focus on the retrieval stage and automatic evaluation that neglect mirroring the citation mechanisms in human scholarly writing to bolster credibility. In this paper, we address these challenges by modelling the attribution task as preference learning and introducing an Automatic Preference Optimization (APO) framework. First, we create a curated collection for post-training with 6,330 examples by collecting and filtering from existing datasets. Second, considering the high cost of labelling preference data, we further propose an automatic method to synthesize…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques

MethodsFocus