PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization

Jiayi Wu; Hengyi Cai; Lingyong Yan; Hao Sun; Xiang Li; Shuaiqiang; Wang; Dawei Yin; Ming Gao

arXiv:2412.14510·cs.CL·December 20, 2024

PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization

Jiayi Wu, Hengyi Cai, Lingyong Yan, Hao Sun, Xiang Li, Shuaiqiang, Wang, Dawei Yin, Ming Gao

PDF

Open Access 1 Repo 3 Models 1 Datasets 1 Video

TL;DR

PA-RAG introduces a multi-perspective preference optimization method to improve retrieval-augmented generation, enhancing informativeness, robustness, and citation quality of large language models through fine-tuning and preference alignment.

Contribution

It proposes a novel multi-perspective preference alignment approach for RAG, combining SFT and DPO to better align generators with RAG requirements.

Findings

01

Significant performance improvements on four QA datasets.

02

Effective alignment with RAG requirements across multiple LLMs.

03

Enhanced response quality in informativeness and robustness.

Abstract

The emergence of Retrieval-augmented generation (RAG) has alleviated the issues of outdated and hallucinatory content in the generation of large language models (LLMs), yet it still reveals numerous limitations. When a general-purpose LLM serves as the RAG generator, it often suffers from inadequate response informativeness, response robustness, and citation quality. Past approaches to tackle these limitations, either by incorporating additional steps beyond generating responses or optimizing the generator through supervised fine-tuning (SFT), still failed to align with the RAG requirement thoroughly. Consequently, optimizing the RAG generator from multiple preference perspectives while maintaining its end-to-end LLM form remains a challenge. To bridge this gap, we propose Multiple Perspective Preference Alignment for Retrieval-Augmented Generation (PA-RAG), a method for optimizing the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wujwyi/pa-rag
pytorchOfficial

Models

Datasets

wuqiong1/PA-RAG_training_data
dataset· 46 dl
46 dl

Videos

PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization· underline

Taxonomy

TopicsManufacturing Process and Optimization · Industrial Technology and Control Systems

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Multi-Head Attention · Residual Connection · Adam · Layer Normalization · Weight Decay · Softmax · WordPiece