Understand What LLM Needs: Dual Preference Alignment for   Retrieval-Augmented Generation

Guanting Dong; Yutao Zhu; Chenghao Zhang; Zechen Wang; Zhicheng Dou,; Ji-Rong Wen

arXiv:2406.18676·cs.CL·July 19, 2024·2 cites

Understand What LLM Needs: Dual Preference Alignment for Retrieval-Augmented Generation

Guanting Dong, Yutao Zhu, Chenghao Zhang, Zechen Wang, Zhicheng Dou,, Ji-Rong Wen

PDF

Open Access 1 Repo

TL;DR

This paper introduces DPA-RAG, a universal framework that aligns diverse knowledge preferences in retrieval-augmented generation systems, improving their reliability and performance across multiple datasets.

Contribution

The paper proposes a novel preference alignment framework with data construction and augmentation strategies, enabling both external and internal preference alignment in RAG systems.

Findings

01

DPA-RAG outperforms all baselines on four QA datasets.

02

It effectively integrates black-box and open-source LLM readers.

03

Provides empirical guidance for reliable RAG system development.

Abstract

Retrieval-augmented generation (RAG) has demonstrated effectiveness in mitigating the hallucination problem of large language models (LLMs). However, the difficulty of aligning the retriever with the diverse LLMs' knowledge preferences inevitably poses an inevitable challenge in developing a reliable RAG system. To address this issue, we propose DPA-RAG, a universal framework designed to align diverse knowledge preferences within RAG systems. Specifically, we initially introduce a preference knowledge construction pipline and incorporate five novel query augmentation strategies to alleviate preference data scarcity. Based on preference data, DPA-RAG accomplishes both external and internal preference alignment: 1) It jointly integrate pair-wise, point-wise, and contrastive preference alignment abilities into the reranker, achieving external preference alignment among RAG components. 2)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dongguanting/dpa-rag
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Web Data Mining and Analysis

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Weight Decay · WordPiece · Softmax · Layer Normalization · Linear Warmup With Linear Decay · Byte Pair Encoding · Attention Dropout · Dropout