Direct Retrieval-augmented Optimization: Synergizing Knowledge Selection   and Language Models

Zhengliang Shi; Lingyong Yan; Weiwei Sun; Yue Feng; Pengjie Ren; Xinyu; Ma; Shuaiqiang Wang; Dawei Yin; Maarten de Rijke; Zhaochun Ren

arXiv:2505.03075·cs.IR·May 7, 2025

Direct Retrieval-augmented Optimization: Synergizing Knowledge Selection and Language Models

Zhengliang Shi, Lingyong Yan, Weiwei Sun, Yue Feng, Pengjie Ren, Xinyu, Ma, Shuaiqiang Wang, Dawei Yin, Maarten de Rijke, Zhaochun Ren

PDF

Open Access 1 Repo

TL;DR

This paper introduces DRO, a novel end-to-end training framework for retrieval-augmented generation that jointly optimizes knowledge selection and language generation, significantly improving factual accuracy.

Contribution

DRO enables end-to-end training of retrieval and generation components using a variational approach with importance sampling, addressing limitations of previous methods.

Findings

01

DRO outperforms baselines with 5%-15% improvements in EM and F1.

02

The framework demonstrates stability, convergence, and reduced variance in experiments.

03

Theoretical analysis links DRO to policy-gradient methods in reinforcement learning.

Abstract

Retrieval-augmented generation (RAG) integrates large language models ( LLM s) with retrievers to access external knowledge, improving the factuality of LLM generation in knowledge-grounded tasks. To optimize the RAG performance, most previous work independently fine-tunes the retriever to adapt to frozen LLM s or trains the LLMs to use documents retrieved by off-the-shelf retrievers, lacking end-to-end training supervision. Recent work addresses this limitation by jointly training these two components but relies on overly simplifying assumptions of document independence, which has been criticized for being far from real-world scenarios. Thus, effectively optimizing the overall RAG performance remains a critical challenge. We propose a direct retrieval-augmented optimization framework, named DRO, that enables end-to-end training of two key components: (i) a generative knowledge…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mangopy/direct-rag-learning
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI-based Problem Solving and Planning

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Warmup With Linear Decay · Dropout · Layer Normalization · Byte Pair Encoding · Attention Dropout · Softmax · Residual Connection · WordPiece