Optimizing Knowledge Integration in Retrieval-Augmented Generation with   Self-Selection

Yan Weng; Fengbin Zhu; Tong Ye; Haoyan Liu; Fuli Feng; Tat-Seng Chua

arXiv:2502.06148·cs.CL·February 11, 2025

Optimizing Knowledge Integration in Retrieval-Augmented Generation with Self-Selection

Yan Weng, Fengbin Zhu, Tong Ye, Haoyan Liu, Fuli Feng, Tat-Seng Chua

PDF

Open Access

TL;DR

This paper introduces a Self-Selection RAG framework that improves knowledge integration in LLMs by enabling the model to select the best response from internally generated and externally retrieved knowledge, enhancing accuracy.

Contribution

The paper presents a novel Self-Selection RAG method with a training approach using DPO on RGP data, improving knowledge integration in LLMs.

Findings

01

Outperforms baseline methods on Natural Questions and TrivialQA datasets.

02

Effective in enhancing accuracy by combining internal and external knowledge.

03

Demonstrates superiority with open-source LLMs like Llama2-13B-Chat and Mistral-7B.

Abstract

Retrieval-Augmented Generation (RAG), which integrates external knowledge into Large Language Models (LLMs), has proven effective in enabling LLMs to produce more accurate and reliable responses. However, it remains a significant challenge how to effectively integrate external retrieved knowledge with internal parametric knowledge in LLMs. In this work, we propose a novel Self-Selection RAG framework, where the LLM is made to select from pairwise responses generated with internal parametric knowledge solely and with external retrieved knowledge together to achieve enhanced accuracy. To this end, we devise a Self-Selection-RGP method to enhance the capabilities of the LLM in both generating and selecting the correct answer, by training the LLM with Direct Preference Optimization (DPO) over a curated Retrieval Generation Preference (RGP) dataset. Experimental results with two open-source…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Recommender Systems and Techniques · AI-based Problem Solving and Planning

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Attention Dropout · Softmax · WordPiece · Weight Decay · Multi-Head Attention · Layer Normalization · Byte Pair Encoding