AssistRAG: Boosting the Potential of Large Language Models with an   Intelligent Information Assistant

Yujia Zhou; Zheng Liu; Zhicheng Dou

arXiv:2411.06805·cs.CL·November 12, 2024

AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information Assistant

Yujia Zhou, Zheng Liu, Zhicheng Dou

PDF

Open Access 1 Repo

TL;DR

AssistRAG introduces an intelligent assistant within LLMs that manages knowledge and reasoning through a two-phase training process, significantly improving factual accuracy and complex reasoning over previous retrieval-augmented methods.

Contribution

The paper presents AssistRAG, a novel framework that integrates an intelligent assistant into LLMs, enhancing retrieval and reasoning without extensive retraining.

Findings

01

Outperforms existing benchmarks in accuracy and reasoning.

02

Benefits less advanced LLMs significantly.

03

Enhances factual correctness and decision-making.

Abstract

The emergence of Large Language Models (LLMs) has significantly advanced natural language processing, but these models often generate factually incorrect information, known as "hallucination". Initial retrieval-augmented generation (RAG) methods like the "Retrieve-Read" framework was inadequate for complex reasoning tasks. Subsequent prompt-based RAG strategies and Supervised Fine-Tuning (SFT) methods improved performance but required frequent retraining and risked altering foundational LLM capabilities. To cope with these challenges, we propose Assistant-based Retrieval-Augmented Generation (AssistRAG), integrating an intelligent information assistant within LLMs. This assistant manages memory and knowledge through tool usage, action execution, memory building, and plan specification. Using a two-phase training approach, Curriculum Assistant Learning and Reinforced Preference…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

smallporridge/assistrag
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Dropout · Linear Warmup With Linear Decay · WordPiece · Dense Connections · Layer Normalization · Adam · Attention Dropout