Agent-UniRAG: A Trainable Open-Source LLM Agent Framework for Unified Retrieval-Augmented Generation Systems
Hoang Pham, Thuy-Duong Nguyen, Khac-Hoai Nam Bui

TL;DR
This paper introduces Agent-UniRAG, a trainable open-source LLM agent framework that unifies retrieval-augmented generation for both single-hop and multi-hop queries, improving interpretability and effectiveness in complex question-answering tasks.
Contribution
The paper proposes a novel unified LLM agent framework, Agent-UniRAG, capable of handling diverse RAG tasks end-to-end, and introduces SynAgent-RAG, a synthetic dataset for training small open-source LLMs.
Findings
Achieves comparable performance with larger closed-source LLMs on RAG benchmarks.
Enables interpretability and step-by-step reasoning in RAG tasks.
Supports both single-hop and multi-hop queries in a unified framework.
Abstract
This paper presents a novel approach for unified retrieval-augmented generation (RAG) systems using the recent emerging large language model (LLM) agent concept. Specifically, Agent LLM, which utilizes LLM as fundamental controllers, has become a promising approach to enable the interpretability of RAG tasks, especially for complex reasoning question-answering systems (e.g., multi-hop queries). Nonetheless, previous works mainly focus on solving RAG systems with either single-hop or multi-hop approaches separately, which limits the application of those approaches to real-world applications. In this study, we propose a trainable agent framework called Agent-UniRAG for unified retrieval-augmented LLM systems, which enhances the effectiveness and interpretability of RAG systems. The main idea is to design an LLM agent framework to solve RAG tasks step-by-step based on the complexity of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPower Systems and Technologies · Semantic Web and Ontologies
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Attention Dropout · Softmax · WordPiece · Weight Decay · Multi-Head Attention · Layer Normalization · Byte Pair Encoding
