Agent-UniRAG: A Trainable Open-Source LLM Agent Framework for Unified Retrieval-Augmented Generation Systems

Hoang Pham; Thuy-Duong Nguyen; Khac-Hoai Nam Bui

arXiv:2505.22571·cs.CL·June 2, 2025

Agent-UniRAG: A Trainable Open-Source LLM Agent Framework for Unified Retrieval-Augmented Generation Systems

Hoang Pham, Thuy-Duong Nguyen, Khac-Hoai Nam Bui

PDF

Open Access

TL;DR

This paper introduces Agent-UniRAG, a trainable open-source LLM agent framework that unifies retrieval-augmented generation for both single-hop and multi-hop queries, improving interpretability and effectiveness in complex question-answering tasks.

Contribution

The paper proposes a novel unified LLM agent framework, Agent-UniRAG, capable of handling diverse RAG tasks end-to-end, and introduces SynAgent-RAG, a synthetic dataset for training small open-source LLMs.

Findings

01

Achieves comparable performance with larger closed-source LLMs on RAG benchmarks.

02

Enables interpretability and step-by-step reasoning in RAG tasks.

03

Supports both single-hop and multi-hop queries in a unified framework.

Abstract

This paper presents a novel approach for unified retrieval-augmented generation (RAG) systems using the recent emerging large language model (LLM) agent concept. Specifically, Agent LLM, which utilizes LLM as fundamental controllers, has become a promising approach to enable the interpretability of RAG tasks, especially for complex reasoning question-answering systems (e.g., multi-hop queries). Nonetheless, previous works mainly focus on solving RAG systems with either single-hop or multi-hop approaches separately, which limits the application of those approaches to real-world applications. In this study, we propose a trainable agent framework called Agent-UniRAG for unified retrieval-augmented LLM systems, which enhances the effectiveness and interpretability of RAG systems. The main idea is to design an LLM agent framework to solve RAG tasks step-by-step based on the complexity of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPower Systems and Technologies · Semantic Web and Ontologies

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Attention Dropout · Softmax · WordPiece · Weight Decay · Multi-Head Attention · Layer Normalization · Byte Pair Encoding