HydraRAG: Structured Cross-Source Enhanced Large Language Model Reasoning
Xingyu Tan, Xiaoyang Wang, Qing Liu, Xiwei Xu, Xin Yuan, Liming Zhu, Wenjie Zhang

TL;DR
HydraRAG is a training-free framework that enhances large language model reasoning by effectively integrating structured knowledge graphs and unstructured documents for multi-hop, multi-entity, and multi-source verification tasks, achieving state-of-the-art results.
Contribution
HydraRAG introduces a novel, training-free approach that unifies graph topology, document semantics, and source reliability to improve reasoning in LLMs across multiple sources.
Findings
Achieves state-of-the-art performance on seven benchmarks with GPT-3.5-Turbo.
Outperforms baseline ToG-2 by an average of 20.3%.
Enables smaller models to match GPT-4-Turbo reasoning performance.
Abstract
Retrieval-augmented generation (RAG) enhances large language models (LLMs) by incorporating external knowledge. Current hybrid RAG system retrieves evidence from both knowledge graphs (KGs) and text documents to support LLM reasoning. However, it faces challenges like handling multi-hop reasoning, multi-entity questions, multi-source verification, and effective graph utilization. To address these limitations, we present HydraRAG, a training-free framework that unifies graph topology, document semantics, and source reliability to support deep, faithful reasoning in LLMs. HydraRAG handles multi-hop and multi-entity problems through agent-driven exploration that combines structured and unstructured retrieval, increasing both diversity and precision of evidence. To tackle multi-source verification, HydraRAG uses a tri-factor cross-source verification (source trustworthiness assessment,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
