Orion-RAG: Path-Aligned Hybrid Retrieval for Graphless Data
Zhen Chen, Weihao Xie, Peilin Chen, Shiqi Wang, Jianping Wang

TL;DR
Orion-RAG introduces a simple, low-complexity method to connect fragmented, discrete data across files, enhancing retrieval-augmented generation with improved accuracy and efficiency in real-world scenarios.
Contribution
It proposes a lightweight path extraction strategy that transforms unstructured documents into semi-structured data, enabling effective cross-file information linking without complex algorithms.
Findings
Outperforms mainstream frameworks across multiple domains.
Achieves 25.2% relative improvement on FinanceBench.
Supports real-time updates and human-in-the-loop verification.
Abstract
Retrieval-Augmented Generation (RAG) has proven effective for knowledge synthesis, yet it encounters significant challenges in practical scenarios where data is inherently discrete and fragmented. In most environments, information is distributed across isolated files like reports and logs that lack explicit links. Standard search engines process files independently, ignoring the connections between them. Furthermore, manually building Knowledge Graphs is impractical for such vast data. To bridge this gap, we present Orion-RAG. Our core insight is simple yet effective: we do not need heavy algorithms to organize this data. Instead, we use a low-complexity strategy to extract lightweight paths that naturally link related concepts. We demonstrate that this streamlined approach suffices to transform fragmented documents into semi-structured data, enabling the system to link information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Topic Modeling · Information Retrieval and Search Behavior
