SRAG: RAG with Structured Data Improves Vector Retrieval
Shalin Shah, Srikanth Ryali, Ramasubbu Venkatesh

TL;DR
SRAG enhances vector retrieval for LLMs by incorporating structured data like topics, sentiments, and knowledge graph triples, significantly improving answer quality in question answering systems.
Contribution
The paper introduces Structured RAG (SRAG), a novel method that adds structured information to improve retrieval accuracy in RAG systems.
Findings
30% improvement in answer scoring with GPT-5 as judge
Significant gains in comparative, analytical, and predictive questions
Broader, more diverse retrieval with minimal losses in tail risk analysis
Abstract
Retrieval Augmented Generation (RAG) provides the necessary informational grounding to LLMs in the form of chunks retrieved from a vector database or through web search. RAG could also use knowledge graph triples as a means of providing factual information to an LLM. However, the retrieval is only based on representational similarity between a question and the contents. The performance of RAG depends on the numeric vector representations of the query and the chunks. To improve these representations, we propose Structured RAG (SRAG), which adds structured information to a query as well as the chunks in the form of topics, sentiments, query and chunk types (e.g., informational, quantitative), knowledge graph triples and semantic tags. Experiments indicate that this method significantly improves the retrieval process. Using GPT-5 as an LLM-as-a-judge, results show that the method improves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
