SRAG: RAG with Structured Data Improves Vector Retrieval

Shalin Shah; Srikanth Ryali; Ramasubbu Venkatesh

arXiv:2603.26670·cs.IR·March 31, 2026

SRAG: RAG with Structured Data Improves Vector Retrieval

Shalin Shah, Srikanth Ryali, Ramasubbu Venkatesh

PDF

TL;DR

SRAG enhances vector retrieval for LLMs by incorporating structured data like topics, sentiments, and knowledge graph triples, significantly improving answer quality in question answering systems.

Contribution

The paper introduces Structured RAG (SRAG), a novel method that adds structured information to improve retrieval accuracy in RAG systems.

Findings

01

30% improvement in answer scoring with GPT-5 as judge

02

Significant gains in comparative, analytical, and predictive questions

03

Broader, more diverse retrieval with minimal losses in tail risk analysis

Abstract

Retrieval Augmented Generation (RAG) provides the necessary informational grounding to LLMs in the form of chunks retrieved from a vector database or through web search. RAG could also use knowledge graph triples as a means of providing factual information to an LLM. However, the retrieval is only based on representational similarity between a question and the contents. The performance of RAG depends on the numeric vector representations of the query and the chunks. To improve these representations, we propose Structured RAG (SRAG), which adds structured information to a query as well as the chunks in the form of topics, sentiments, query and chunk types (e.g., informational, quantitative), knowledge graph triples and semantic tags. Experiments indicate that this method significantly improves the retrieval process. Using GPT-5 as an LLM-as-a-judge, results show that the method improves…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.