LiteSemRAG: Lightweight LLM-Free Semantic-Aware Graph Retrieval for Robust RAG

Xiao Yue; Guangzhi Qu; Lige Gan

arXiv:2604.16350·cs.IR·April 21, 2026

LiteSemRAG: Lightweight LLM-Free Semantic-Aware Graph Retrieval for Robust RAG

Xiao Yue, Guangzhi Qu, Lige Gan

PDF

TL;DR

LiteSemRAG is a novel lightweight, LLM-free semantic-aware graph retrieval framework that enhances reasoning and evidence aggregation while significantly reducing computational costs.

Contribution

It introduces a fully LLM-free semantic graph construction and retrieval method with dynamic polysemy modeling and efficient two-step retrieval process.

Findings

01

Achieves the best MRR@10 across three benchmarks.

02

Outperforms LLM-based graph RAG systems in recall@10.

03

Consumes zero LLM tokens, improving efficiency.

Abstract

Graph-based Retrieval-Augmented Generation (RAG) has shown great potential for improving multi-level reasoning and structured evidence aggregation. However, existing graph-based RAG frameworks heavily rely on exploiting large language models (LLMs) during indexing and querying, leading to high token consumption, computational cost and latency overhead. In this paper, we propose LiteSemRAG, a lightweight, fully LLM-free, semantic-aware graph retrieval framework. LiteSemRAG constructs a heterogeneous semantic graph by exploiting contextual token-level embeddings, explicitly separating surface lexical representations from context-dependent semantic meanings. To robustly model polysemy, we introduce a dynamic semantic node construction mechanism with chunk-level context aggregation and adaptive anomaly handling. At query stage, LiteSemRAG performs a two-step semantic-aware retrieval process…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.