LiteSemRAG: Lightweight LLM-Free Semantic-Aware Graph Retrieval for Robust RAG
Xiao Yue, Guangzhi Qu, Lige Gan

TL;DR
LiteSemRAG is a novel lightweight, LLM-free semantic-aware graph retrieval framework that enhances reasoning and evidence aggregation while significantly reducing computational costs.
Contribution
It introduces a fully LLM-free semantic graph construction and retrieval method with dynamic polysemy modeling and efficient two-step retrieval process.
Findings
Achieves the best MRR@10 across three benchmarks.
Outperforms LLM-based graph RAG systems in recall@10.
Consumes zero LLM tokens, improving efficiency.
Abstract
Graph-based Retrieval-Augmented Generation (RAG) has shown great potential for improving multi-level reasoning and structured evidence aggregation. However, existing graph-based RAG frameworks heavily rely on exploiting large language models (LLMs) during indexing and querying, leading to high token consumption, computational cost and latency overhead. In this paper, we propose LiteSemRAG, a lightweight, fully LLM-free, semantic-aware graph retrieval framework. LiteSemRAG constructs a heterogeneous semantic graph by exploiting contextual token-level embeddings, explicitly separating surface lexical representations from context-dependent semantic meanings. To robustly model polysemy, we introduce a dynamic semantic node construction mechanism with chunk-level context aggregation and adaptive anomaly handling. At query stage, LiteSemRAG performs a two-step semantic-aware retrieval process…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
