SemRAG: Semantic Knowledge-Augmented RAG for Improved Question-Answering

Kezhen Zhong; Basem Suleiman; Abdelkarim Erradi; Shijing Chen

arXiv:2507.21110·cs.CL·July 30, 2025

SemRAG: Semantic Knowledge-Augmented RAG for Improved Question-Answering

Kezhen Zhong, Basem Suleiman, Abdelkarim Erradi, Shijing Chen

PDF

TL;DR

SemRAG enhances retrieval-augmented generation by integrating semantic chunking and knowledge graphs, significantly improving domain-specific question-answering without extensive fine-tuning, thus offering a scalable and resource-efficient solution.

Contribution

Introduces SemRAG, a novel framework combining semantic chunking and knowledge graphs to improve retrieval accuracy in domain-specific LLM applications without heavy fine-tuning.

Findings

01

Outperforms traditional RAG on MultiHop and Wikipedia datasets

02

Enhances retrieval relevance and correctness

03

Optimizes buffer sizes for better performance

Abstract

This paper introduces SemRAG, an enhanced Retrieval Augmented Generation (RAG) framework that efficiently integrates domain-specific knowledge using semantic chunking and knowledge graphs without extensive fine-tuning. Integrating domain-specific knowledge into large language models (LLMs) is crucial for improving their performance in specialized tasks. Yet, existing adaptations are computationally expensive, prone to overfitting and limit scalability. To address these challenges, SemRAG employs a semantic chunking algorithm that segments documents based on the cosine similarity from sentence embeddings, preserving semantic coherence while reducing computational overhead. Additionally, by structuring retrieved information into knowledge graphs, SemRAG captures relationships between entities, improving retrieval accuracy and contextual understanding. Experimental results on MultiHop RAG…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.