Knowledge Graph-based Retrieval-Augmented Generation for Schema Matching
Chuangtao Ma, Sriom Chakrabarti, Arijit Khan, B\'alint Moln\'ar

TL;DR
This paper introduces KG-RAG4SM, a novel knowledge graph-based retrieval-augmented generation model that enhances schema matching accuracy by leveraging external large knowledge graphs, outperforming state-of-the-art methods without retraining.
Contribution
The paper proposes a new retrieval-augmented generation approach using knowledge graphs for schema matching, addressing semantic ambiguities and hallucination issues of LLMs.
Findings
KG-RAG4SM outperforms SOTA methods by up to 35.89% in precision.
The approach effectively mitigates hallucination problems in LLM-based schema matching.
It demonstrates scalability and efficiency in real-world large knowledge graphs.
Abstract
Traditional similarity-based schema matching methods are incapable of resolving semantic ambiguities and conflicts in domain-specific complex mapping scenarios due to missing commonsense and domain-specific knowledge. The hallucination problem of large language models (LLMs) also makes it challenging for LLM-based schema matching to address the above issues. Therefore, we propose a Knowledge Graph-based Retrieval-Augmented Generation model for Schema Matching, referred to as the KG-RAG4SM. In particular, KG-RAG4SM introduces novel vector-based, graph traversal-based, and query-based graph retrievals, as well as a hybrid approach and ranking schemes that identify the most relevant subgraphs from external large knowledge graphs (KGs). We showcase that KG-based retrieval-augmented LLMs are capable of generating more accurate results for complex matching cases without any re-training. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Graph Theory and Algorithms
