Combining LLMs and Knowledge Graphs to Reduce Hallucinations in Question Answering

Larissa Pusch; Tim O. F. Conrad

arXiv:2409.04181·cs.CL·November 12, 2025·3 cites

Combining LLMs and Knowledge Graphs to Reduce Hallucinations in Question Answering

Larissa Pusch, Tim O. F. Conrad

PDF

Open Access

TL;DR

This paper introduces a hybrid method combining Large Language Models and Knowledge Graphs to reduce hallucinations in biomedical question-answering, enhancing accuracy and reliability with a new benchmark and user interface.

Contribution

It presents a novel approach integrating LLMs and KGs with a query checker, significantly reducing hallucinations in biomedical QA systems.

Findings

01

GPT-4 Turbo outperforms other models in accuracy

02

Open-source llama3:70b shows promise with prompt engineering

03

The approach effectively reduces hallucinations and data gaps

Abstract

Advancements in natural language processing have revolutionized the way we can interact with digital information systems, such as databases, making them more accessible. However, challenges persist, especially when accuracy is critical, as in the biomedical domain. A key issue is the hallucination problem, where models generate information unsupported by the underlying data, potentially leading to dangerous misinformation. This paper presents a novel approach designed to bridge this gap by combining Large Language Models (LLM) and Knowledge Graphs (KG) to improve the accuracy and reliability of question-answering systems, on the example of a biomedical KG. Built on the LangChain framework, our method incorporates a query checker that ensures the syntactical and semantic validity of LLM-generated queries, which are then used to extract information from a Knowledge Graph, substantially…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Seismology and Earthquake Studies · Machine Learning in Healthcare

MethodsByte Pair Encoding · Absolute Position Encodings · Softmax · Label Smoothing · Layer Normalization · Dropout · Attention Is All You Need · Position-Wise Feed-Forward Layer · Residual Connection · Linear Layer