Addressing Hallucinations in Language Models with Knowledge Graph Embeddings as an Additional Modality
Viktoriia Chekalina, Anton Razzhigaev, Elizaveta Goncharova, Andrey, Kuznetsov

TL;DR
This paper introduces a method to reduce hallucinations in large language models by integrating knowledge graph embeddings via adapters, improving factual accuracy without retraining the entire model.
Contribution
The authors propose a novel approach that incorporates knowledge graph embeddings into language models using adapters, avoiding fine-tuning and external retrieval.
Findings
Improved performance on HaluEval, True-False, and FEVER datasets.
Created WikiEntities dataset with 3 million annotated Wikipedia texts.
Demonstrated reduced hallucinations in multiple LLMs.
Abstract
In this paper we present an approach to reduce hallucinations in Large Language Models (LLMs) by incorporating Knowledge Graphs (KGs) as an additional modality. Our method involves transforming input text into a set of KG embeddings and using an adapter to integrate these embeddings into the language model space, without relying on external retrieval processes. To facilitate this, we created WikiEntities, a dataset containing over 3 million Wikipedia texts annotated with entities from Wikidata and their corresponding embeddings from PyTorch-BigGraph. This dataset serves as a valuable resource for training Entity Linking models and adapting the described method to various LLMs using specialized adapters. Our method does not require fine-tuning of the language models themselves; instead, we only train the adapter. This ensures that the model's performance on other tasks is not…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Advanced Graph Neural Networks · Big Data and Digital Economy
MethodsLLaMA · Sparse Evolutionary Training · Adapter
