Addressing Hallucinations in Language Models with Knowledge Graph   Embeddings as an Additional Modality

Viktoriia Chekalina; Anton Razzhigaev; Elizaveta Goncharova; Andrey; Kuznetsov

arXiv:2411.11531·cs.CL·January 15, 2025

Addressing Hallucinations in Language Models with Knowledge Graph Embeddings as an Additional Modality

Viktoriia Chekalina, Anton Razzhigaev, Elizaveta Goncharova, Andrey, Kuznetsov

PDF

Open Access

TL;DR

This paper introduces a method to reduce hallucinations in large language models by integrating knowledge graph embeddings via adapters, improving factual accuracy without retraining the entire model.

Contribution

The authors propose a novel approach that incorporates knowledge graph embeddings into language models using adapters, avoiding fine-tuning and external retrieval.

Findings

01

Improved performance on HaluEval, True-False, and FEVER datasets.

02

Created WikiEntities dataset with 3 million annotated Wikipedia texts.

03

Demonstrated reduced hallucinations in multiple LLMs.

Abstract

In this paper we present an approach to reduce hallucinations in Large Language Models (LLMs) by incorporating Knowledge Graphs (KGs) as an additional modality. Our method involves transforming input text into a set of KG embeddings and using an adapter to integrate these embeddings into the language model space, without relying on external retrieval processes. To facilitate this, we created WikiEntities, a dataset containing over 3 million Wikipedia texts annotated with entities from Wikidata and their corresponding embeddings from PyTorch-BigGraph. This dataset serves as a valuable resource for training Entity Linking models and adapting the described method to various LLMs using specialized adapters. Our method does not require fine-tuning of the language models themselves; instead, we only train the adapter. This ensures that the model's performance on other tasks is not…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Advanced Graph Neural Networks · Big Data and Digital Economy

MethodsLLaMA · Sparse Evolutionary Training · Adapter