Trustful LLMs: Customizing and Grounding Text Generation with Knowledge   Bases and Dual Decoders

Xiaofeng Zhu; Jaya Krishna Mandivarapu

arXiv:2411.07870·cs.CL·December 23, 2024

Trustful LLMs: Customizing and Grounding Text Generation with Knowledge Bases and Dual Decoders

Xiaofeng Zhu, Jaya Krishna Mandivarapu

PDF

Open Access 1 Video

TL;DR

This paper introduces a novel approach to improve the groundedness and correctness of large language models by using knowledge bases and dual decoders to correct hallucinations and ensure domain relevance.

Contribution

It proposes a post-processing correction algorithm and a dual-decoder model that effectively incorporate RAG context to enhance LLM output accuracy and domain grounding.

Findings

01

The correction algorithm reduces hallucinations in generated content.

02

The dual-decoder model improves the factual accuracy of LLM outputs.

03

Enhanced grounding leads to more reliable domain-specific text generation.

Abstract

Although people are impressed by the content generation skills of large language models, the use of LLMs, such as ChatGPT, is limited by the domain grounding of the content. The correctness and groundedness of the generated content need to be based on a verified context, such as results from Retrieval-Augmented Generation (RAG). One important issue when adapting LLMs to a customized domain is that the generated responses are often incomplete, or the additions are not verified and may even be hallucinated. Prior studies on hallucination detection have focused on evaluation metrics, which are not easily adaptable to dynamic domains and can be vulnerable to attacks like jail-breaking. In this work, we propose 1) a post-processing algorithm that leverages knowledge triplets in RAG context to correct hallucinations and 2) a dual-decoder model that fuses RAG context to guide the generation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Trustful LLMs: Customizing and Grounding Text Generation with knowledge bases and Dual Decoders· underline

Taxonomy

TopicsNatural Language Processing Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Dropout · Linear Warmup With Linear Decay · WordPiece · Dense Connections · Layer Normalization · Adam · Attention Dropout