Improving Factuality in Large Language Models via Decoding-Time   Hallucinatory and Truthful Comparators

Dingkang Yang; Dongling Xiao; Jinjie Wei; Mingcheng Li; Zhaoyu Chen,; Ke Li; Lihua Zhang

arXiv:2408.12325·cs.CL·January 28, 2025

Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators

Dingkang Yang, Dongling Xiao, Jinjie Wei, Mingcheng Li, Zhaoyu Chen,, Ke Li, Lihua Zhang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a decoding-time framework using comparators to reduce hallucinations in large language models, improving factual accuracy without altering internal model parameters.

Contribution

It proposes a novel CDT framework with instruction-guided comparators to enhance factuality during decoding, addressing hallucination patterns across tasks.

Findings

01

Significant improvement in response factuality across multiple tasks

02

Effective contrastive decoding constrains hallucination patterns

03

Comparator-driven decoding enhances model reliability

Abstract

Despite their remarkable capabilities, Large Language Models (LLMs) are prone to generate responses that contradict verifiable facts, i.e., unfaithful hallucination content. Existing efforts generally focus on optimizing model parameters or editing semantic representations, which compromise the internal factual knowledge of target LLMs. In addition, hallucinations typically exhibit multifaceted patterns in downstream tasks, limiting the model's holistic performance across tasks. In this paper, we propose a Comparator-driven Decoding-Time (CDT) framework to alleviate the response hallucination. Firstly, we construct hallucinatory and truthful comparators with multi-task fine-tuning samples. In this case, we present an instruction prototype-guided mixture of experts strategy to enhance the ability of the corresponding comparators to capture different hallucination or truthfulness patterns…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ydk122024/cdt
jaxOfficial

Videos

Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Misinformation and Its Impacts

MethodsFocus