A Probabilistic Framework for LLM Hallucination Detection via Belief   Tree Propagation

Bairu Hou; Yang Zhang; Jacob Andreas; Shiyu Chang

arXiv:2406.06950·cs.CL·February 11, 2025·1 cites

A Probabilistic Framework for LLM Hallucination Detection via Belief Tree Propagation

Bairu Hou, Yang Zhang, Jacob Andreas, Shiyu Chang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces BTProp, a probabilistic framework that uses belief trees and hidden Markov models to improve hallucination detection in large language models, outperforming existing methods.

Contribution

The paper proposes a novel belief tree propagation framework that effectively integrates continuous belief scores for better hallucination detection in LLMs.

Findings

01

BTProp improves detection performance by 3%-9% on benchmarks.

02

The belief tree structure captures logical relations among statements.

03

Continuous belief integration enhances detection accuracy.

Abstract

This paper focuses on the task of hallucination detection, which aims to determine the truthfulness of LLM-generated statements. To address this problem, a popular class of methods utilize the LLM's self-consistencies in its beliefs in a set of logically related augmented statements generated by the LLM, which does not require external knowledge databases and can work with both white-box and black-box LLMs. However, in many existing approaches, the augmented statements tend to be very monotone and unstructured, which makes it difficult to integrate meaningful information from the LLM beliefs in these statements. Also, many methods work with the binarized version of the LLM's belief, instead of the continuous version, which significantly loses information. To overcome these limitations, in this paper, we propose Belief Tree Propagation (BTProp), a probabilistic framework for LLM…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ucsb-nlp-chang/btprop
noneOfficial

Videos

A Probabilistic Framework for LLM Hallucination Detection via Belief Tree Propagation· underline

Taxonomy

TopicsDigital Media Forensic Detection

MethodsSparse Evolutionary Training