Truth or Deceit? A Bayesian Decoding Game Enhances Consistency and   Reliability

Weitong Zhang; Chengqi Zang; Bernhard Kainz

arXiv:2410.01064·cs.AI·October 3, 2024

Truth or Deceit? A Bayesian Decoding Game Enhances Consistency and Reliability

Weitong Zhang, Chengqi Zang, Bernhard Kainz

PDF

Open Access

TL;DR

This paper introduces a Bayesian game-theoretic decoding method for large language models that improves output consistency and reliability without extra training, outperforming larger models in truthfulness and coherence.

Contribution

It presents a novel multistage Bayesian decoding game that enhances LLM output consistency and reliability through dynamic consensus and ambiguity calibration, without human feedback.

Findings

01

Smaller models outperform larger ones in truthfulness (e.g., LLaMA13B vs PaLM540B).

02

The method improves consistency and reliability in LLM outputs.

03

The approach integrates various strategies and models using game-theoretic tools.

Abstract

Large Language Models (LLMs) often produce outputs that -- though plausible -- can lack consistency and reliability, particularly in ambiguous or complex scenarios. Challenges arise from ensuring that outputs align with both factual correctness and human intent. This is problematic in existing approaches that trade improved consistency for lower accuracy. To mitigate these challenges, we propose a novel game-theoretic approach to enhance consistency and reliability during the decoding stage of LLM output generation. Our method models the decoding process as a multistage Bayesian decoding game. This ensures consistency through Correctness Alignment and enhances reliability via Ambiguity Calibration. The model dynamically converges to a consensus on the most reliable outputs and distinguishes {Valid, Specious} outputs without human feedback or additional training. Our game design allows…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGame Theory and Applications

MethodsALIGN