Truth or Deceit? A Bayesian Decoding Game Enhances Consistency and Reliability
Weitong Zhang, Chengqi Zang, Bernhard Kainz

TL;DR
This paper introduces a Bayesian game-theoretic decoding method for large language models that improves output consistency and reliability without extra training, outperforming larger models in truthfulness and coherence.
Contribution
It presents a novel multistage Bayesian decoding game that enhances LLM output consistency and reliability through dynamic consensus and ambiguity calibration, without human feedback.
Findings
Smaller models outperform larger ones in truthfulness (e.g., LLaMA13B vs PaLM540B).
The method improves consistency and reliability in LLM outputs.
The approach integrates various strategies and models using game-theoretic tools.
Abstract
Large Language Models (LLMs) often produce outputs that -- though plausible -- can lack consistency and reliability, particularly in ambiguous or complex scenarios. Challenges arise from ensuring that outputs align with both factual correctness and human intent. This is problematic in existing approaches that trade improved consistency for lower accuracy. To mitigate these challenges, we propose a novel game-theoretic approach to enhance consistency and reliability during the decoding stage of LLM output generation. Our method models the decoding process as a multistage Bayesian decoding game. This ensures consistency through Correctness Alignment and enhances reliability via Ambiguity Calibration. The model dynamically converges to a consensus on the most reliable outputs and distinguishes {Valid, Specious} outputs without human feedback or additional training. Our game design allows…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGame Theory and Applications
MethodsALIGN
