Distilling Bayesian Belief States into Language Models for Auditable Negotiation
Zongqi Cui, Baihan Lin

TL;DR
This paper introduces BOND, a framework that distills Bayesian opponent belief states into language models for auditable negotiation, enabling transparent belief inference and decision-making in dialogue agents.
Contribution
BOND combines a Bayesian teacher with a smaller student model to produce interpretable belief states and improve auditable negotiation capabilities over existing methods.
Findings
BOND outperforms state-of-the-art on the CaSiNo dataset with a mean Brier score of 0.085.
The distilled student maintains belief signals with a Brier score of 0.114, better than uniform priors.
Smaller models yield better calibration of elicited posterior beliefs compared to larger baselines.
Abstract
Negotiation agents must infer what their counterpart values, update those beliefs over dialogue turns, and choose actions under uncertainty. End-to-end large language models (LLMs) can imitate negotiation dialogue, but their opponent beliefs are usually implicit and difficult to inspect. We propose BOND (Bayesian Opponent-belief Negotiation Distillation), a framework for auditable negotiation. BOND consists of an LLM-based Bayesian teacher that scores dialogue contexts against the six possible opponent priority orderings, updates a posterior over those orderings, and uses the posterior for menu-based decision making, as well as a smaller 8B student language model that emits both negotiation actions and normalized posterior beliefs as tagged text. In the CaSiNo negotiation dataset, BOND outperforms the state-of-the-art and achieves mean Brier score 0.085 over opponent-priority…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
