Evidence-based Distributional Alignment for Large Language Models
Viet-Thanh Pham, Lizhen Qu, Zhuang Li, Gholamreza Haffari

TL;DR
Evi-DA is a novel method that enhances large language models' ability to predict accurate, culturally-aware answer distributions by leveraging survey data and reinforcement learning, outperforming existing approaches especially under domain shifts.
Contribution
The paper introduces Evi-DA, a new evidence-based alignment technique that improves distributional predictions of LLMs using survey data and reinforcement learning, addressing instability and bias issues.
Findings
Reduces Jensen-Shannon divergence by up to 44%
Improves robustness under domain and cultural shifts
Enhances distribution fidelity across benchmarks
Abstract
Distributional alignment enables large language models (LLMs) to predict how a target population distributes its responses across answer options, rather than collapsing disagreement into a single consensus answer. However, existing LLM-based distribution prediction is often unstable and degrades under cultural and domain shift. Token score-based estimates can change with minor option wording or formatting, response sampling-based estimates are expensive and sensitive to prompts and decoding settings, and directly generated distributions are frequently miscalibrated. We propose Evi-DA, an evidence-based alignment technique that improves the fidelity and robustness of LLM-based distribution estimation under domain and cultural shift. Given a target country and a multiple-choice question, Evi-DA retrieves related World Values Survey items and their answer distributions, predicts a coarse…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods · Topic Modeling · Expert finding and Q&A systems
