Evidence-based Distributional Alignment for Large Language Models

Viet-Thanh Pham; Lizhen Qu; Zhuang Li; Gholamreza Haffari

arXiv:2603.13305·cs.LG·March 17, 2026

Evidence-based Distributional Alignment for Large Language Models

Viet-Thanh Pham, Lizhen Qu, Zhuang Li, Gholamreza Haffari

PDF

Open Access

TL;DR

Evi-DA is a novel method that enhances large language models' ability to predict accurate, culturally-aware answer distributions by leveraging survey data and reinforcement learning, outperforming existing approaches especially under domain shifts.

Contribution

The paper introduces Evi-DA, a new evidence-based alignment technique that improves distributional predictions of LLMs using survey data and reinforcement learning, addressing instability and bias issues.

Findings

01

Reduces Jensen-Shannon divergence by up to 44%

02

Improves robustness under domain and cultural shifts

03

Enhances distribution fidelity across benchmarks

Abstract

Distributional alignment enables large language models (LLMs) to predict how a target population distributes its responses across answer options, rather than collapsing disagreement into a single consensus answer. However, existing LLM-based distribution prediction is often unstable and degrades under cultural and domain shift. Token score-based estimates can change with minor option wording or formatting, response sampling-based estimates are expensive and sensitive to prompts and decoding settings, and directly generated distributions are frequently miscalibrated. We propose Evi-DA, an evidence-based alignment technique that improves the fidelity and robustness of LLM-based distribution estimation under domain and cultural shift. Given a target country and a multiple-choice question, Evi-DA retrieves related World Values Survey items and their answer distributions, predicts a coarse…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational and Text Analysis Methods · Topic Modeling · Expert finding and Q&A systems