Training-Free Cultural Alignment of Large Language Models via Persona Disagreement

Huynh Trung Kiet; Dao Sy Duy Minh; Tuan Nguyen; Chi-Nguyen Tran; Phu-Hoa Pham; Nguyen Lam Phu Quy; The Anh Han; Long Tran-Thanh

arXiv:2605.10843·cs.CL·May 19, 2026

Training-Free Cultural Alignment of Large Language Models via Persona Disagreement

Huynh Trung Kiet, Dao Sy Duy Minh, Tuan Nguyen, Chi-Nguyen Tran, Phu-Hoa Pham, Nguyen Lam Phu Quy, The Anh Han, Long Tran-Thanh

PDF

TL;DR

This paper introduces DISCA, a training-free method for aligning large language models with diverse cultural preferences by leveraging sociodemographic disagreement among persona agents during inference.

Contribution

DISCA is a novel inference-time calibration technique that uses persona disagreement to improve cultural alignment without fine-tuning or access to internal model details.

Findings

01

DISCA reduces cultural misalignment by 10-24% across multiple models and countries.

02

The method works without changing model weights, only during inference.

03

It is effective across various open-weight language models and scenarios.

Abstract

Large language models increasingly mediate decisions that turn on moral judgement, yet a growing body of evidence shows that their implicit preferences are not culturally neutral. Existing cultural alignment methods either require per-country preference data and fine-tuning budgets or assume white-box access to model internals that commercial APIs do not expose. In this work, we focus on this realistic black-box, public-data-only regime and observe that within-country sociodemographic disagreement, not consensus, is the primary steering signal. We introduce DISCA (Disagreement-Informed Steering for Cultural Alignment), an inference-time method that instantiates each country as a panel of World-Values-Survey-grounded persona agents and converts their disagreement into a bounded, loss-averse logit correction. Across 20 countries and 7 open-weight backbones (2B--70B), DISCA reduces…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.