Cross-Lingual Consensus: Aligning Multilingual Cultural Knowledge via Multilingual Self-Consistency

Andrew Ivan Soegeng; Patrick Sutanto; Tan Sang Nguyen

arXiv:2605.22137·cs.CL·May 22, 2026

Cross-Lingual Consensus: Aligning Multilingual Cultural Knowledge via Multilingual Self-Consistency

Andrew Ivan Soegeng, Patrick Sutanto, Tan Sang Nguyen

PDF

TL;DR

This paper introduces a self-supervised framework that enhances cross-lingual cultural knowledge alignment in LLMs, reducing Western bias and improving performance across languages by leveraging multilingual self-consistency and self-critique.

Contribution

It proposes a novel multilingual self-consistency approach to surface and transfer cultural knowledge across languages in LLMs, improving cultural alignment without external data.

Findings

01

Improves cultural alignment performance on English queries by 5.03% on BLEnD benchmark.

02

Leverages multilingual self-consistency to identify reliable responses across languages.

03

Uses a self-critique mechanism to transfer cultural knowledge to weaker languages.

Abstract

Although Large Language Models (LLMs) demonstrate strong capabilities across various tasks, they exhibit significant performance discrepancies across languages. While prompting LLMs in English typically yields the highest general performance, it often induces a Western-centric bias, hindering the model's ability to accurately reflect diverse cultural knowledge. We hypothesize that LLMs already possess rich cultural knowledge embedded within local-language representations, but fail to retrieve it when prompted in English. To bridge this cross-lingual knowledge gap, we propose a novel self-supervised framework. Our method leverages multilingual self-consistency to identify the most reliable cultural responses across languages, combined with a self-critique mechanism to transfer this knowledge to the weaker language. Evaluations on the BLEnD benchmark demonstrate that our approach…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.