Diagnosing Korean-Language LLM Political Bias via Census-Grounded Agent Simulation
Sungwoo Kang

TL;DR
This paper introduces Dynamo-K, a framework for diagnosing political biases in Korean-language LLMs through simulations, revealing systematic failure modes and proposing calibration methods, with validation on election predictions.
Contribution
The paper presents Dynamo-K, a novel, open-source simulation framework for diagnosing and addressing political biases in Korean-language LLMs, including new calibration techniques.
Findings
Mitigation reduces MAE by 5.2 times in moderate agents.
Scenario reframing recovers 62% of 2017 MAE.
Reweighting adapter calibrates models without candidate names.
Abstract
Large language models (LLMs) exhibit systematic political biases in voter simulations, but their underlying mechanisms and cross-lingual generalizations remain poorly understood. We introduce Dynamo-K, a census-grounded simulation framework evaluating Korean-language LLM political behavior across four models on six Korean elections (2017-2025). Using this framework, we identify three systematic failure modes: (1) progressive bias in moderate agents, where explicit mitigation reduces Mean Absolute Error (MAE) by 5.2 times; (2) model-dependent third-party salience collapse, distinguishing between salience failure and decision bias; and (3) regional polarization collapse, where models bidirectionally under-predict historical party strongholds. To address these failures, we demonstrate that scenario reframing recovers 62% of 2017 MAE by restoring third-party visibility. Furthermore, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
