A Cross-Cultural Comparison of LLM-based Public Opinion Simulation: Evaluating Chinese and U.S. Models on Diverse Societies

Weihong Qi; Fan Huang; Jisun An; Haewoon Kwak

arXiv:2506.21587·cs.CL·September 16, 2025

A Cross-Cultural Comparison of LLM-based Public Opinion Simulation: Evaluating Chinese and U.S. Models on Diverse Societies

Weihong Qi, Fan Huang, Jisun An, Haewoon Kwak

PDF

Open Access

TL;DR

This study compares the ability of various large language models to simulate public opinions across Chinese and U.S. societies, revealing strengths and limitations in cultural and issue-specific contexts, and emphasizing the need to address demographic biases.

Contribution

It provides a cross-cultural evaluation of LLMs' effectiveness in public opinion simulation, highlighting their comparative performance and biases in Chinese and U.S. societal contexts.

Findings

01

DeepSeek-V3 best predicts U.S. opinions on abortion.

02

Models tend to overgeneralize responses within demographic groups.

03

All models show biases and limitations in capturing cultural nuances.

Abstract

This study evaluates the ability of DeepSeek, an open-source large language model (LLM), to simulate public opinions in comparison to LLMs developed by major tech companies. By comparing DeepSeek-R1 and DeepSeek-V3 with Qwen2.5, GPT-4o, and Llama-3.3 and utilizing survey data from the American National Election Studies (ANES) and the Zuobiao dataset of China, we assess these models' capacity to predict public opinions on social issues in both China and the United States, highlighting their comparative capabilities between countries. Our findings indicate that DeepSeek-V3 performs best in simulating U.S. opinions on the abortion issue compared to other topics such as climate change, gun control, immigration, and services for same-sex couples, primarily because it more accurately simulates responses when provided with Democratic or liberal personas. For Chinese samples, DeepSeek-V3…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational and Text Analysis Methods