The performances of the Chinese and U.S. Large Language Models on the Topic of Chinese Culture

Feiyan Liu; Siyan Zhao; Chenxun Zhuo; Tianming Liu; Bao Ge

arXiv:2601.02830·cs.CL·January 8, 2026

The performances of the Chinese and U.S. Large Language Models on the Topic of Chinese Culture

Feiyan Liu, Siyan Zhao, Chenxun Zhuo, Tianming Liu, Bao Ge

PDF

Open Access

TL;DR

This study compares Chinese and U.S. large language models' performance on Chinese cultural questions, revealing Chinese models generally outperform U.S. models, with differences potentially due to training data and localization strategies.

Contribution

It provides a comparative analysis of Chinese and U.S. LLMs on Chinese culture, highlighting performance disparities and possible underlying causes.

Findings

01

Chinese models outperform U.S. models on Chinese cultural tasks

02

Gemini 2.5Pro and GPT-5.1 achieve higher accuracy among U.S. models

03

Performance differences linked to training data and localization strategies

Abstract

Cultural backgrounds shape individuals' perspectives and approaches to problem-solving. Since the emergence of GPT-1 in 2018, large language models (LLMs) have undergone rapid development. To date, the world's ten leading LLM developers are primarily based in China and the United States. To examine whether LLMs released by Chinese and U.S. developers exhibit cultural differences in Chinese-language settings, we evaluate their performance on questions about Chinese culture. This study adopts a direct-questioning paradigm to evaluate models such as GPT-5.1, DeepSeek-V3.2, Qwen3-Max, and Gemini2.5Pro. We assess their understanding of traditional Chinese culture, including history, literature, poetry, and related domains. Comparative analyses between LLMs developed in China and the U.S. indicate that Chinese models generally outperform their U.S. counterparts on these tasks. Among…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational and Text Analysis Methods · Artificial Intelligence in Healthcare and Education · Big Data and Digital Economy