Unveiling the Competitive Dynamics: A Comparative Evaluation of American and Chinese LLMs
Zhenhui Jiang, Jiaxin Li, Yang Liu

TL;DR
This paper compares American and Chinese Large Language Models across languages and tasks, highlighting performance disparities and emphasizing the need for culturally nuanced development and international collaboration.
Contribution
The paper introduces an evaluation framework covering natural language proficiency, disciplinary expertise, and safety and responsibility, and applies it to 16 prominent US and Chinese models in both English and Chinese contexts.
Findings
GPT 4-Turbo leads in English contexts
Ernie-Bot 4 excels in Chinese contexts
Performance varies significantly across languages and tasks
Abstract
The strategic significance of Large Language Models (LLMs) in economic expansion, innovation, societal development, and national security has been increasingly recognized since the advent of ChatGPT. This study provides a comprehensive comparative evaluation of American and Chinese LLMs in both English and Chinese contexts. We proposed a comprehensive evaluation framework that encompasses natural language proficiency, disciplinary expertise, and safety and responsibility, and systematically assessed 16 prominent models from the US and China under various operational tasks and scenarios. Our key findings show that GPT 4-Turbo is at the forefront in English contexts, whereas Ernie-Bot 4 stands out in Chinese contexts. The study also highlights disparities in LLM performance across languages and tasks, stressing the necessity for linguistically and culturally nuanced model development. The…
Taxonomy
Topics: Innovation and Knowledge Management · International Business and FDI
Methods: Attention Is All You Need · Linear Layer · Discriminative Fine-Tuning · Multi-Head Attention · Layer Normalization · Dense Connections · Attention Dropout · Weight Decay
