Edu-Values: Towards Evaluating the Chinese Education Values of Large Language Models

Peiyi Zhang; Yazhou Zhang; Bo Wang; Lu Rong; Prayag Tiwari; Jing Qin

arXiv:2409.12739·cs.CL·March 24, 2026·2 cites

Open Access

TL;DR

Edu-Values is a Chinese education values benchmark for evaluating large language models. Results on it show that educational culture affects model performance, that models struggle with teachers' professional ethics and professional philosophy, and that using the benchmark as external knowledge improves alignment.

Contribution

This paper introduces Edu-Values, the first Chinese education values benchmark with diverse question types, and demonstrates its effectiveness in evaluating and enhancing LLMs.

Findings

01

Chinese LLMs outperform English LLMs, which the authors attribute to differences in educational culture

02

LLMs often struggle with teachers' professional ethics and professional philosophy

03

Using Edu-Values as a source of external knowledge improves LLM alignment

Abstract

In this paper, we present Edu-Values, the first Chinese education values evaluation benchmark. It covers seven core values: professional philosophy, teachers' professional ethics, education laws and regulations, cultural literacy, educational knowledge and skills, basic competencies, and subject knowledge. We design 1,418 questions, covering multiple-choice, multi-modal question answering, subjective analysis, adversarial prompts, and Chinese traditional culture (short answer) questions. We conduct human-feedback-based automatic evaluation of 21 state-of-the-art (SoTA) LLMs and highlight three main findings: (1) due to differences in educational culture, Chinese LLMs outperform English LLMs, with Qwen 2 ranking first with a score of 81.37; (2) LLMs often struggle with teachers' professional ethics and professional philosophy; (3) leveraging Edu-Values to build an…

Taxonomy

Topics: Computational and Text Analysis Methods