HSKBenchmark: Modeling and Benchmarking Chinese Second Language Acquisition in Large Language Models through Curriculum Tuning
Qihao Yang, Xuelin Wang, Jiale Chen, Xuelian Dong, Yuxin Hao, Tianyong Hao

TL;DR
This paper introduces HSKBenchmark, a comprehensive Chinese SLA benchmark for LLMs, along with curriculum tuning and an evaluation system, enabling models to simulate human-like language acquisition and achieve performance comparable to advanced learners.
Contribution
It presents the first staged Chinese SLA benchmark for LLMs, a curriculum-tuning framework, and an evaluation system to assess language acquisition progress and writing quality.
Findings
HSKBenchmark effectively models Chinese SLA in LLMs.
Fine-tuned models reach human-level writing performance.
The benchmark supports dynamic assessment of language learning stages.
Abstract
Language acquisition is vital to revealing the nature of human language intelligence and has recently emerged as a promising perspective for improving the interpretability of large language models (LLMs). However, it is ethically and practically infeasible to conduct experiments that require controlling human learners' language inputs. This poses challenges for the verifiability and scalability of language acquisition modeling, particularly in Chinese second language acquisition (SLA). While LLMs provide a controllable and reproducible alternative, a systematic benchmark to support phase-wise modeling and assessment is still lacking. In this paper, we present HSKBenchmark, the first benchmark for staged modeling and writing assessment of LLMs in Chinese SLA. It covers HSK levels 3 to 6 and includes authentic textbooks with 6.76 million tokens, 16K synthetic instruction samples, 30 test…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsText Readability and Simplification · Topic Modeling · Natural Language Processing Techniques
