Can reasoning models comprehend mathematical problems in Chinese ancient texts? An empirical study based on data from Suanjing Shishu
Chang Liu, Dongbo Wang, Liu Liu, Zhixiao Zhao

TL;DR
This paper evaluates how well reasoning models understand and solve ancient Chinese mathematical problems from classical texts, highlighting current limitations and suggesting directions for improvement in cross-cultural AI comprehension.
Contribution
It introduces Guji_MATH, a new benchmark dataset for assessing reasoning models on classical Chinese mathematical problems, and systematically evaluates model performance in this context.
Findings
Models can partially solve ancient Chinese math problems
Performance is below modern math benchmarks
Enhancing cultural and language understanding can improve results
Abstract
This study addresses the challenges in intelligent processing of Chinese ancient mathematical classics by constructing Guji_MATH, a benchmark for evaluating classical texts based on Suanjing Shishu. It systematically assesses the mathematical problem-solving capabilities of mainstream reasoning models under the unique linguistic constraints of classical Chinese. Through machine-assisted annotation and manual verification, 538 mathematical problems were extracted from 8 canonical texts, forming a structured dataset centered on the "Question-Answer-Solution" framework, supplemented by problem types and difficulty levels. Dual evaluation modes--closed-book (autonomous problem-solving) and open-book (reproducing classical solution methods)--were designed to evaluate the performance of six reasoning models on ancient Chinese mathematical problems. Results indicate that reasoning models can…
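The dataset layout and the two evaluation modes described in the abstract could be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the class name `GujiMathItem`, its field names, the helper `build_prompt`, the mode strings, and the prompt wording are all hypothetical. It also assumes that open-book mode supplies the classical solution text to the model, which the abstract does not state explicitly.

```python
from dataclasses import dataclass


@dataclass
class GujiMathItem:
    # Hypothetical field names: the source names only the
    # "Question-Answer-Solution" framework, problem types, and difficulty levels.
    question: str
    answer: str
    solution: str
    problem_type: str
    difficulty: str


def build_prompt(item: GujiMathItem, mode: str) -> str:
    """Build an evaluation prompt for one item (prompt wording is an assumption)."""
    if mode == "closed_book":
        # Closed-book: the model sees only the question and solves it autonomously.
        return (
            f"Question: {item.question}\n"
            "Solve the problem and state the final answer."
        )
    if mode == "open_book":
        # Open-book (assumed): the model also sees the classical solution
        # and is asked to reproduce that method.
        return (
            f"Question: {item.question}\n"
            f"Classical solution: {item.solution}\n"
            "Follow the classical method above and state the final answer."
        )
    raise ValueError(f"unknown mode: {mode}")


# Placeholder item; the strings stand in for real classical Chinese text.
example = GujiMathItem(
    question="<problem text>",
    answer="<reference answer>",
    solution="<classical solution text>",
    problem_type="<type>",
    difficulty="<level>",
)
closed_prompt = build_prompt(example, "closed_book")
open_prompt = build_prompt(example, "open_book")
```

Keeping the reference answer out of both prompts lets the same record serve for scoring in either mode.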
Taxonomy
Topics: History and Theory of Mathematics · Mathematics Education and Teaching Techniques · Cognitive and developmental aspects of mathematical skills
