Fine-tuning ChatGPT for Automatic Scoring of Written Scientific   Explanations in Chinese

Jie Yang; Ehsan Latif; Yuze He; Xiaoming Zhai

arXiv:2501.06704·cs.AI·January 14, 2025

Fine-tuning ChatGPT for Automatic Scoring of Written Scientific Explanations in Chinese

Jie Yang, Ehsan Latif, Yuze He, Xiaoming Zhai

PDF

TL;DR

This paper explores fine-tuning ChatGPT to automatically score Chinese scientific explanations, revealing how linguistic features and reasoning complexity affect scoring accuracy and demonstrating the model's potential in Chinese educational assessment.

Contribution

It introduces a domain-specific fine-tuning approach for ChatGPT to score Chinese explanations and analyzes how linguistic features influence scoring accuracy.

Findings

01

ChatGPT achieves domain-adapted scoring accuracy for Chinese explanations.

02

Scoring accuracy varies with reasoning complexity, showing negative correlation at low levels.

03

Linguistic features like simplicity and clarity impact scoring precision.

Abstract

The development of explanations for scientific phenomena is essential in science assessment, but scoring student-written explanations remains challenging and resource-intensive. Large language models (LLMs) have shown promise in addressing this issue, particularly in alphabetic languages like English. However, their applicability to logographic languages is less explored. This study investigates the potential of fine-tuning ChatGPT, a leading LLM, to automatically score scientific explanations written in Chinese. Student responses to seven scientific explanation tasks were collected and automatically scored, with scoring accuracy examined in relation to reasoning complexity using the Kendall correlation. A qualitative analysis explored how linguistic features influenced scoring accuracy. The results show that domain-specific adaptation enables ChatGPT to score Chinese scientific…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.