Leveraging AI Graders for Missing Score Imputation to Achieve Accurate Ability Estimation in Constructed-Response Tests

Masaki Uto; Yuma Ito

arXiv:2506.20119·cs.CL·June 26, 2025

Leveraging AI Graders for Missing Score Imputation to Achieve Accurate Ability Estimation in Constructed-Response Tests

Masaki Uto, Yuma Ito

PDF

Open Access

TL;DR

This paper introduces a novel approach that uses AI-based automated scoring to impute missing responses in constructed-response tests, significantly improving ability estimation accuracy and reducing manual grading efforts.

Contribution

It presents a new method combining AI grading with item response theory to accurately estimate abilities from incomplete data, addressing limitations of previous imputation techniques.

Findings

01

High accuracy in ability estimation with AI-based score imputation

02

Significant reduction in manual grading workload

03

Effective handling of sparse and heterogeneous data

Abstract

Evaluating the abilities of learners is a fundamental objective in the field of education. In particular, there is an increasing need to assess higher-order abilities such as expressive skills and logical thinking. Constructed-response tests such as short-answer and essay-based questions have become widely used as a method to meet this demand. Although these tests are effective, they require substantial manual grading, making them both labor-intensive and costly. Item response theory (IRT) provides a promising solution by enabling the estimation of ability from incomplete score data, where human raters grade only a subset of answers provided by learners across multiple test items. However, the accuracy of ability estimation declines as the proportion of missing scores increases. Although data augmentation techniques for imputing missing scores have been explored in order to address this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEducational Technology and Assessment · Machine Learning and ELM