Leveraging AI Graders for Missing Score Imputation to Achieve Accurate Ability Estimation in Constructed-Response Tests
Masaki Uto, Yuma Ito

TL;DR
This paper introduces a novel approach that uses AI-based automated scoring to impute missing responses in constructed-response tests, significantly improving ability estimation accuracy and reducing manual grading efforts.
Contribution
It presents a new method combining AI grading with item response theory to accurately estimate abilities from incomplete data, addressing limitations of previous imputation techniques.
Findings
High accuracy in ability estimation with AI-based score imputation
Significant reduction in manual grading workload
Effective handling of sparse and heterogeneous data
Abstract
Evaluating the abilities of learners is a fundamental objective in the field of education. In particular, there is an increasing need to assess higher-order abilities such as expressive skills and logical thinking. Constructed-response tests such as short-answer and essay-based questions have become widely used as a method to meet this demand. Although these tests are effective, they require substantial manual grading, making them both labor-intensive and costly. Item response theory (IRT) provides a promising solution by enabling the estimation of ability from incomplete score data, where human raters grade only a subset of answers provided by learners across multiple test items. However, the accuracy of ability estimation declines as the proportion of missing scores increases. Although data augmentation techniques for imputing missing scores have been explored in order to address this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEducational Technology and Assessment · Machine Learning and ELM
