Bridging Qualitative Rubrics and AI: A Binary Question Framework for Criterion-Referenced Grading in Engineering
Lili Chen, Winn Wing-Yiu Chow, Stella Peng, Bencheng Fan, Sachitha Bandara

TL;DR
This paper presents a novel binary question framework integrating GenAI with criterion-referenced grading to enhance feedback quality and grading accuracy in engineering assessments, showing promising results but highlighting the need for further reliability improvements.
Contribution
Introduces a structured binary question approach combining GenAI with criterion-referenced grading for engineering assessments, improving feedback and accuracy.
Findings
GenAI achieved 92.5% grading accuracy, comparable to experienced human graders.
GenAI provided more complete feedback and improved error detection.
The system enhanced formative feedback but is not yet fully autonomous.
Abstract
PURPOSE OR GOAL: This study investigates how GenAI can be integrated with a criterion-referenced grading framework to improve the efficiency and quality of grading for mathematical assessments in engineering. It specifically explores the challenges demonstrators face with manual, model solution-based grading and how a GenAI-supported system can be designed to reliably identify student errors, provide high-quality feedback, and support human graders. The research also examines human graders' perceptions of the effectiveness of this GenAI-assisted approach. ACTUAL OR ANTICIPATED OUTCOMES: The study found that GenAI achieved an overall grading accuracy of 92.5%, comparable to two experienced human graders. The two researchers, who also served as subject demonstrators, perceived the GenAI as a helpful second reviewer that improved accuracy by catching small errors and provided more complete…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStudent Assessment and Feedback · Mathematics Education and Programs · Mathematics Education and Teaching Techniques
