Beyond Static Scoring: Enhancing Assessment Validity via AI-Generated Interactive Verification

Tom Lee; Sihoon Lee; Seonghun Kim

arXiv:2512.12592 · cs.CY · December 16, 2025

TL;DR

This paper proposes a Human-AI collaboration framework that combines automated scoring with AI-generated follow-up questions to improve assessment validity and authenticity beyond static scoring methods.

Contribution

It introduces a novel interactive verification approach that integrates AI-generated questions with automated scoring to enhance assessment validity and detect superficial reasoning.

Findings

01 Stage 1 (Auto-Scoring) ensures procedural fairness and consistency.

02 Stage 2 (Interactive Verification) effectively diagnoses superficial reasoning or unverified AI use.

03 Instructor perceptions highlight the importance of adaptive questioning.

Abstract

Large Language Models (LLMs) challenge the validity of traditional open-ended assessments by blurring the lines of authorship. While recent research has focused on the accuracy of automated essay scoring (AES), these static approaches fail to capture process evidence or verify genuine student understanding. This paper introduces a novel Human-AI Collaboration framework that enhances assessment integrity by combining rubric-based automated scoring with AI-generated, targeted follow-up questions. In a pilot study with university instructors (N=9), we demonstrate that while Stage 1 (Auto-Scoring) ensures procedural fairness and consistency, Stage 2 (Interactive Verification) is essential for construct validity, effectively diagnosing superficial reasoning or unverified AI use. We report on the system's design, instructor perceptions of fairness versus validity, and the necessity of adaptive…

Taxonomy

Topics: Intelligent Tutoring Systems and Adaptive Learning · Artificial Intelligence in Healthcare and Education · Student Assessment and Feedback