Scalable Generation and Validation of Isomorphic Physics Problems with GenAI
Naiming Liu, Leo Murch, Spencer Moore, Tong Wan, Shashank Sonkar, Richard Baraniuk, Zhongzhou Chen

TL;DR
This paper introduces a scalable framework using Generative AI to create and validate large banks of isomorphic physics problems for asynchronous assessments, ensuring consistent difficulty and diverse contexts.
Contribution
The work presents a novel prompt chaining approach for generating structurally varied physics problems and evaluates their difficulty using open-source language models and student data.
Findings
73% of generated problem banks are statistically homogeneous in difficulty.
Language models' performance correlates with student performance (up to 0.594 Pearson's rho).
Model scale impacts validation effectiveness, with mid-sized models being optimal.
Abstract
Traditional synchronous STEM assessments face growing challenges including accessibility barriers, security concerns from resource-sharing platforms, and limited comparability across institutions. We present a framework for generating and evaluating large-scale isomorphic physics problem banks using Generative AI to enable asynchronous, multi-attempt assessments. Isomorphic problems test identical concepts through varied surface features and contexts, providing richer variation than conventional parameterized questions while maintaining consistent difficulty. Our generation framework employs prompt chaining and tool use to achieve precise control over structural variations (numeric values, spatial relations) alongside diverse contextual variations. For pre-deployment validation, we evaluate generated items using 17 open-source language models (LMs) (0.6B-32B) and compare against actual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTeaching and Learning Programming · Intelligent Tutoring Systems and Adaptive Learning · Science Education and Pedagogy
