Improving item pool utilization for health professions examinations under variable-length computerized adaptive testing designs: a shadow-test approach
Hwanggyu Lim, Kyung (Chris) Tyek Han

TL;DR
This paper introduces new algorithms to improve the efficiency and sustainability of adaptive testing in health professions exams while maintaining content validity.
Contribution
The study proposes and validates new algorithms that significantly improve item pool utilization in shadow-test adaptive testing frameworks.
Findings
Modification 2 reduced unused items from 35.6% to 5.0% in variable-length shadow CAT.
The proposed methods improved item exposure rates and maintained measurement precision.
The new framework offers a secure and sustainable solution for high-stakes health professions assessments.
Abstract
The shadow-test approach to computerized adaptive testing (CAT) ensures content validity in health professions examinations but may suffer from poor item pool utilization in variable-length designs, increasing operational costs and security risks. This study aimed to address this challenge by developing algorithms that enhance the sustainability of shadow CAT in variable-length designs. A simulation study was conducted to evaluate 3 proposed modifications of the α-stratification method designed to improve item pool utilization. These methods, which integrated randomesque selection and multiple-form strategies, were compared with 2 baseline algorithms within a variable-length shadow CAT framework. Performance was assessed in terms of measurement precision, pool utilization, and test efficiency. The proposed modifications significantly outperformed the baseline methods across all…
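To make the two ingredients named in the abstract more concrete, the following is a minimal sketch of generic stratified item selection combined with randomesque selection. It is not the authors' algorithm and it is not a shadow test: there is no content-constraint solver, no ability estimation, and no stopping rule. The item fields (`id`, `a`, `b`), the function names, the default `k=5`, and the rule mapping test position to stratum are all assumptions made for illustration.

```python
import random


def stratify_by_a(pool, n_strata):
    """Split the pool into strata of ascending discrimination (a)."""
    ordered = sorted(pool, key=lambda item: item["a"])
    size, extra = divmod(len(ordered), n_strata)
    strata, start = [], 0
    for s in range(n_strata):
        end = start + size + (1 if s < extra else 0)
        strata.append(ordered[start:end])
        start = end
    return strata


def randomesque_pick(stratum, theta, used, k, rng):
    """Pick at random among the k unused items whose difficulty (b) is closest to theta."""
    candidates = [it for it in stratum if it["id"] not in used]
    candidates.sort(key=lambda it: abs(it["b"] - theta))
    return rng.choice(candidates[:k]) if candidates else None


def select_item(strata, position, max_length, theta, used, k=5, rng=random):
    """Choose the next item: low-a strata early in the test, high-a strata late.

    Assumption: position (0-based) is mapped to a stratum in proportion to a
    maximum test length; a variable-length test may stop before reaching it.
    """
    stage = min(position * len(strata) // max_length, len(strata) - 1)
    # If the target stratum is exhausted, try later strata, then earlier ones.
    order = list(range(stage, len(strata))) + list(range(stage - 1, -1, -1))
    for s in order:
        item = randomesque_pick(strata[s], theta, used, k, rng)
        if item is not None:
            return item
    return None
```

In a simulation, `select_item` would be called once per administered item with the current ability estimate as `theta`, and the returned item's `id` added to `used`. Choosing at random among the `k` best-matching items, rather than always taking the single best match, is what spreads exposure across more of the pool.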
Taxonomy
Topics: Psychometric Methodologies and Testing · Student Assessment and Feedback · Clinical Reasoning and Diagnostic Skills
