Item Quality Control in Educational Testing: Change Point Model, Compound Risk, and Sequential Detection
Yunxiao Chen, Yi-Hsuan Lee, Xiaoou Li

TL;DR
This paper develops a statistical framework for sequentially detecting abrupt changes in test item properties in educational testing, balancing false alarms and missed detections, using a Bayesian change point model and item response theory.
Contribution
It introduces a novel multi-stream Bayesian change point model and sequential decision rules for monitoring item quality, accounting for population changes and controlling compound risk.
Findings
The proposed method effectively detects item changes in simulated operational test settings.
The decision rules balance false detection and missed detection rates.
The framework is applicable to real-time quality control in educational assessments.
Abstract
In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this paper, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric properties, where a change can be caused by, for example, leakage of the item or change of the corresponding curriculum. We propose a statistical framework for the detection of abrupt changes in individual items. This framework consists of (1) a multi-stream Bayesian change point model describing sequential changes in items, (2) a compound risk function quantifying the risk in sequential decisions, and (3) sequential decision rules that control the compound risk. Throughout the sequential decision process, the proposed decision rule balances the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods in Clinical Trials · Psychometric Methodologies and Testing · Advanced Causal Inference Techniques
