A derivation of the optimal answer-copying index and some applications
Mauricio Romero, Alvaro Riascos, Diego Jara

TL;DR
This paper derives an optimal statistical index for detecting answer-copying in multiple-choice exams, demonstrating its superior power through simulations and real data analysis, and proposing a correction method for large-scale cheating detection.
Contribution
It introduces a mathematically supported, optimal answer-copying index based on the UMP, validated with real exam data and simulations, improving cheating detection accuracy.
Findings
The derived index outperforms existing indices in power while maintaining the same type-I error.
Stricter proctoring correlates with lower copying levels in exams.
A Bonferroni correction effectively detects widespread cheating.
Abstract
Multiple-choice exams are frequently used as an efficient and objective method to assess learning but they are more vulnerable to answer-copying than tests based on open questions. Several statistical tests (known as indices in the literature) have been proposed to detect cheating; however, to the best of our knowledge they all lack mathematical support that guarantees optimality in any sense. We partially fill this void by deriving the uniform most powerful (UMP) under the assumption that the response distribution is known. In practice, however, we must estimate a behavioral model that yields a response distribution for each question. We calculate the empirical type-I and type-II error rates for several indices that assume different behavioral models using simulations based on real data from twelve nationwide multiple-choice exams taken by 5th and 9th graders in Colombia. We find that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Reliability and Agreement in Measurement · Machine Learning and Data Classification
