Applying IRT to Distinguish Between Human and Generative AI Responses to Multiple-Choice Assessments
Alona Strugatski, Giora Alexandron

TL;DR
This paper introduces a novel application of Item Response Theory to distinguish between human and AI responses in multiple-choice assessments, providing a new tool for detecting AI cheating in education.
Contribution
It presents the first method applying IRT and Person-Fit Statistics to identify AI-generated responses in MCQ tests, highlighting differences in response patterns.
Findings
Effective differentiation between human and AI responses using IRT.
Sensitivity of the method to the amount of AI cheating present.
Chatbots exhibit distinct reasoning profiles.
Abstract
Generative AI is transforming the educational landscape, raising significant concerns about cheating. Despite the widespread use of multiple-choice questions in assessments, the detection of AI cheating in MCQ-based tests has been almost unexplored, in contrast to the focus on detecting AI-cheating on text-rich student outputs. In this paper, we propose a method based on the application of Item Response Theory to address this gap. Our approach operates on the assumption that artificial and human intelligence exhibit different response patterns, with AI cheating manifesting as deviations from the expected patterns of human responses. These deviations are modeled using Person-Fit Statistics. We demonstrate that this method effectively highlights the differences between human responses and those generated by premium versions of leading chatbots (ChatGPT, Claude, and Gemini), but that it is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Cognitive Science and Mapping · Impact of AI and Big Data on Business and Society
MethodsFocus
