Variable Selection in Latent Regression IRT Models via Knockoffs: An Application to International Large-scale Assessment in Education
Zilong Xie, Yunxiao Chen, Matthias von Davier, Haolei Weng

TL;DR
This paper introduces a novel knockoff-based variable selection method for latent regression IRT models, applied to PISA data, effectively identifying non-cognitive factors linked to student performance while controlling false discovery rates.
Contribution
It develops a new knockoff approach tailored for latent variable models in ILSAs, addressing missing data and multiple comparisons in variable selection.
Findings
Successfully identified non-cognitive variables associated with science performance.
Controlled false discovery rate in variable selection.
Applied method to PISA data with promising results.
Abstract
International large-scale assessments (ILSAs) play an important role in educational research and policy making. They collect valuable data on education quality and performance development across many education systems, giving countries the opportunity to share techniques, organizational structures, and policies that have proven efficient and successful. To gain insights from ILSA data, we identify non-cognitive variables associated with students' academic performance. This problem has three analytical challenges: 1) academic performance is measured by cognitive items under a matrix sampling design; 2) there are many missing values in the non-cognitive variables; and 3) multiple comparisons due to a large number of non-cognitive variables. We consider an application to the Programme for International Student Assessment (PISA), aiming to identify non-cognitive variables associated with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOnline Learning and Analytics
