Robust Estimation of Item Parameters via Divergence Measures in Item Response Theory
Yuki Itaya, Kenichi Hayashi

TL;DR
This paper introduces robust estimation methods for item response theory that minimize divergence-based objective functions, improving accuracy and robustness against aberrant responses compared to traditional MMLE.
Contribution
It proposes a flexible, divergence-based estimation framework that enhances robustness in IRT, encompassing MMLE as a special case and validated through simulations.
Findings
Proposed methods outperform existing estimators in simulations with aberrant responses.
Estimators are statistically consistent and asymptotically normal.
Influence function analysis shows increased hyperparameters reduce impact of aberrant responses.
Abstract
Marginal maximum likelihood estimation (MMLE) in item response theory (IRT) is highly sensitive to aberrant responses, such as careless answering and random guessing, which can reduce estimation accuracy. To address this issue, this study introduces robust estimation methods for item parameters in IRT. Instead of empirically minimizing Kullback--Leibler divergence as in MMLE, the proposed approach minimizes the objective functions based on robust divergences, specifically density power divergence and {\gamma}-divergence. The resulting estimators are statistically consistent and asymptotically normal under appropriate regularity conditions. Furthermore, they offer a flexible trade-off between robustness and efficiency through hyperparameter tuning, forming a generalized estimation framework encompassing MMLE as a special case. To evaluate the effectiveness of the proposed methods, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Criteria Decision Making · Psychometric Methodologies and Testing
