TL;DR
This paper introduces FedIRT, a privacy-preserving federated framework for psychometric estimation that maintains accuracy and robustness without sharing raw data, and provides an open-source R package.
Contribution
It develops FedIRT and FedIRT-DP, enabling distributed IRT calibration with formal privacy guarantees and robustness, along with an open-source implementation.
Findings
FedIRT matches centralized estimator accuracy in simulations.
FedIRT-DP achieves comparable accuracy under stronger privacy constraints.
The approach improves robustness to response contamination.
Abstract
Item Response Theory (IRT) models are widely used to estimate respondents' latent abilities and calibrate item difficulty. Traditional IRT estimation typically requires centralizing all raw responses, raising privacy and governance concerns. We introduce Federated Item Response Theory (FedIRT), a framework that enables distributed calibration of standard IRT models without transferring individual-level data, thereby preserving confidentiality while retaining statistical efficiency. To provide formal protection, we further develop FedIRT-DP, a user-level differentially private extension. Each site computes per-student gradients, clips them to a fixed norm, and shares only masked sums; the server adds calibrated Gaussian noise and performs MAP updates. This yields an auditable guarantee at the student level and a single, tunable privacy-utility trade-off via the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
