Estimating Heterogeneous Treatment Effects with Item-Level Outcome Data: Insights from Item Response Theory
Joshua B. Gilbert, Zachary Himmelsbach, James Soland, Mridul Joshi, Benjamin W. Domingue

TL;DR
This paper introduces an Item Response Theory-based approach to estimate heterogeneous treatment effects at the item level, improving accuracy and insights in causal inference with psychometric outcome data.
Contribution
It develops a novel IL-HTE model using IRT that captures item-level treatment effects, addressing standard method limitations and enhancing causal effect estimation.
Findings
IL-HTE reveals item-level variation masked by total scores
Provides more accurate standard errors in treatment effect estimates
Enables generalization of effects to untested items
Abstract
Analyses of heterogeneous treatment effects (HTE) are common in applied causal inference research. However, when outcomes are latent variables assessed via psychometric instruments such as educational tests, standard methods ignore the potential HTE that may exist among the individual items of the outcome measure. Failing to account for "item-level" HTE (IL-HTE) can lead to both underestimated standard errors and identification challenges in the estimation of treatment-by-covariate interaction effects. We demonstrate how Item Response Theory (IRT) models that estimate a treatment effect for each assessment item can both address these challenges and provide new insights into HTE generally. This study articulates the theoretical rationale for the IL-HTE model and demonstrates its practical value using 75 datasets from 48 randomized controlled trials containing 5.8 million item responses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsResilience and Mental Health · Psychometric Methodologies and Testing · Evaluation and Performance Assessment
MethodsCausal inference
