Non-parametric assessment of the calibration of individualized treatment effects
Mohsen Sadatsafavi, Jeroen Hoogland, Thomas P.A. Debray, John Petkau

TL;DR
This paper introduces non-parametric, inferential methods to assess the moderate calibration of individualized treatment effect models for binary outcomes, addressing unobserved counterfactuals and continuous predictions.
Contribution
It proposes novel numerical, graphical, and inferential tools based on stochastic processes and Brownian motion for calibration assessment without parametric assumptions.
Findings
Methods accurately detect miscalibration in simulations
Approaches are applicable to randomized trial data
Graphical tools aid interpretation of calibration results
Abstract
An important aspect of the performance of algorithms that predict individualized treatment effects (ITE) is moderate calibration, i.e., the average treatment effect among individuals with predicted treatment effect of z being equal to z. The assessment of moderate calibration is a challenging task on two fronts: counterfactual responses are unobserved, and quantifying the conditional response function for models that generate continuous predicted values requires regularization or parametric modeling. Perhaps because of these challenges, there is currently no inferential method for the null hypothesis that an ITE model is moderately calibrated in a population. In this work, we propose non-parametric methods for the assessment of moderate calibration of ITE models for binary outcomes using data from a randomized trial. These methods simultaneously resolve both challenges, resulting in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques · Statistical Methods and Inference · Statistical Methods and Bayesian Inference
