Evaluating individualized treatment effect predictions: a model-based perspective on discrimination and calibration assessment
J Hoogland, O Efthimiou, TL Nguyen, TPA Debray

TL;DR
This paper develops and evaluates model-based metrics for assessing the discrimination and calibration of individualized treatment effect prediction models, emphasizing their validation in randomized trial data.
Contribution
It introduces novel model-based discrimination and calibration measures grounded in outcome risk prediction, enhancing validation of treatment effect models.
Findings
Model-based metrics outperform traditional measures in bias and accuracy.
Resampling methods have high variance, limiting their reliability.
Independent data validation is recommended for better assessment.
Abstract
In recent years, there has been a growing interest in the prediction of individualized treatment effects. While there is a rapidly growing literature on the development of such models, there is little literature on the evaluation of their performance. In this paper, we aim to facilitate the validation of prediction models for individualized treatment effects. The estimands of interest are defined as based on the potential outcomes framework, which facilitates a comparison of existing and novel measures. In particular, we examine existing measures of measures of discrimination for benefit (variations of the c-for-benefit), and propose model-based extensions to the treatment effect setting for discrimination and calibration metrics that have a strong basis in outcome risk prediction. The main focus is on randomized trial data with binary endpoints and on models that provide individualized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHealth Systems, Economic Evaluations, Quality of Life · Advanced Causal Inference Techniques · Meta-analysis and systematic reviews
