Construction and assessment of prediction rules for binary outcome in the presence of missing predictor data using multiple imputation: theoretical perspective and data-based evaluation
B. J. A. Mertens, E. Banzato, L.C. de Wreede

TL;DR
This paper develops and evaluates methods for calibrating and assessing binary outcome prediction rules with missing predictor data using multiple imputation, ensuring unbiased performance evaluation and practical implementation.
Contribution
It introduces two novel approaches for integrating multiple imputation with predictive calibration, compatible with existing software, and compares their effectiveness.
Findings
Little difference in accuracy between methods.
Significant reduction in variability of calibrated probabilities with the first approach.
Methods are applicable with current software for validation and imputation.
Abstract
We investigate the problem of calibration and assessment of predictive rules in prognostic designs when missing values are present in the predictors. Our paper has two key objectives which are entwined. The first is to investigate how the calibration of the prediction rule can be combined with the use of multiple imputation to account for missing predictor observations. The second objective is to propose such methods that can be implemented with current multiple imputation software, while allowing for unbiased predictive assessment through validation on new observations for which outcome is not yet available. To inform the definition of methodology, we commence with a review of the theoretical background of multiple imputation as a model estimation approach as opposed to a purely algorithmic description. We specifically contrast application of multiple imputation for parameter…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Criteria Decision Making · Advanced Statistical Modeling Techniques · Statistical Methods and Bayesian Inference
