Implementing multiple imputation for missing data in longitudinal studies when models are not feasible: A tutorial on the random hot deck approach
Chinchin Wang, Tyrel Stokes, Russell Steele, Niels Wedderkopp, Ian, Shrier

TL;DR
This paper introduces a practical tutorial on using random hot deck multiple imputation for handling missing data in longitudinal studies when traditional model-based methods are infeasible, ensuring plausible data imputation.
Contribution
It demonstrates how to implement random hot deck imputation in longitudinal data, providing an alternative to model-based methods under data constraints.
Findings
Effective in maintaining data plausibility
Applicable to longitudinal studies with complex constraints
Produces multiple complete datasets for analysis
Abstract
Objective: Researchers often use model-based multiple imputation to handle missing at random data to minimize bias while making the best use of all available data. However, there are sometimes constraints within the data that make model-based imputation difficult and may result in implausible values. In these contexts, we describe how to use random hot deck imputation to allow for plausible multiple imputation in longitudinal studies. Study Design and Setting: We illustrate random hot deck multiple imputation using The Childhood Health, Activity, and Motor Performance School Study Denmark (CHAMPS-DK), a prospective cohort study that measured weekly sports participation for 1700 Danish schoolchildren. We matched records with missing data to several observed records, generated probabilities for matched records using observed data, and sampled from these records based on the probability…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Bayesian Inference · Statistical Methods and Inference · Advanced Causal Inference Techniques
