Component over Composite: Mitigating Type I Error Inflation when Imputing "Days Alive and at Home"
Mia S. Tackney, Sarah Dawson, Letao Yuan, Dominique-Laurent Couturier, Sofia S. Villar

TL;DR
This paper evaluates methods for handling missing data in the Days Alive and at Home (DAH) outcome, highlighting the risks of type I error inflation with certain imputation techniques and recommending component-level imputation.
Contribution
It provides a simulation-based comparison of missing data methods for DAH, emphasizing the advantages of component-level imputation over composite-level approaches.
Findings
Multiple Imputation on components controls type I error well.
Imputing the composite with Predictive Mean Matching can inflate type I error.
Naive imputation methods may lead to incorrect conclusions.
Abstract
Background: Days Alive and at Home (DAH) over a pre-defined follow-up period is a novel post-intervention composite outcome that combines data from at least three components: (i) initial length of hospital stay, (ii) length of total readmissions or other post-discharge care and (iii) mortality. Missing values bring unique challenges to the analysis of trials with the DAH outcome as the three components may have different rates of missingness caused by distinct missing data mechanisms. Current approaches define DAH as missing if any of the components are missing, and proceed with complete cases or Multiple Imputation (MI) of the composite. Methods: Through a simulation study motivated by the NOTACS trial, we compare several methods of handling missing data, including complete case analysis, MI of the composite, and MI of the components when the primary analysis is a Mann-Whitney-Wilcoxon…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
