Blurring cluster randomized trials and observational studies using Two-Stage TMLE to address sub-sampling, missingness, and minimal independent units
Joshua R. Nugent, Carina Marquez, Edwin D. Charlebois, Rachel Abbott,, Laura B. Balzer (for the SEARCH Collaboration)

TL;DR
This paper develops and applies advanced statistical methods to improve the analysis of cluster randomized trials with sub-sampling and missing data, aiming to reduce bias and increase efficiency.
Contribution
It extends Two-Stage TMLE to handle multiple sources of missingness and critically evaluates assumptions about independence within clusters, enhancing analysis accuracy.
Findings
Improved bias reduction in CRT estimates.
Enhanced statistical power through new assumptions.
Application to SEARCH trial demonstrates practical benefits.
Abstract
Cluster randomized trials (CRTs) often enroll large numbers of participants, but due to logistical and fiscal challenges, only a subset of participants may be selected for measurement of certain outcomes, and those sampled may, purposely or not, be unrepresentative of all participants. Missing data also present a challenge: if sampled individuals with measured outcomes are dissimilar from those with missing outcomes, unadjusted estimates of arm-specific outcomes and the intervention effect may be biased. Further, CRTs often enroll and randomize few clusters by necessity, limiting statistical power and raising concerns about finite sample performance. Motivated by a sub-study of the SEARCH community randomized trial on the incidence of TB infection, we demonstrate interlocking methods to handle these challenges. First, we extend Two-Stage targeted minimum loss-based estimation (TMLE) to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMeta-analysis and systematic reviews · Advanced Causal Inference Techniques · Statistical Methods and Bayesian Inference
