G-computation for increasing performances of clinical trials with individual randomization and binary response
Joe de Keizer, R\'emi Lenain, Rapha\"el Porcher, Sarah Zoha, Arthur, Chatton, Yohann Foucher

TL;DR
This study explores the use of G-computation combined with machine learning techniques to improve the power of randomized clinical trials by effectively adjusting for prognostic factors, especially in small sample sizes.
Contribution
It demonstrates that penalized regression methods like Elasticnet with splines enhance trial power and reduce sample size needs, offering a flexible adjustment approach for complex outcome relationships.
Findings
Penalized regressions reduce variance and sample size.
Algorithm-based methods underestimate standard deviation in small samples.
Elasticnet with splines is particularly effective for small sample adjustments.
Abstract
In a clinical trial, the random allocation aims to balance prognostic factors between arms, preventing true confounders. However, residual differences due to chance may introduce near-confounders. Adjusting on prognostic factors is therefore recommended, especially because the related increase of the power. In this paper, we hypothesized that G-computation associated with machine learning could be a suitable method for randomized clinical trials even with small sample sizes. It allows for flexible estimation of the outcome model, even when the covariates' relationships with outcomes are complex. Through simulations, penalized regressions (Lasso, Elasticnet) and algorithm-based methods (neural network, support vector machine, super learner) were compared. Penalized regressions reduced variance but may introduce a slight increase in bias. The associated reductions in sample size ranged…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods in Clinical Trials · Receptor Mechanisms and Signaling · SARS-CoV-2 detection and testing
