Using Auxiliary Data to Boost Precision in the Analysis of A/B Tests on an Online Educational Platform: New Data and New Results
Adam C. Sales, Ethan B. Prihar, Johann A. Gagnon-Bartsch, Neil T., Heffernan

TL;DR
This paper demonstrates that incorporating auxiliary data through deep learning models significantly improves the precision of causal effect estimates in A/B tests on an online educational platform, effectively increasing sample size and accuracy.
Contribution
It applies recent machine learning-based causal inference methods to real educational data, showing substantial gains in precision and subgroup analysis capabilities.
Findings
Auxiliary data integration increases effective sample size by up to 80%.
Deep learning models improve subgroup effect estimation accuracy.
Method remains unbiased and effective even with unrepresentative remnant data.
Abstract
Randomized A/B tests within online learning platforms represent an exciting direction in learning sciences. With minimal assumptions, they allow causal effect estimation without confounding bias and exact statistical inference even in small samples. However, often experimental samples and/or treatment effects are small, A/B tests are underpowered, and effect estimates are overly imprecise. Recent methodological advances have shown that power and statistical precision can be substantially boosted by coupling design-based causal estimation to machine-learning models of rich log data from historical users who were not in the experiment. Estimates using these techniques remain unbiased and inference remains exact without any additional assumptions. This paper reviews those methods and applies them to a new dataset including over 250 randomized A/B comparisons conducted within ASSISTments,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques · Online Learning and Analytics
