Using Auxiliary Data to Boost Precision in the Analysis of A/B Tests on   an Online Educational Platform: New Data and New Results

Adam C. Sales; Ethan B. Prihar; Johann A. Gagnon-Bartsch; Neil T.; Heffernan

arXiv:2306.06273·stat.ME·June 13, 2023·1 cites

Using Auxiliary Data to Boost Precision in the Analysis of A/B Tests on an Online Educational Platform: New Data and New Results

Adam C. Sales, Ethan B. Prihar, Johann A. Gagnon-Bartsch, Neil T., Heffernan

PDF

Open Access

TL;DR

This paper demonstrates that incorporating auxiliary data through deep learning models significantly improves the precision of causal effect estimates in A/B tests on an online educational platform, effectively increasing sample size and accuracy.

Contribution

It applies recent machine learning-based causal inference methods to real educational data, showing substantial gains in precision and subgroup analysis capabilities.

Findings

01

Auxiliary data integration increases effective sample size by up to 80%.

02

Deep learning models improve subgroup effect estimation accuracy.

03

Method remains unbiased and effective even with unrepresentative remnant data.

Abstract

Randomized A/B tests within online learning platforms represent an exciting direction in learning sciences. With minimal assumptions, they allow causal effect estimation without confounding bias and exact statistical inference even in small samples. However, often experimental samples and/or treatment effects are small, A/B tests are underpowered, and effect estimates are overly imprecise. Recent methodological advances have shown that power and statistical precision can be substantially boosted by coupling design-based causal estimation to machine-learning models of rich log data from historical users who were not in the experiment. Estimates using these techniques remain unbiased and inference remains exact without any additional assumptions. This paper reviews those methods and applies them to a new dataset including over 250 randomized A/B comparisons conducted within ASSISTments,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Causal Inference Techniques · Online Learning and Analytics