Exact and Conservative Inference for the Average Treatment Effect in Stratified Experiments with Binary Outcomes
Jiaxun Li, Jacob Spertus, and Philip B. Stark

TL;DR
This paper develops three methods for finite-sample inference of the average treatment effect in stratified experiments with binary outcomes, balancing statistical efficiency and computational feasibility.
Contribution
It introduces three novel methods for exact and conservative inference of ATE in stratified binary outcome experiments, including permutation-based approaches.
Findings
Second method is most statistically efficient in simulations.
Computational complexity varies across methods, with trade-offs.
Methods are validated through simulations and a case study.
Abstract
We extend methods for finite-sample inference about the average treatment effect (ATE) in randomized experiments with binary outcomes to accommodate stratification (blocking). We present three valid methods that differ in their computational and statistical efficiency. The first method constructs conservative, Bonferroni-adjusted confidence intervals separately for the mean response in the treatment and control groups in each stratum, then takes appropriate weighted differences of their endpoints to find a confidence interval for the ATE. The second method inverts permutation tests for the overall ATE, maximizing the -value over all ways a given ATE can be attained. The third method applies permutation tests for the ATE in separate strata, then combines those tests to form a confidence interval for the overall ATE. We compare the statistical and computational performance of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
