Sigmoid-FTRL: Design-Based Adaptive Neyman Allocation for AIPW Estimators
Fangyi Chen, Shu Ge, Jian Qian, Christopher Harshaw

TL;DR
This paper introduces Sigmoid-FTRL, an adaptive experimental design for AIPW estimators that minimizes Neyman Regret in a non-convex setting, providing theoretical guarantees and valid confidence intervals.
Contribution
It proposes Sigmoid-FTRL, a novel adaptive procedure addressing non-convex optimization in Neyman Allocation for AIPW estimators, with proven convergence rates and minimax optimality.
Findings
Neyman Regret of Sigmoid-FTRL converges at a $T^{-1/2} R$ rate.
No adaptive design can outperform the $T^{-1/2} R$ rate under regularity conditions.
Established a central limit theorem and conservative variance estimator for valid confidence intervals.
Abstract
We consider the problem of Adaptive Neyman Allocation for the class of AIPW estimators in a design-based setting, where potential outcomes and covariates are deterministic. As each subject arrives, an adaptive procedure must select both a treatment assignment probability and a pair of linear predictors to be used in the AIPW estimator. Our goal is to construct an adaptive procedure that minimizes the Neyman Regret, which is the difference between the variance of the adaptive procedure and an oracle variance which uses the optimal non-adaptive choice of assignment probabilities and linear predictors. While previous work has drawn insightful connections between Neyman Regret and online convex optimization for the Horvitz--Thompson estimator, one of the central challenges for the AIPW estimator is that the underlying optimization is non-convex. In this paper, we propose Sigmoid-FTRL, an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques · Advanced Bandit Algorithms Research · Statistical Methods and Inference
