A Closer Look at Testing the "No-Treatment-Effect" Hypothesis in a Comparative Experiment
Joseph B. Lang

TL;DR
This paper clarifies the differences among various statistical tests for no-treatment-effect hypotheses, introduces a comprehensive modeling framework, and compares their assumptions and performance through simulations.
Contribution
It develops a unified framework for understanding and comparing different no-effect tests, emphasizing model assumptions and the role of randomization.
Findings
Fisher-type randomization tests are compared to permutation and Neyman-type tests.
Simulation results show differences in test power and validity.
The framework clarifies what hypotheses are tested by each method.
Abstract
Standard tests of the "no-treatment-effect" hypothesis for a comparative experiment include permutation tests, the Wilcoxon rank sum test, two-sample tests, and Fisher-type randomization tests. Practitioners are aware that these procedures test different no-effect hypotheses and are based on different modeling assumptions. However, this awareness is not always, or even usually, accompanied by a clear understanding or appreciation of these differences. Borrowing from the rich literatures on causality and finite-population sampling theory, this paper develops a modeling framework that affords answers to several important questions, including: exactly what hypothesis is being tested, what model assumptions are being made, and are there other, perhaps better, approaches to testing a no-effect hypothesis? The framework lends itself to clear descriptions of three main inference…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
