Equivalence testing with data-dependent and post-hoc equivalence margins
Stan Koobs, Nick W. Koning

TL;DR
This paper proposes a data-dependent, probabilistically bounded equivalence margin for testing effects, offering more practical and flexible decision-making tools than traditional fixed-margin approaches, supported by e-value methods.
Contribution
It introduces a novel paradigm of reporting data-dependent equivalence margins with probabilistic guarantees, generalizes to margin curves, and derives e-values for models with strict total positivity.
Findings
Data-dependent margins bound the true effect with high probability.
The approach generalizes to margin curves under post-hoc selection.
E-values are derived for models with strict total positivity.
Abstract
Equivalence testing compares the hypothesis that an effect is large against the alternative that it is negligible. Here, `large' is classically expressed as being larger than some `equivalence margin' . A longstanding problem is that this margin must be specified but can rarely be objectively justified in practice. We lay the foundation for an alternative paradigm, arguing to instead report a data-dependent margin that bounds the true effect with probability . Our key argument is that is more useful than a test outcome at a fixed margin , as measured by the guarantees it offers to decision makers. We generalize this to a curve of margins , uniformly valid under the post-hoc selection of the margin. These ideas rely on e-values, which we derive for models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods in Clinical Trials · Statistical Methods and Inference · Advanced Causal Inference Techniques
