# Can expected error costs justify testing a hypothesis at multiple alpha levels rather than searching for an elusive optimal alpha?

**Authors:** Janet Aisbett, Stephan Leitner, Stephan Leitner, Stephan Leitner, Stephan Leitner

PMC · DOI: 10.1371/journal.pone.0304675 · 2024-09-25

## TL;DR

This paper explores whether testing a hypothesis at multiple alpha levels can reduce expected error costs compared to searching for a single optimal alpha.

## Contribution

It introduces a method for multi-alpha level testing and compares its expected error costs to single-alpha and optimal approaches.

## Key findings

- Multi-alpha level tests can yield acceptable expected total error costs.
- Optimization of error costs is highly sensitive to assumptions about prevalence and cost estimates.
- Using multiple default thresholds may lead to lower average error costs than mis-specified optimal models.

## Abstract

Simultaneous testing of one hypothesis at multiple alpha levels can be performed within a conventional Neyman-Pearson framework. This is achieved by treating the hypothesis as a family of hypotheses, each member of which explicitly concerns test level as well as effect size. Such testing encourages researchers to think about error rates and strength of evidence in both the statistical design and reporting stages of a study. Here, we show that these multi-alpha level tests can deliver acceptable expected total error costs. We first present formulas for expected error costs from single alpha and multiple alpha level tests, given prior probabilities of effect sizes that have either dichotomous or continuous distributions. Error costs are tied to decisions, with different decisions assumed for each of the potential outcomes in the multi-alpha level case. Expected total costs for tests at single and multiple alpha levels are then compared with optimal costs. This comparison highlights how sensitive optimization is to estimated error costs and to assumptions about prevalence. Testing at multiple default thresholds removes the need to formally identify decisions, or to model costs and prevalence as required in optimization approaches. Although total expected error costs with this approach will not be optimal, our results suggest they may be lower, on average, than when “optimal” test levels are based on mis-specified models.

## Full-text entities

- **Diseases:** COVID-19 (MESH:D000086382), weight loss (MESH:D015431), death (MESH:D003643), COMMENT 4a (OMIM:600652), obesity (MESH:D009765), dying (MESH:D064806), carcinogenic (MESH:D011230), COMMENT 1c (MESH:C536486), MISSING AXIS LABELS (MESH:D000030), II (MESH:C537730), COMMENT 4 (MESH:D053632), overweight (MESH:D050177)
- **Chemicals:** fish oil (MESH:D005395), fiber (MESH:D004043), fat (MESH:D005223), PONE-D-23 (-), Molnupiravir (MESH:C000656703)
- **Species:** Homo sapiens (human, species) [taxon 9606], Canis lupus familiaris (dog, subspecies) [taxon 9615]
- **Cell lines:** S2 — Drosophila melanogaster (Fruit fly), Spontaneously immortalized cell line (CVCL_Z232)

## Figures

21 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11424007/full.md

---
Source: https://tomesphere.com/paper/PMC11424007