# Overcoming Model Uncertainty — How Equivalence Tests Can Benefit From Model Averaging

**Authors:** Niklas Hagemann, Kathrin Möllenhoff

PMC · DOI: 10.1002/sim.10309 · Statistics in Medicine · 2025-03-19

## TL;DR

This paper introduces a new method using model averaging to improve the accuracy of equivalence tests when the true model is unknown.

## Contribution

The novel approach uses model averaging with Bayesian information criterion weights to handle model uncertainty in equivalence testing.

## Key findings

- Model averaging reduces type I errors and improves test accuracy under model misspecification.
- The method was validated through simulations and a toxicological gene expression case study.
- The proposed test procedure leverages the duality between confidence intervals and hypothesis testing.

## Abstract

A common problem in numerous research areas, particularly in clinical trials, is to test whether the effect of an explanatory variable on an outcome variable is equivalent across different groups. In practice, these tests are frequently used to compare the effect between patient groups, for example, based on gender, age, or treatments. Equivalence is usually assessed by testing whether the difference between the groups does not exceed a pre‐specified equivalence threshold. Classical approaches are based on testing the equivalence of single quantities, for example, the mean, the area under the curve or other values of interest. However, when differences depending on a particular covariate are observed, these approaches can turn out to be not very accurate. Instead, whole regression curves over the entire covariate range, describing for instance the time window or a dose range, are considered and tests are based on a suitable distance measure of two such curves, as, for example, the maximum absolute distance between them. In this regard, a key assumption is that the true underlying regression models are known, which is rarely the case in practice. However, misspecification can lead to severe problems as inflated type I errors or, on the other hand, conservative test procedures. In this paper, we propose a solution to this problem by introducing a flexible extension of such an equivalence test using model averaging in order to overcome this assumption and making the test applicable under model uncertainty. Precisely, we introduce model averaging based on smooth Bayesian information criterion weights and we propose a testing procedure which makes use of the duality between confidence intervals and hypothesis testing. We demonstrate the validity of our approach by means of a simulation study and illustrate its practical relevance considering a time‐response case study with toxicological gene expression data.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11923417/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11923417/full.md

## References

49 references — full list in the complete paper: https://tomesphere.com/paper/PMC11923417/full.md

---
Source: https://tomesphere.com/paper/PMC11923417