# Comparison of primary analysis strategies of randomized controlled trials with multiple endpoints with application to kidney transplantation

**Authors:** Felix Herkner, Martin Posch, Gregor Bond, Franz König

PMC · DOI: 10.1038/s41598-026-38979-6 · 2026-02-13

## TL;DR

The paper compares different statistical methods for analyzing clinical trials with multiple endpoints, showing that some methods are more powerful but may be harder to interpret.

## Contribution

The study evaluates three primary analysis strategies for RCTs with multiple endpoints, highlighting their trade-offs in power and interpretability.

## Key findings

- Global testing strategies like composite endpoints and generalized pairwise comparisons showed higher power than multiple testing with correction.
- Global methods can produce significant results even with heterogeneous treatment effects across endpoints.
- The study emphasizes the need for careful interpretation and component-wise assessment when using global procedures.

## Abstract

Relying on a single primary endpoint in randomized controlled trials (RCTs) is often infeasible, for example due to rare or heterogeneous events. Regulatory guidance therefore allows multiple endpoints, but different analytical strategies address different scientific questions and null hypotheses, even when applied to the same set of variables. We explored three approaches to consider multiple endpoints in the primary analysis of RCTs, as stated in the FDA and EMA guidelines on multiplicity: (i) a composite endpoint (CE), (ii) multiple testing and multiplicity correction (MTMC), and (iii) a hierarchical non-parametric procedure, called generalized pairwise comparisons (GPC). Using clinical trial simulations, we compared these strategies’ power in two-arm RCTs perform when testing strategy-specific hypotheses across a range of scenarios reflecting endpoint prioritization, correlation between endpoints, and opposing treatment effects. When testing time-to-event endpoints, global testing strategies (CE and GPC) generally achieved higher power than MTMC. However, we also demonstrate that global procedures may yield statistically significant results even when treatment effects are heterogeneous across endpoints, underscoring the importance of careful interpretation and component-wise assessment. As trials increasingly use multiple endpoints, understanding the trade-off between statistical efficiency and interpretability, and provide practical guidance for choosing endpoint definitions and primary analysis strategies in future trials.

## Full-text entities

- **Diseases:** Infections (MESH:D007239), CE (MESH:D058617), Death (MESH:D003643), cardiovascular disease (MESH:D002318), MTMC (MESH:D013736), graft (MESH:D055589), malignancies (MESH:D009369), GPC (MESH:D004829), infectious diseases (MESH:D003141)
- **Chemicals:** TTVguideIT (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12982691/full.md

---
Source: https://tomesphere.com/paper/PMC12982691