# Reevaluating the Design of Multicenter Surgical Trials for Esophagogastric Cancer

**Authors:** Bibek Das, Anuja T. Mitra, Patrick Bossuyt, George B. Hanna

PMC · DOI: 10.1245/s10434-025-18532-x · 2025-10-23

## TL;DR

This paper examines why surgical trials for esophagogastric cancer often fail to show survival benefits, pointing to flaws in trial design and surgical quality assurance.

## Contribution

The study identifies surgical variability and inadequate quality assurance as critical factors undermining the success of multicenter surgical trials.

## Key findings

- 27 RCTs were analyzed, but none showed significant survival benefits despite being powered for OS superiority.
- Suboptimal surgical quality assurance and poor adherence monitoring were common in the trials.
- Simulations showed that even 10% nonadherence could halve statistical power, requiring much larger sample sizes.

## Abstract

Multicenter randomized controlled trials (RCTs) in esophagogastric cancer surgery have repeatedly failed to demonstrate overall survival (OS) benefits for technical interventions. This raises concerns about trial design frameworks, particularly surgical quality assurance (SQA) and their impact on statistical power and sample size requirements.

We conducted a systematic review of MEDLINE, Embase, and Cochrane CENTRAL (January 1990 to June 2024) to identify multicenter RCTs evaluating curative surgical interventions for esophageal or gastric cancer that reported OS outcomes with ≥ 12 months of follow-up. Trials were assessed for SQA strategies, protocol adherence, and sample size assumptions. Post hoc power analyses and statistical simulations modeled the impact of surgical variability on trial outcomes.

We identified 27 eligible RCTs; 10 were powered for OS superiority, but none showed significant OS benefit. Suboptimal SQA was common, including limited surgeon credentialing, poor adherence monitoring, and inadequate technical standardization. Three trials were underpowered at design, and three failed to meet recruitment targets. Simulations showed that a technical nonadherence rate of 10% per arm could halve statistical power. Sample size requirements to preserve 80% power increased markedly, sometimes by thousands of participants. Most trials lacked internal piloting to assess protocol feasibility and baseline adherence.

Inadequate SQA and failure to account for surgical variability may partly explain the lack of OS benefit in multicenter surgical RCTs. Embedding SQA and internal pilot studies into trial design may improve feasibility, ensure protocol fidelity, and strengthen the validity of future trials in complex oncologic surgery.

The online version contains supplementary material available at 10.1245/s10434-025-18532-x.

## Linked entities

- **Diseases:** esophageal cancer (MONDO:0007576), gastric cancer (MONDO:0001056)

## Full-text entities

- **Diseases:** esophageal or gastric cancer (MESH:D013274), Esophagogastric Cancer (MESH:D009369)

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12765741/full.md

---
Source: https://tomesphere.com/paper/PMC12765741