# A Bayesian Optimal Phase II Design for Randomized Immunotherapy Trials with Delayed Treatment Effects

**Authors:** Zhongheng Cai, Haitao Pan

arXiv: 2509.00238 · 2025-09-03

## TL;DR

This paper introduces a Bayesian phase II trial design tailored for immunotherapy, effectively handling delayed treatment effects and controlling type I error, with demonstrated improved power and smaller sample sizes.

## Contribution

It develops a novel Bayesian design (DTE-BOP2) that models treatment delay timing, enhancing trial efficiency and error control compared to existing methods.

## Key findings

- DTE-BOP2 controls type I error across delay scenarios.
- Power decreases as the treatment effect separation time increases.
- The design achieves higher power with smaller sample sizes.

## Abstract

Immunotherapy has transformed cancer treatment, yet its delayed therapeutic effects often lead to non-proportional hazards, rendering many conventional phase II designs underpowered and prone to type I error inflation. To address this issue, we propose a novel Bayesian Optimal Phase II design (DTE-BOP2) that explicitly models the uncertainty in the separation timing of treatment effect. The treatment separation timepoint (denoted by S) is endowed with a truncated-Gamma prior, whose parameters can be elicited from experts or inferred from historical data, with default settings available when prior knowledge is scarce. Built upon the BOP2 framework (Zhou et al. 2017, 2020), our design retains operational simplicity while incorporating type I error control and maintaining the power. Extensive simulations demonstrate that DTE-BOP2 uniformly controls type I error at the nominal level across a wide range of treatment effect separation timepoint S. We further observe that the power decreases monotonically as S increases. Importantly, we find that the power is primarily driven by the relative magnitude of treatment benefit before and after the separation time, i.e., the ratio of medians, rather than their absolute values. Compared to the original BOP2, the piecewise weighted log-rank, and the conventional log-rank tests, DTE-BOP2 achieves higher power with smaller sample sizes while preserving type I error robustness across plausible delay scenarios. An open-source R package, DTEBOP2 (CRAN), with detailed vignettes, enables investigators to implement the design and analyse phase-II trials exhibiting delayed treatment effects.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/2509.00238/full.md

## Figures

14 figures with captions in the complete paper: https://tomesphere.com/paper/2509.00238/full.md

## References

18 references — full list in the complete paper: https://tomesphere.com/paper/2509.00238/full.md

---
Source: https://tomesphere.com/paper/2509.00238