Properties of the weighted log-rank test in the design of confirmatory studies with delayed effects
Jose L Jimenez, Viktoriya Stalbovskaya, Byron Jones

TL;DR
This paper examines how the weighted log-rank test can be effectively used in clinical trial designs with delayed effects, addressing challenges in sample size calculation and analysis when proportional hazards assumptions are violated.
Contribution
It provides an empirical evaluation of the weighted log-rank test's performance in delayed effect scenarios and offers practical guidelines for its application in adaptive trial designs.
Findings
Weighted log-rank test maintains power with delayed effects.
Incorporating Fleming-Harrington weights improves test efficiency.
Recommendations depend on trial characteristics and delay estimates.
Abstract
Proportional hazards are a common assumption when designing confirmatory clinical trials in oncology. This assumption not only affects the analysis part but also the sample size calculation. The presence of delayed effects causes a change in the hazard ratio while the trial is ongoing since at the beginning we do not observe any difference between treatment arms and after some unknown time point, the differences between treatment arms will start to appear. Hence, the proportional hazards assumption no longer holds and both sample size calculation and analysis methods to be used should be reconsidered. The weighted log-rank test allows a weighting for early, middle and late differences through the Fleming and Harrington class of weights, and is proven to be more efficient when the proportional hazards assumption does not hold. The Fleming and Harrington class of weights, along with the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
