Conditional bias reduction can be dangerous: a key example from sequential analysis
Ben Berckmoes, Anna Ivanova, Geert Molenberghs

TL;DR
This paper demonstrates that applying conditional bias reduction in sequential analysis can lead to infinite mean absolute error, highlighting potential dangers of this approach.
Contribution
It provides a critical example showing that conditional bias reduction may have unintended harmful effects in sequential analysis.
Findings
Conditional bias reduction can cause infinite mean absolute error.
The paper offers a key example illustrating this danger.
Highlights the need for caution in bias reduction methods.
Abstract
We present a key example from sequential analysis, which illustrates that conditional bias reduction can cause infinite mean absolute error.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods in Clinical Trials · Optimal Experimental Design Methods · Advanced Statistical Process Monitoring
Conditional bias reduction can be dangerous:
a key example from sequential analysis
Ben Berckmoes, Anna Ivanova, Geert Molenberghs
Abstract.
We present a key example from sequential analysis, which illustrates that conditional bias reduction can cause infinite mean absolute error.
Key words and phrases:
conditional MLE, marginal MLE, group sequential trial, mean absolute error
Ben Berckmoes is post doctoral fellow at the Fund for Scientific Research of Flanders (FWO)
Financial support from the IAP research network #P7/06 of the Belgian Government (Belgian Science Policy) is gratefully acknowledged.
1. Introduction
The following group sequential paradigm has been studied extensively in the literature, see e.g. [BIM18, C89, EF90, FDL00, HP88, LH99, MKA14, W92].
Let be independent and identically distributed observations with normal law , and, for each , an -valued random sample size that solely depends on through the stopping rule
[TABLE]
where is a Borel measurable map of into , a shape parameter, and, for , . The choice leads to Pocock boundaries ([P77]) and the choice to O’Brien-Fleming boundaries ([OF79]).
The above setting models the idea that, after having collected the data , it is decided, based on the stopping rule (1), if the trial is stopped (that is, the final sample size is ), or continued (that is, the additional data are collected and the final sample size is ).
Assuming known, the following estimators for the location parameter are often discussed in the literature ([FDL00],[MKA14]):
(a) the marginal MLE, defined by the parameter value that maximizes the marginal likelihood
[TABLE]
where is the standard normal density. Of course, the marginal MLE is the ordinary sample mean
[TABLE]
This approach is simple, because it is based on the likelihood of the collected data only, without taking the stopping mechanism into account. The marginal MLE has been criticized in the literature, because it has potentially large bias ([EF90]). However, it was shown in [BIM18] that in many cases the bias vanishes quickly if grows.
(b) the conditional MLE , defined by the parameter value that, for , maximizes the conditional likelihood
[TABLE]
This approach is complex, because contrary to the marginal MLE, it also models the stopping mechanism. An explicit value for the conditional MLE cannot be obtained, and one has to rely on numerical methods to calculate it. However, the conditional MLE, also known as the conditional bias reduction estimate ([FDL00]), is favored by the literature because it is claimed to reduce bias by taking all information into account.
In this paper, we will show that if we take , , , and arbitrary, then
[TABLE]
That is, conditional bias reduction can cause infinite mean absolute error.
2. Mean absolute error
We keep the setting of the previous section, and we take , , , and arbitrary. So the stopping rule (1) is now turned into
[TABLE]
That is, after having collected the -data , the trial is stopped if and continued otherwise.
We first focus on the marginal MLE . Let be the standard normal density and the standard normal cumulative distribution function. Following [BIM18], we see that the joint density of and is given by
[TABLE]
and
[TABLE]
We learn from (5) and (2) that
[TABLE]
with a standard normally distributed random variable. It clearly follows from (2) that
[TABLE]
That is, the mean absolute error of with respect to the true parameter [math] vanishes if .
We now turn to the conditional MLE , which maximizes the conditional likelihood (4). It is easily seen that this estimator is obtained by solving the equation
[TABLE]
with , in the case , and the equation
[TABLE]
with , in the case . One checks numerically that the map strictly increases on from [math] to and that the map strictly increases on from to . In particular, and are bijective, from which it follows that is uniquely defined by
[TABLE]
if , and
[TABLE]
if . Applying the Transformation Theorem, and using (5) and (9), we get, for each and each ,
[TABLE]
which, by the well known integral equality
[TABLE]
which, plugging in the definition of and calculating the integral,
[TABLE]
It can be checked numerically that, for fixed , expression (11) tends to if . We conclude that
[TABLE]
We infer from (8) and (12) that conditional bias reduction can cause infinite mean absolute error.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[BIM 18] Berckmoes, B.; Ivanova, A.; Molenberghs, G. (2018) On the sample mean after a group sequential trial Computational Statistics & Data Analysis, 125, 104-118. https://arxiv.org/pdf/1706.01291.pdf
- 2[C 89] Chang, M. N. (1989) Confidence intervals for a normal mean following a group sequential test. Biometrics 45, no. 1, 247–254.
- 3[EF 90] Emerson, S. S.; Fleming, T. R. (1990) Parameter estimation following group sequential hypothesis testing. Biometrika 77, 875–892.
- 4[FDL 00] Fan, X. F.; De Mets, D. L.; Lan, G. (2000) Bias point of estimation following a group sequential test. Technical report https://www.biostat.wisc.edu/sites/default/files/tr_157.pdf
- 5[HP 88] Hughes, M.D.; Pocock, S.J. (1988) Stopping rules and estimation problems in clinical trials. Statistics in Medicine 7, 1231–1242.
- 6[LH 99] Liu, A.; Hall, W. J. (1999) Unbiased estimation following a group sequential test. Biometrika 86, 71–78.
- 7[MKA 14] Molenberghs, G.; Kenward, M. G.; Aerts, M.; Verbeke, G.; Tsiatis, A. A.; Davidian, M.; Rizopoulos, D. (2014) On random sample size, ignorability, ancillarity, completeness, separability, and degeneracy: sequential trials, random sample sizes, and missing data. Stat. Methods Med. Res. 23, no. 1, 11–41.
- 8[S 78] Siegmund, D. (1978) Estimation following sequential tests. Biometrika 64, 191–199.
