Robustness of ANCOVA in randomised trials with unequal randomisation

Jonathan W. Bartlett

arXiv:1905.08693 · stat.ME · May 22, 2019


TL;DR

This paper investigates the robustness of ANCOVA in randomized trials with unequal randomization, showing that the sandwich standard error is preferable to the model-based one when the randomization probability differs from 1/2.

Contribution

It extends previous results by analyzing ANCOVA's standard error properties under unequal randomization, recommending the sandwich estimator for valid inference.

Findings

01 The model-based standard error is in general inconsistent under outcome model misspecification when randomization is unequal.

02 The sandwich standard error provides asymptotically valid inference under misspecification.

03 Results guide best practices for analyzing randomized trials with unequal allocation.

Abstract

Randomised trials with continuous outcomes are often analysed using ANCOVA, with adjustment for prognostic baseline covariates. In an article published recently, Wang et al proved that in this setting the model based standard error estimator for the treatment effect is consistent under outcome model misspecification, provided the probability of randomisation to each treatment is 1/2. In this article, we extend their results allowing for unequal randomisation. These demonstrate that the model based standard error is in general inconsistent when the randomisation probability differs from 1/2. In contrast, the sandwich standard error can provide asymptotically valid inferences under misspecification when randomisation probabilities are not equal, and is therefore recommended when randomisation is unequal.

Equations

$$E(Y|A,\mathbf{W})=\beta_{0}+\beta_{A}A+\beta^{T}_{\mathbf{W}}\mathbf{W}$$

$$\widehat{Var}(\hat{\Delta}^{ancova})=\frac{\widehat{Var}(Y-\hat{\beta}_{0}-\hat{\beta}_{A}A-\hat{\beta}^{T}_{\mathbf{W}}\mathbf{W})}{(n-1)\left[\widehat{Var}(A)-\widehat{Cov}(\mathbf{W},A)^{T}\widehat{Var}(\mathbf{W})^{-1}\widehat{Cov}(\mathbf{W},A)\right]}$$

$$Var^{*}(\hat{\Delta}^{ancova})=\frac{Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=1)}{\pi}+\frac{Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=0)}{1-\pi}$$

$$n\widehat{Var}(\hat{\Delta}^{ancova})\xrightarrow{P}\frac{Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=1)}{1-\pi}+\frac{Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=0)}{\pi}$$

$$\psi_{\beta}(Y,A,\mathbf{W})=(Y-\beta_{0}-\beta_{A}A-\beta^{T}_{\mathbf{W}}\mathbf{W})\begin{pmatrix}1\\ A\\ \mathbf{W}\end{pmatrix}$$

$$IF_{\hat{\beta}}(Y,A,\mathbf{W})=-\left[E\left(\frac{\partial\psi_{\underline{\beta}}(Y,A,\mathbf{W})}{\partial\beta^{T}}\right)\right]^{-1}\psi_{\underline{\beta}}(Y,A,\mathbf{W})$$

$$IF_{ancova}(Y,A,\mathbf{W})=\frac{A-\pi}{\pi(1-\pi)}(Y-\underline{\beta}_{0}-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W})$$

$$n\widehat{Var}(\hat{\Delta}^{ancova})\xrightarrow{P}\frac{Var(Y-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W})}{\pi(1-\pi)}$$

$$Var(Y-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W})=Var(E(Y-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A))+E(Var(Y-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A))$$

$$E\left[(Y-\underline{\beta}_{0}-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W})A\right]=0$$

$$\pi E\left[Y-\underline{\beta}_{0}-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=1\right]=0$$

$$\pi E(Y-\underline{\beta}_{0}-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=1)+(1-\pi)E(Y-\underline{\beta}_{0}-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=0)=0$$

$$E\left[Y-\underline{\beta}_{0}-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=0\right]=0$$

$$Var(E(Y-\underline{\beta}_{0}-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A))=0$$

$$Var(Y-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W})=\pi Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=1)+(1-\pi)Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=0)$$


Taxonomy

Topics: Statistical Methods and Inference · Statistical Methods in Clinical Trials · Advanced Causal Inference Techniques

Full text

Robustness of ANCOVA in randomised trials with unequal randomisation

JONATHAN W. BARTLETT

Department of Mathematical Sciences, University of Bath, Bath, BA2 7AY, UK


ORCID ID: 0000-0001-7117-0195


Keywords: ANCOVA, baseline adjustment, randomised trials

1 Introduction

In randomised trials with continuous outcomes the baseline covariate adjusted treatment effect estimator is consistent even if the assumed linear regression model (ANCOVA) is misspecified [4]. Recently Wang *et al* proved that under certain conditions, the model based variance estimator from an ANCOVA analysis of a randomised trial is valid under arbitrary misspecification, and therefore advocated its use for analysis of trials with continuous outcomes [3]. Concurrently, the US FDA have issued draft guidance on the topic of baseline covariate adjustment in randomised trials with continuous outcomes [1]. This draft guidance also advocates use of ANCOVA, and states that the type I error rate is controlled even when the model is misspecified.

An assumption used by Wang *et al* is that the probabilities of randomisation to the two arms are equal [3]. While this is commonly the case in randomised trials, many trials are conducted with unequal randomisation probabilities. In particular, the probability of randomisation to the experimental arm is often greater than 1/2, in light of a hoped-for improved outcome on the experimental treatment compared to control. In this article we explore the impact of violations of the equal randomisation probability assumption on the validity of the model based ANCOVA standard error, and thereby the impact on type I error and confidence interval coverage.

2 Model based ANCOVA variance estimation with unequal randomisation

Following the notation of Wang *et al*, we assume we observe $n$ i.i.d. copies of $(\mathbf{W},A,Y)$, where $\mathbf{W}$ is a $k\times 1$ column vector of bounded baseline covariates, $A$ is the binary treatment group indicator ($A=1$ for experimental treatment, $A=0$ for control) and $Y$ is the continuous outcome. Like Wang *et al*, we assume $A\perp\!\!\!\perp\mathbf{W}$, but we let $P(A=1)=\pi$, where $\pi$ may differ from 1/2.

The target of inference is the average treatment effect $\Delta=E(Y|A=1)-E(Y|A=0)$. The unadjusted estimator of $\Delta$ is the difference in treatment group sample means: $\hat{\Delta}^{unadj}=\sum^{n}_{i=1}Y_{i}A_{i}/\sum^{n}_{i=1}A_{i}-\sum^{n}_{i=1}Y_{i}(1-A_{i})/\sum^{n}_{i=1}(1-A_{i})$. The ANCOVA estimator adjusts for the baseline covariates $\mathbf{W}$ by fitting the following linear regression model:

$$E(Y|A,\mathbf{W})=\beta_{0}+\beta_{A}A+\beta^{T}_{\mathbf{W}}\mathbf{W}\qquad(1)$$

where the regression coefficients are estimated by the ordinary least squares estimators $\hat{\beta}_{0}$, $\hat{\beta}_{A}$, and $\hat{\beta}_{\mathbf{W}}$. The ANCOVA estimator $\hat{\Delta}^{ancova}$ of $\Delta$ is $\hat{\Delta}^{ancova}=\hat{\beta}_{A}$. We let $\underline{\beta}_{0}$, $\underline{\beta}_{A}$ and $\underline{\beta}_{\mathbf{W}}$ denote the probability limits of these estimators.
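As a minimal illustration of the two estimators just defined, the following Python sketch computes the difference in arm means and the OLS coefficient on $A$. The function names `unadjusted_estimate` and `ancova_estimate`, and the use of NumPy, are assumptions made for illustration and do not come from the paper.

```python
import numpy as np


def unadjusted_estimate(y, a):
    """Difference in treatment group sample means (hypothetical helper)."""
    y = np.asarray(y, dtype=float)
    a = np.asarray(a)
    return y[a == 1].mean() - y[a == 0].mean()


def ancova_estimate(y, a, w):
    """OLS coefficient on A from regressing Y on (1, A, W) (hypothetical helper)."""
    y = np.asarray(y, dtype=float)
    n = len(y)
    w = np.asarray(w, dtype=float).reshape(n, -1)  # n x k covariate matrix
    x = np.column_stack([np.ones(n), np.asarray(a, dtype=float), w])
    beta, *_ = np.linalg.lstsq(x, y, rcond=None)
    return beta[1]  # position 1 holds the coefficient on A


# Toy data in which Y is exactly linear in A and W with treatment coefficient 2.
a = np.array([1, 0, 1, 0, 1, 0])
w = np.array([0.0, 1.0, 2.0, 3.0, 4.0, 5.0])
y = 1.0 + 2.0 * a + 0.5 * w
print(unadjusted_estimate(y, a), ancova_estimate(y, a, w))
```

In this toy data set the covariate is imbalanced between arms, so the two estimates differ; under randomisation both target the same $\Delta$.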

As noted by Wang *et al*, Yang & Tsiatis [4] and Tsiatis et al [2] proved, under the stated assumptions, that $\hat{\Delta}^{ancova}$ is a consistent estimator of $\Delta$ under arbitrary misspecification of the linear model in equation (1), so that $\underline{\beta}_{A}=\Delta$. Following Wang *et al*, we let $Var^{*}(\hat{\Delta}^{ancova})$ denote the asymptotic variance of $\hat{\Delta}^{ancova}$, in the sense that $n^{1/2}(\hat{\Delta}^{ancova}-\Delta)$ converges in distribution to a mean zero normal with variance $Var^{*}(\hat{\Delta}^{ancova})$.

In statistical software packages, inferences from ANCOVA are by default based on the so-called model based variance estimator for $\hat{\Delta}^{ancova}$, which is given by

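$$\widehat{Var}(\hat{\Delta}^{ancova})=\frac{\widehat{Var}(Y-\hat{\beta}_{0}-\hat{\beta}_{A}A-\hat{\beta}^{T}_{\mathbf{W}}\mathbf{W})}{(n-1)\left[\widehat{Var}(A)-\widehat{Cov}(\mathbf{W},A)^{T}\widehat{Var}(\mathbf{W})^{-1}\widehat{Cov}(\mathbf{W},A)\right]}$$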

where, following Wang *et al*, the estimated variances and covariances on the right hand side are sample variances and sample covariances, with degrees of freedom taken into account (see the Supporting Information of Wang et al [3] for precise definitions). Wang *et al* prove that when $\pi=1/2$, $n\widehat{Var}(\hat{\Delta}^{ancova})$ converges in probability to the true asymptotic variance $Var^{*}(\hat{\Delta}^{ancova})$. As a consequence, under these assumptions, asymptotically Wald-type hypothesis tests have the correct type I error under the null $\Delta=0$ and the corresponding confidence intervals attain their nominal coverage levels.

The following theorem, proved in the Supporting Information, gives the asymptotic variance of $\hat{\Delta}^{ancova}$ for arbitrary $0<\pi<1$, generalising the results of Wang *et al*.

Theorem 1

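Given the previously stated assumptions with $0<\pi<1$, the true asymptotic variance $Var^{*}(\hat{\Delta}^{ancova})$ of the ANCOVA estimator $\hat{\Delta}^{ancova}$ is given by

$$Var^{*}(\hat{\Delta}^{ancova})=\frac{Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=1)}{\pi}+\frac{Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=0)}{1-\pi}$$

The next theorem, again proved in the Supporting Information, gives the probability limit of $n\widehat{Var}(\hat{\Delta}^{ancova})$ under arbitrary $0<\pi<1$.

Theorem 2

For the model based variance estimator $\widehat{Var}(\hat{\Delta}^{ancova})$ we have

$$n\widehat{Var}(\hat{\Delta}^{ancova})\xrightarrow{P}\frac{Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=1)}{1-\pi}+\frac{Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=0)}{\pi}$$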

Together the two theorems imply that the model based variance estimator of $\hat{\Delta}^{ancova}$ is only asymptotically valid (and hence hypothesis tests and confidence intervals have correct asymptotic size and coverage) if $\pi=1/2$, as assumed by Wang *et al*, or if $Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=1)=Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=0)$. Under misspecification of the outcome model, these two conditional variances are not in general equal, so when $\pi\neq 1/2$ the estimator is in general inconsistent. For example, even if the conditional mean function $E(Y|A,\mathbf{W})$ is correctly specified, if the conditional variance of $Y$ given $\mathbf{W}$ differs between the two treatment groups, the model based ANCOVA variance estimator is biased. Alternatively, even if $Var(Y|A=1)=Var(Y|A=0)$, if $Cov(Y,\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=1)\neq Cov(Y,\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=0)$, the model based ANCOVA variance estimator is again biased. This would in general occur if the outcome had the same variance in the two treatment groups, but the covariates $\mathbf{W}$ were prognostic for $Y$ to different extents in the two treatment groups.

We note that a special case of our result occurs when $\mathbf{W}$ is empty, such that $\hat{\Delta}^{ancova}=\hat{\Delta}^{unadj}$. In this case our result corresponds to the well known fact that the two sample t-test does not control the type I error rate in general if the outcome variable has different variance in the two groups, which leads to Welch’s adaptation of the t-test allowing for unequal variances.

Our results imply that when $\pi\neq 1/2$, the model based ANCOVA variance estimator could be biased downwards or upwards, depending on the configuration, leading to a type I error rate either below or above the nominal level. Suppose for example that $\pi>1/2$, such that a greater proportion of patients are randomised to the experimental treatment. Then if $Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=1)>Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=0)$ the model based ANCOVA variance is too large, leading to type I error rates lower than the nominal level, whereas if $Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=1)<Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=0)$ the model based ANCOVA variance is too small, leading to inflated type I error rates.
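The direction of the bias can be checked numerically by evaluating the two limits from Theorems 1 and 2 side by side. The sketch below is only an illustration: the function name `asymptotic_variances` and the example variances are assumptions, with `var1` and `var0` standing for $Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=1)$ and $Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=0)$.

```python
def asymptotic_variances(pi, var1, var0):
    """Return (true, model_based) large-sample limits of n times the variance.

    pi   : randomisation probability P(A = 1), strictly between 0 and 1
    var1 : residual variance of Y - beta_W' W in the experimental arm
    var0 : residual variance of Y - beta_W' W in the control arm
    """
    if not 0.0 < pi < 1.0:
        raise ValueError("pi must lie strictly between 0 and 1")
    true_var = var1 / pi + var0 / (1.0 - pi)      # Theorem 1
    model_based = var1 / (1.0 - pi) + var0 / pi   # Theorem 2
    return true_var, model_based


# pi > 1/2 with larger variance in the experimental arm: model based limit too large.
print(asymptotic_variances(2 / 3, 4.0, 1.0))
# pi > 1/2 with larger variance in the control arm: model based limit too small.
print(asymptotic_variances(2 / 3, 1.0, 4.0))
# Equal randomisation: the two limits coincide whatever the variances.
print(asymptotic_variances(0.5, 1.0, 4.0))
```

With $\pi=2/3$ and variances 4 and 1, the true limit is 9 while the model based limit is 13.5; swapping the variances swaps the two limits.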

3 Discussion

We have shown that, under general misspecification of the outcome model, the model based ANCOVA variance estimator of the average treatment effect is in general inconsistent when $\pi\neq 1/2$. In trials with unequal randomisation this variance estimator therefore cannot be recommended for general use. Instead, the sandwich variance estimator, as described by Tsiatis et al [2], provides asymptotically valid inferences for any randomisation probability under arbitrary misspecification. An important caveat is that this holds for simple randomisation, as was assumed here and in Wang et al [3]. For example, as noted by Wang *et al*, under stratified randomisation schemes, obtaining asymptotically valid standard errors when covariates not used in the randomisation are adjusted for, under general misspecification of the outcome model, remains an open problem.
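To make the contrast concrete, the following Python sketch simulates one large trial with 3:1 randomisation and unequal residual variances, and computes both standard errors for the ANCOVA coefficient on $A$. Everything in it is an assumption made for illustration: the data generating mechanism, the seed, the helper name `ancova_standard_errors`, and the use of the basic White (HC0) form of the sandwich estimator, which may differ in finite-sample details from the version described by Tsiatis et al [2].

```python
import numpy as np


def ancova_standard_errors(y, a, w):
    """Return (estimate, model_based_se, sandwich_se) for the coefficient on A."""
    y = np.asarray(y, dtype=float)
    n = len(y)
    w = np.asarray(w, dtype=float).reshape(n, -1)
    x = np.column_stack([np.ones(n), np.asarray(a, dtype=float), w])
    xtx_inv = np.linalg.inv(x.T @ x)
    beta = xtx_inv @ x.T @ y
    resid = y - x @ beta
    # Model based covariance: residual variance times (X'X)^{-1}.
    sigma2 = resid @ resid / (n - x.shape[1])
    model_cov = sigma2 * xtx_inv
    # HC0 sandwich covariance: (X'X)^{-1} X' diag(resid^2) X (X'X)^{-1}.
    meat = (x * resid[:, None] ** 2).T @ x
    sandwich_cov = xtx_inv @ meat @ xtx_inv
    return beta[1], np.sqrt(model_cov[1, 1]), np.sqrt(sandwich_cov[1, 1])


# Hypothetical trial: pi = 0.75, residual SD 1 (experimental) and 3 (control), Delta = 0.
rng = np.random.default_rng(2019)
n, pi = 20000, 0.75
a = rng.binomial(1, pi, size=n)
w = rng.uniform(-1.0, 1.0, size=n)      # bounded baseline covariate
sd = np.where(a == 1, 1.0, 3.0)
y = 1.0 + 0.5 * w + sd * rng.normal(size=n)

est, se_model, se_sandwich = ancova_standard_errors(y, a, w)
print("estimate:", est)
print("n * model based variance:", n * se_model ** 2)
print("n * sandwich variance:   ", n * se_sandwich ** 2)
```

Under this set-up the residual variances are 1 and 9, so Theorem 1 gives a true limit of $1/0.75+9/0.25\approx 37.3$ and Theorem 2 gives a model based limit of $1/0.25+9/0.75=16$. The scaled sandwich variance should be close to the former and the scaled model based variance close to the latter, which is the anti-conservative case described above.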

Acknowledgments

The author thanks David Wright and Daniel Jackson for useful discussions on the topic.

Supporting Information

We prove Theorems 1 and 2 of the main paper, referring frequently to the supporting information of Wang et al [3].

Proof of Theorem 1

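Following the proof of Theorem B.2 of Wang *et al*, the estimating function corresponding to the ANCOVA regression is given by

$$\psi_{\beta}(Y,A,\mathbf{W})=(Y-\beta_{0}-\beta_{A}A-\beta^{T}_{\mathbf{W}}\mathbf{W})\begin{pmatrix}1\\ A\\ \mathbf{W}\end{pmatrix}$$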

Then, as noted by Wang *et al*, the OLS estimators $\hat{\beta}=(\hat{\beta}_{0},\hat{\beta}_{A},\hat{\beta}^{T}_{\mathbf{W}})$ are the solutions to the estimating equation $\sum^{n}_{i=1}\psi_{\hat{\beta}}(Y_{i},A_{i},\mathbf{W}_{i})=0$ and their probability limit $\underline{\beta}$ satisfies $E(\psi_{\underline{\beta}}(Y,A,\mathbf{W}))=0$. The influence function of $\hat{\beta}$ is

$$IF_{\hat{\beta}}(Y,A,\mathbf{W})=-\left[E\left(\frac{\partial\psi_{\underline{\beta}}(Y,A,\mathbf{W})}{\partial\beta^{T}}\right)\right]^{-1}\psi_{\underline{\beta}}(Y,A,\mathbf{W})$$

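After some matrix algebra, and using the fact that $A\perp\!\!\!\perp\mathbf{W}$, one can show that the influence function of $\hat{\Delta}^{ancova}$ is

$$IF_{ancova}(Y,A,\mathbf{W})=\frac{A-\pi}{\pi(1-\pi)}(Y-\underline{\beta}_{0}-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W})$$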
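The matrix algebra step can be sanity-checked numerically. With $X=(1,A,\mathbf{W}^{T})^{T}$, the derivative of the estimating function is $-XX^{T}$, so the influence function of $\hat{\beta}$ is $E(XX^{T})^{-1}X$ times the residual. The claimed form for the coefficient on $A$, namely $(A-\pi)/\{\pi(1-\pi)\}$ times the residual, then amounts to the row of $E(XX^{T})^{-1}$ for $A$ being $(-1/(1-\pi),\,1/\{\pi(1-\pi)\},\,\mathbf{0}^{T})$. The sketch below verifies this for one hypothetical choice of $\pi$, $E(\mathbf{W})$ and $Var(\mathbf{W})$; these values are assumptions for illustration only.

```python
import numpy as np

pi = 0.7                                   # hypothetical P(A = 1)
mu = np.array([0.3, -1.2])                 # hypothetical E(W)
sigma = np.array([[1.0, 0.4],
                  [0.4, 2.0]])             # hypothetical Var(W)
k = len(mu)
eww = sigma + np.outer(mu, mu)             # E(W W^T)

# Build E(X X^T) for X = (1, A, W^T)^T using A independent of W and E(A) = E(A^2) = pi.
m = np.zeros((k + 2, k + 2))
m[0, 0] = 1.0
m[0, 1] = m[1, 0] = pi
m[1, 1] = pi
m[0, 2:] = m[2:, 0] = mu
m[1, 2:] = m[2:, 1] = pi * mu
m[2:, 2:] = eww

# Row of the inverse that multiplies X in the influence function of the A coefficient.
row_a = np.linalg.inv(m)[1]
expected = np.concatenate([[-1.0 / (1.0 - pi), 1.0 / (pi * (1.0 - pi))], np.zeros(k)])
print(np.allclose(row_a, expected))
```

Multiplying this row by $X$ gives $-1/(1-\pi)+A/\{\pi(1-\pi)\}=(A-\pi)/\{\pi(1-\pi)\}$, with no contribution from $\mathbf{W}$.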

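Note that when $\pi=1/2$, this reduces to the corresponding expression given by Wang *et al*. The asymptotic variance of the estimator $\hat{\Delta}^{ancova}$ is then given by the variance of this influence function. Since influence functions have mean zero, this variance is given by

$$Var(IF_{ancova}(Y,A,\mathbf{W}))=E\left[\frac{(A-\pi)^{2}}{\pi^{2}(1-\pi)^{2}}(Y-\underline{\beta}_{0}-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W})^{2}\right]=\frac{E\left[(Y-\underline{\beta}_{0}-\underline{\beta}_{A}-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W})^{2}|A=1\right]}{\pi}+\frac{E\left[(Y-\underline{\beta}_{0}-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W})^{2}|A=0\right]}{1-\pi}$$

Then, since the components of $E(\psi_{\underline{\beta}}(Y,A,\mathbf{W}))=0$ corresponding to the intercept and to $A$ imply that $Y-\underline{\beta}_{0}-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}$ has mean zero within each treatment group, the conditional second moments above are conditional variances, and thus

$$Var(IF_{ancova}(Y,A,\mathbf{W}))=\frac{Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=1)}{\pi}+\frac{Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=0)}{1-\pi}$$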

as required.

Proof of Theorem 2

The proof of Theorem B.3 of Wang *et al* explains why $\widehat{Var}(Y-\hat{\beta}_{0}-\hat{\beta}_{A}A-\hat{\beta}^{T}_{\mathbf{W}}\mathbf{W})\xrightarrow{P}Var(Y-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W})$, and their argument applies for any $0<\pi<1$. Next, we have that $n/(n-1)\rightarrow 1$, $\widehat{Var}(A)\xrightarrow{P}Var(A)=\pi(1-\pi)$, $\widehat{Var}(\mathbf{W})\xrightarrow{P}Var(\mathbf{W})$ and, by independence of $A$ and $\mathbf{W}$, $\widehat{Cov}(\mathbf{W},A)\xrightarrow{P}Cov(\mathbf{W},A)=\mathbf{0}$. Then from the definition of $\widehat{Var}(\hat{\Delta}^{ancova})$ it follows that

$$n\widehat{Var}(\hat{\Delta}^{ancova})\xrightarrow{P}\frac{Var(Y-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W})}{\pi(1-\pi)}$$

Next, we write the variance in the numerator as

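$$Var(Y-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W})=Var(E(Y-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A))+E(Var(Y-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A))$$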

The second of these terms can be expressed as

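$$E(Var(Y-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A))=\pi Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=1)+(1-\pi)Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=0)$$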

Next, the fact that $E(\psi_{\underline{\beta}}(Y,A,\mathbf{W}))=0$ implies

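$$E\left[(Y-\underline{\beta}_{0}-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W})A\right]=0$$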

which in turn implies

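$$\pi E\left[Y-\underline{\beta}_{0}-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=1\right]=0$$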

Then the fact that $E\left(Y-\underline{\beta}_{0}-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}\right)=0$ means that

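$$\pi E(Y-\underline{\beta}_{0}-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=1)+(1-\pi)E(Y-\underline{\beta}_{0}-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=0)=0$$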

and since $\pi E\left[Y-\underline{\beta}_{0}-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=1\right]=0$ , we have that

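$$E\left[Y-\underline{\beta}_{0}-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=0\right]=0$$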

Thus we have shown that

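$$Var(E(Y-\underline{\beta}_{0}-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A))=0$$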

and therefore also that

$$Var(E(Y-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A))=0$$

We have thus shown

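$$Var(Y-\underline{\beta}_{A}A-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W})=\pi Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=1)+(1-\pi)Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=0)$$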

and so

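$$n\widehat{Var}(\hat{\Delta}^{ancova})\xrightarrow{P}\frac{\pi Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=1)+(1-\pi)Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=0)}{\pi(1-\pi)}=\frac{Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=1)}{1-\pi}+\frac{Var(Y-\underline{\beta}^{T}_{\mathbf{W}}\mathbf{W}|A=0)}{\pi}$$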

as was required to be shown.

Bibliography

[1] U.S. Food and Drug Administration. Adjusting for Covariates in Randomized Clinical Trials for Drugs and Biologics with Continuous Outcomes. https://www.fda.gov/regulatory-information/search-fda-guidance-documents/adjusting-covariates-randomized-clinical-trials-drugs-and-biologics-continuous-outcomes-guidance, 2019.
[2] Anastasios A Tsiatis, Marie Davidian, Min Zhang, and Xiaomin Lu. Covariate adjustment for two-sample treatment comparisons in randomized clinical trials: a principled yet flexible approach. Statistics in Medicine, 27(23):4658–4677, 2008.
[3] Bingkai Wang, Elizabeth L Ogburn, and Michael Rosenblum. Analysis of covariance (ANCOVA) in randomized trials: More precision and valid confidence intervals, without model assumptions. Biometrics, 2019.
[4] Li Yang and Anastasios A Tsiatis. Efficiency study of estimators for a treatment effect in a pretest–posttest trial. The American Statistician, 55(4):314–321, 2001.