Equivalence of regression curves sharing common parameters

Kathrin M\"ollenhoff; Frank Bretz; Holger Dette

arXiv:1902.03456·stat.ME·February 12, 2019

Equivalence of regression curves sharing common parameters

Kathrin M\"ollenhoff, Frank Bretz, Holger Dette

PDF

TL;DR

This paper introduces a bootstrap test for comparing two regression curves sharing common parameters, demonstrating its effectiveness through theory, simulations, and a clinical trial example.

Contribution

It develops a new bootstrap test for assessing the similarity of regression curves with shared parameters, improving power over traditional methods.

Findings

01

Test controls level effectively

02

Achieves higher power with shared parameters

03

Validated through simulation and clinical trial example

Abstract

In clinical trials the comparison of two different populations is a frequently addressed problem. Non-linear (parametric) regression models are commonly used to describe the relationship between covariates as the dose and a response variable in the two groups. In some situations it is reasonable to assume some model parameters to be the same, for instance the placebo effect or the maximum treatment effect. In this paper we develop a (parametric) bootstrap test to establish the similarity of two regression curves sharing some common parameters. We show by theoretical arguments and by means of a simulation study that the new test controls its level and achieves a reasonable power. Moreover, it is demonstrated that under the assumption of common parameters a considerable more powerful test can be constructed compared to the test which does not use this assumption. Finally, we illustrate…

Figures5

Click any figure to enlarge with its caption.

Tables5

Table 1. Table 1: Simulated Type I error of the bootstrap test ( 2.9 ) for the equivalence of two sigmoid Emax models defined in Scenario 1 with ε = 1 𝜀 1 \varepsilon=1 . The numbers in brackets show the simulated Type I error when fixing the Hill parameter at β 0 , 3 = 4 subscript 𝛽 0 3 4 \beta_{0,3}=4 .

		$α = 0.05$			$α = 0.1$
$n_{ℓ}$	$d_{\infty}$	$σ^{2} = 1$	$σ^{2} = 2$	$σ^{2} = 3$	$σ^{2} = 1$	$σ^{2} = 2$	$σ^{2} = 3$
$30$	2	0.002 (0.000)	0.010 (0.005)	0.012 (0.009)	0.002 (0.004)	0.018 (0.013)	0.031 (0.019)
$30$	1.5	0.014 (0.013)	0.035 (0.028)	0.027 (0.043)	0.026 (0.020)	0.055 (0.049)	0.052 (0.070)
$30$	1	0.068 (0.058)	0.054 (0.065)	0.051 (0.060)	0.106 (0.099)	0.109 (0.114)	0.115 (0.121)
$90$	2	0.000 (0.000)	0.000 (0.000)	0.001 (0.002)	0.000 (0.000)	0.000 (0.000)	0.002 (0.002)
$90$	1.5	0.004 (0.001)	0.010 (0.006)	0.013 (0.015)	0.005 (0.005)	0.021 (0.014)	0.026 (0.020)
$90$	1	0.048 (0.071)	0.053 (0.043)	0.062 (0.059)	0.104 (0.122)	0.101 (0.097)	0.129 (0.117)
$150$	2	0.000 (0.000)	0.000 (0.000)	0.000 (0.000)	0.000 (0.000)	0.000 (0.000)	0.000 (0.004)
$150$	1.5	0.000 (0.000)	0.003 (0.000)	0.002 (0.000)	0.002 (0.000)	0.011 (0.002)	0.012 (0.009)
$150$	1	0.056 (0.061)	0.040 (0.061)	0.042 (0.063)	0.103 (0.102)	0.090 (0.102)	0.096 (0.109)

Table 2. Table 2: RRMSE of the parameters obtained in the model estimation step of the bootstrap test ( 2.9 ) for the equivalence of two sigmoid Emax models defined in Scenario 1 with d ∞ = 1 subscript 𝑑 1 d_{\infty}=1 . The numbers in brackets show the values for the RRMSE when fixing the Hill parameter at β 0 , 3 = 4 subscript 𝛽 0 3 4 \beta_{0,3}=4 .

$n_{ℓ}$	$σ^{2}$	$β_{0, 1}$	$β_{0, 2}$	$β_{0, 3}$	${\tilde{β}}_{1, 4}$	${\tilde{β}}_{2, 4}$
$30$	1	0.288 (0.263)	0.091 (0.062)	0.493	0.125 (0.108)	0.114 (0.106)
$30$	2	0.389 (0.358)	0.118 (0.090)	0.739	0.174 (0.157)	0.172 (0.148)
$30$	3	0.460 (0.427)	0.134 (0.105)	0.841	0.228 (0.198)	0.211 (0.185)
$90$	1	0.166 (0.152)	0.054 (0.036)	0.184	0.067 (0.064)	0.063 (0.061)
$90$	2	0.237 (0.219)	0.076 (0.054)	0.355	0.107 (0.086)	0.091 (0.086)
$90$	3	0.280 (0.261)	0.091 (0.062)	0.450	0.130 (0.112)	0.113 (0.105)
$150$	1	0.123 (0.115)	0.040 (0.029)	0.129	0.050 (0.057)	0.049 (0.046)
$150$	2	0.171 (0.170)	0.057 (0.041)	0.241	0.073 (0.072)	0.068 (0.063)
$150$	3	0.219 (0.204)	0.070 (0.050)	0.322	0.095 (0.086)	0.091 (0.084)

Table 3. Table 3: Simulated power of the bootstrap test ( 2.9 ) for the equivalence of two sigmoid Emax models defined in Scenario 1 with ε = 1 𝜀 1 \varepsilon=1 . The numbers in brackets show the simulated power when fixing the Hill parameter at β 0 , 3 = 4 subscript 𝛽 0 3 4 \beta_{0,3}=4 .

		$α = 0.05$			$α = 0.1$
$n_{ℓ}$	$d_{\infty}$	$σ^{2} = 1$	$σ^{2} = 2$	$σ^{2} = 3$	$σ^{2} = 1$	$σ^{2} = 2$	$σ^{2} = 3$
$30$	0.5	0.137 (0.154)	0.075 (0.078)	0.070 (0.073)	0.238 (0.266)	0.172 (0.155)	0.137 (0.145)
$30$	0.25	0.208 (0.190)	0.102 (0.101)	0.081 (0.088)	0.344 (0.349)	0.196 (0.188)	0.152 (0.170)
$30$	0	0.181 (0.203)	0.105 (0.105)	0.086 (0.092)	0.333 (0.361)	0.196 (0.213)	0.154 (0.154)
$90$	0.5	0.341 (0.424)	0.180 (0.230)	0.132 (0.153)	0.505 (0.581)	0.311 (0.357)	0.246 (0.279)
$90$	0.25	0.550 (0.675)	0.249 (0.315)	0.166 (0.190)	0.733 (0.802)	0.428 (0.484)	0.305 (0.348)
$90$	0	0.664 (0.783)	0.286 (0.353)	0.191 (0.188)	0.822 (0.884)	0.463 (0.562)	0.338 (0.367)
$150$	0.5	0.481 (0.593)	0.297 (0.359)	0.207 (0.273)	0.635 (0.729)	0.460 (0.502)	0.355 (0.406)
$150$	0.25	0.826 (0.868)	0.448 (0.569)	0.280 (0.357)	0.902 (0.933)	0.635 (0.719)	0.477 (0.545)
$150$	0	0.917 (0.961)	0.559 (0.665)	0.342 (0.415)	0.966 (0.989)	0.740 (0.812)	0.520 (0.596)

Table 4. Table 4: Simulated Type I error of the bootstrap test ( 2.9 ) for the equivalence of two sigmoid Emax models defined in Scenario 2 with ε = 1 𝜀 1 \varepsilon=1 . The numbers in brackets show the simulated Type I error when fixing the Hill parameters at their true underlying values.

		$α = 0.05$			$α = 0.1$
$n_{ℓ}$	$d_{\infty}$	$σ^{2} = 1$	$σ^{2} = 2$	$σ^{2} = 3$	$σ^{2} = 1$	$σ^{2} = 2$	$σ^{2} = 3$
$30$	2	0.002 (0.000)	0.001 (0.000)	0.001 (0.000)	0.000 (0.005)	0.003 (0.007)	0.006 (0.011)
$30$	1.5	0.000 (0.001)	0.003 (0.000)	0.008 (0.001)	0.007 (0.033)	0.002 (0.030)	0.016 (0.054)
$30$	1	0.036 (0.035)	0.044 (0.040)	0.053 (0.039)	0.083 (0.099)	0.110 (0.116)	0.113 (0.112)
$90$	2	0.000 (0.000)	0.000 (0.000)	0.000 (0.000)	0.000 (0.000)	0.000 (0.002)	0.004 (0.002)
$90$	1.5	0.000 (0.000)	0.000 (0.000)	0.004 (0.000)	0.000 (0.007)	0.004 (0.021)	0.012 (0.022)
$90$	1	0.052 (0.057)	0.028 (0.040)	0.016 (0.036)	0.104 (0.113)	0.068 (0.117)	0.056 (0.105)
$150$	2	0.000 (0.000)	0.000 (0.000)	0.000 (0.000)	0.000 (0.000)	0.000 (0.000)	0.000 (0.000)
$150$	1.5	0.000 (0.000)	0.000 (0.000)	0.004 (0.000)	0.000 (0.000)	0.000 (0.004)	0.004 (0.004)
$150$	1	0.056 (0.032)	0.036 (0.036)	0.040 (0.036)	0.100 (0.088)	0.080 (0.100)	0.076 (0.088)

Table 5. Table 5: Simulated power of the bootstrap test ( 2.9 ) for the equivalence of two sigmoid Emax models defined in Scenario 2 with ε = 1 𝜀 1 \varepsilon=1 . The numbers in brackets show the simulated power when fixing the Hill parameters at their true underlying values.

		$α = 0.05$			$α = 0.1$
$n_{ℓ}$	$d_{\infty}$	$σ^{2} = 1$	$σ^{2} = 2$	$σ^{2} = 3$	$σ^{2} = 1$	$σ^{2} = 2$	$σ^{2} = 3$
$30$	0.5	0.130 (0.147)	0.096 (0.075)	0.094 (0.072)	0.231 (0.245)	0.191 (0.133)	0.174 (0.125)
$30$	0.25	0.156 (0.194)	0.102 (0.089)	0.087 (0.085)	0.275 (0.319)	0.197 (0.178)	0.157 (0.147)
$30$	0	0.155 (0.164)	0.108 (0.076)	0.087 (0.059)	0.310 (0.316)	0.197 (0.166)	0.189 (0.137)
$90$	0.5	0.312 (0.542)	0.124 (0.225)	0.140 (0.133)	0.528 (0.697)	0.240 (0.384)	0.260 (0.240)
$90$	0.25	0.384 (0.689)	0.224 (0.289)	0.164 (0.173)	0.560 (0.841)	0.396 (0.484)	0.292 (0.313)
$90$	0	0.448 (0.663)	0.192 (0.259)	0.148 (0.163)	0.616 (0.807)	0.372 (0.455)	0.240 (0.309)
$150$	0.5	0.528 (0.780)	0.220 (0.392)	0.160 (0.228)	0.688 (0.896)	0.404 (0.632)	0.256 (0.400)
$150$	0.25	0.724 (0.904)	0.320 (0.544)	0.260 (0.304)	0.824 (0.936)	0.540 (0.740)	0.408 (0.516)
$150$	0	0.644 (0.920)	0.308 (0.580)	0.224 (0.248)	0.800 (0.956)	0.532 (0.728)	0.404 (0.484)

Equations105

Y_{ℓ, i, j} = m_{ℓ} (d_{ℓ, i}, β_{ℓ}) + η_{ℓ, i, j}, j = 1, \dots, n_{ℓ, i}, i = 1, \dots, k_{ℓ},

Y_{ℓ, i, j} = m_{ℓ} (d_{ℓ, i}, β_{ℓ}) + η_{ℓ, i, j}, j = 1, \dots, n_{ℓ, i}, i = 1, \dots, k_{ℓ},

β_{ℓ} = (β_{0}, \tilde{β}_{ℓ}) \in R^{p_{ℓ}}, ℓ = 1, 2,

β_{ℓ} = (β_{0}, \tilde{β}_{ℓ}) \in R^{p_{ℓ}}, ℓ = 1, 2,

\displaystyle\hat{\beta}=(\hat{\beta}_{0},\hat{\tilde{\beta}}_{1},\hat{\tilde{\beta}}_{2})=\operatorname{arg\,min}\limits_{(b,,\tilde{b}_{1},,\tilde{b}_{2})\in B}\sum_{\ell=1}^{2}\sum_{i=1}^{k_{\ell}}\sum_{j=1}^{n_{\ell,i}}\big{(}Y_{\ell,i,j}-m_{\ell}(d_{\ell,i},(b_{0},\tilde{b}_{\ell}))\big{)}^{2}.

\displaystyle\hat{\beta}=(\hat{\beta}_{0},\hat{\tilde{\beta}}_{1},\hat{\tilde{\beta}}_{2})=\operatorname{arg\,min}\limits_{(b,,\tilde{b}_{1},,\tilde{b}_{2})\in B}\sum_{\ell=1}^{2}\sum_{i=1}^{k_{\ell}}\sum_{j=1}^{n_{\ell,i}}\big{(}Y_{\ell,i,j}-m_{\ell}(d_{\ell,i},(b_{0},\tilde{b}_{\ell}))\big{)}^{2}.

d_{\infty} (β_{1}, β_{2}) = d \in D max ∣ m_{1} (d, β_{1}) - m_{2} (d, β_{2}) ∣ < ε .

d_{\infty} (β_{1}, β_{2}) = d \in D max ∣ m_{1} (d, β_{1}) - m_{2} (d, β_{2}) ∣ < ε .

H_{0} : d_{\infty} (β_{1}, β_{2}) \geq ε \mbox v er s u s H_{1} : d_{\infty} (β_{1}, β_{2}) < ε .

H_{0} : d_{\infty} (β_{1}, β_{2}) \geq ε \mbox v er s u s H_{1} : d_{\infty} (β_{1}, β_{2}) < ε .

\overset{σ}{^}_{ℓ}^{2} = \frac{1}{n _{ℓ}} i = 1 \sum k_{ℓ} j = 1 \sum n_{ℓ, i} (Y_{ℓ, i, j} - m_{ℓ} (d_{ℓ, i}, \hat{β}_{ℓ}))^{2}, ℓ = 1, 2,

\overset{σ}{^}_{ℓ}^{2} = \frac{1}{n _{ℓ}} i = 1 \sum k_{ℓ} j = 1 \sum n_{ℓ, i} (Y_{ℓ, i, j} - m_{ℓ} (d_{ℓ, i}, \hat{β}_{ℓ}))^{2}, ℓ = 1, 2,

\hat{d}_{\infty} = d_{\infty} (\hat{β}_{1}, \hat{β}_{2}) = d \in D max ∣ m_{1} (d, \hat{β}_{1}) - m_{2} (d, \hat{β}_{2}) ∣

\hat{d}_{\infty} = d_{\infty} (\hat{β}_{1}, \hat{β}_{2}) = d \in D max ∣ m_{1} (d, \hat{β}_{1}) - m_{2} (d, \hat{β}_{2}) ∣

{\hat{\hat{\beta}}_{\ell}}=\left\{\begin{array}[]{ccc}\hat{\beta}_{\ell}&\mbox{if}&\hat{d}_{\infty}\geq\varepsilon\\ \bar{\beta}_{\ell}&\mbox{if}&\hat{d}_{\infty}<\varepsilon\end{array}\right.,\qquad\ell=1,2,

{\hat{\hat{\beta}}_{\ell}}=\left\{\begin{array}[]{ccc}\hat{\beta}_{\ell}&\mbox{if}&\hat{d}_{\infty}\geq\varepsilon\\ \bar{\beta}_{\ell}&\mbox{if}&\hat{d}_{\infty}<\varepsilon\end{array}\right.,\qquad\ell=1,2,

d_{\infty} (β_{1}, β_{2}) = d \in D max ∣ m_{1} (d, β_{1}) - m_{2} (d, β_{2}) ∣ = ε .

d_{\infty} (β_{1}, β_{2}) = d \in D max ∣ m_{1} (d, β_{1}) - m_{2} (d, β_{2}) ∣ = ε .

Y_{ℓ, i, j}^{*} = m_{ℓ} (d_{ℓ, i}, (\hat{\hat{β}}_{0}, \hat{\hat{\tilde{β}}}_{ℓ})) + η_{ℓ, i, j}^{*}, i = 1, \dots, n_{ℓ, i}, ℓ = 1, 2,

Y_{ℓ, i, j}^{*} = m_{ℓ} (d_{ℓ, i}, (\hat{\hat{β}}_{0}, \hat{\hat{\tilde{β}}}_{ℓ})) + η_{ℓ, i, j}^{*}, i = 1, \dots, n_{ℓ, i}, ℓ = 1, 2,

\hat{d}_{\infty}^{*} = d \in D max ∣ m_{1} (d, \hat{β}_{1}^{*}) - m_{2} (d, \hat{β}_{2}^{*}) ∣,

\hat{d}_{\infty}^{*} = d \in D max ∣ m_{1} (d, \hat{β}_{1}^{*}) - m_{2} (d, \hat{β}_{2}^{*}) ∣,

\hat{d}_{\infty} < \overset{q}{^}_{α}^{*} .

\hat{d}_{\infty} < \overset{q}{^}_{α}^{*} .

\lim_{n_{1},n_{2}\rightarrow\infty}\mathbb{P}\big{(}\hat{d}_{\infty}<\hat{q}_{\alpha}^{*}\big{)}=1,

\lim_{n_{1},n_{2}\rightarrow\infty}\mathbb{P}\big{(}\hat{d}_{\infty}<\hat{q}_{\alpha}^{*}\big{)}=1,

\limsup_{n_{1},n_{2}\rightarrow\infty}\mathbb{P}\big{(}\hat{d}_{\infty}<\hat{q}_{\alpha}^{*}\big{)}\leq\alpha.

\limsup_{n_{1},n_{2}\rightarrow\infty}\mathbb{P}\big{(}\hat{d}_{\infty}<\hat{q}_{\alpha}^{*}\big{)}\leq\alpha.

m_{ℓ} (d, β_{ℓ}) = β_{0, 1} + \tilde{β}_{ℓ, 1} \cdot m_{ℓ}^{0} (d, \tilde{β}_{ℓ}^{0}), ℓ = 1, 2, i = 1, \dots, k_{ℓ} .

m_{ℓ} (d, β_{ℓ}) = β_{0, 1} + \tilde{β}_{ℓ, 1} \cdot m_{ℓ}^{0} (d, \tilde{β}_{ℓ}^{0}), ℓ = 1, 2, i = 1, \dots, k_{ℓ} .

\hat{\beta}=(\hat{\beta}_{0},\hat{\tilde{\beta}}_{1},\hat{\tilde{\beta}}_{2})=\operatorname{arg\,min}_{b\in B}\sum_{j=1}^{n_{0}}\big{(}Y_{0,j}-b_{0,1}\big{)}^{2}+\sum_{\ell=1}^{2}\sum_{i=2}^{k_{\ell}}\sum_{j=1}^{n_{\ell,i}}\big{(}Y_{\ell,i,j}-(b_{0,1}+\tilde{b}_{\ell,1}\cdot m_{\ell}^{0}(d_{\ell,i},\tilde{b}^{0}_{\ell}))\big{)}^{2}.

\hat{\beta}=(\hat{\beta}_{0},\hat{\tilde{\beta}}_{1},\hat{\tilde{\beta}}_{2})=\operatorname{arg\,min}_{b\in B}\sum_{j=1}^{n_{0}}\big{(}Y_{0,j}-b_{0,1}\big{)}^{2}+\sum_{\ell=1}^{2}\sum_{i=2}^{k_{\ell}}\sum_{j=1}^{n_{\ell,i}}\big{(}Y_{\ell,i,j}-(b_{0,1}+\tilde{b}_{\ell,1}\cdot m_{\ell}^{0}(d_{\ell,i},\tilde{b}^{0}_{\ell}))\big{)}^{2}.

β_{ℓ} = (β_{ℓ, 1}, \dots, β_{ℓ, p^{'}}, \dots, β_{ℓ, p_{ℓ}}), ℓ = 1, 2.

β_{ℓ} = (β_{ℓ, 1}, \dots, β_{ℓ, p^{'}}, \dots, β_{ℓ, p_{ℓ}}), ℓ = 1, 2.

K_{0} : i = 1, \dots, p^{'} max ∣ β_{1, i} - β_{2, i} ∣ \geq δ \mbox v er s u s K_{1} : i = 1, \dots, p^{'} max ∣ β_{1, i} - β_{2, i} ∣ < δ,

K_{0} : i = 1, \dots, p^{'} max ∣ β_{1, i} - β_{2, i} ∣ \geq δ \mbox v er s u s K_{1} : i = 1, \dots, p^{'} max ∣ β_{1, i} - β_{2, i} ∣ < δ,

n_{ℓ} \to \infty lim \frac{n _{ℓ, i}}{n _{ℓ}}

n_{ℓ} \to \infty lim \frac{n _{ℓ, i}}{n _{ℓ}}

n_{1}, n_{2} \to \infty lim \frac{n}{n _{1}}

n_{ℓ} (\hat{β}^{(ℓ)} - β_{ℓ}) \to D N (0, Σ_{ℓ}^{- 1}), ℓ = 1, 2,

n_{ℓ} (\hat{β}^{(ℓ)} - β_{ℓ}) \to D N (0, Σ_{ℓ}^{- 1}), ℓ = 1, 2,

\Sigma_{\ell}={1\over\sigma_{\ell}^{2}}\sum_{i=1}^{k_{\ell}}\zeta_{\ell,i}\tfrac{\partial}{\partial b_{\ell}}m_{\ell}(d_{\ell,i,},b_{\ell})\big{|}_{b_{\ell}=\beta_{\ell}}\big{(}\tfrac{\partial}{\partial b_{\ell}}m_{\ell}(d_{\ell,i,},b_{\ell})\big{|}_{b_{\ell}=\beta_{\ell}}\big{)}^{T}~{},~{}~{}\ell=1,2.

\Sigma_{\ell}={1\over\sigma_{\ell}^{2}}\sum_{i=1}^{k_{\ell}}\zeta_{\ell,i}\tfrac{\partial}{\partial b_{\ell}}m_{\ell}(d_{\ell,i,},b_{\ell})\big{|}_{b_{\ell}=\beta_{\ell}}\big{(}\tfrac{\partial}{\partial b_{\ell}}m_{\ell}(d_{\ell,i,},b_{\ell})\big{|}_{b_{\ell}=\beta_{\ell}}\big{)}^{T}~{},~{}~{}\ell=1,2.

\sqrt{n}\big{(}(\hat{\beta}_{1,1},\ldots,\hat{\beta}_{1,p^{\prime}})-(\hat{\beta}_{2,1},\ldots,\hat{\beta}_{2,p^{\prime}})-\big{(}(\beta_{1,1},\ldots,\beta_{1,p^{\prime}})-(\beta_{2,1},\ldots,\beta_{2,p^{\prime}})\big{)}\big{)}\stackrel{{\scriptstyle\mathcal{D}}}{{\rightarrow}}\mathcal{N}(0,\Omega),

\sqrt{n}\big{(}(\hat{\beta}_{1,1},\ldots,\hat{\beta}_{1,p^{\prime}})-(\hat{\beta}_{2,1},\ldots,\hat{\beta}_{2,p^{\prime}})-\big{(}(\beta_{1,1},\ldots,\beta_{1,p^{\prime}})-(\beta_{2,1},\ldots,\beta_{2,p^{\prime}})\big{)}\big{)}\stackrel{{\scriptstyle\mathcal{D}}}{{\rightarrow}}\mathcal{N}(0,\Omega),

Ω := λ Λ_{1}^{- 1} + \frac{λ}{λ - 1} Λ_{2}^{- 1},

Ω := λ Λ_{1}^{- 1} + \frac{λ}{λ - 1} Λ_{2}^{- 1},

(\hat{β}_{1, 1}, \dots, \hat{β}_{1, p^{'}}) - (\hat{β}_{2, 1}, \dots, \hat{β}_{2, p^{'}}) \approx D N ((β_{1, 1}, \dots, β_{1, p^{'}}) - (β_{2, 1}, \dots, β_{2, p^{'}}), \frac{1}{n} Ω),

(\hat{β}_{1, 1}, \dots, \hat{β}_{1, p^{'}}) - (\hat{β}_{2, 1}, \dots, \hat{β}_{2, p^{'}}) \approx D N ((β_{1, 1}, \dots, β_{1, p^{'}}) - (β_{2, 1}, \dots, β_{2, p^{'}}), \frac{1}{n} Ω),

\displaystyle|\hat{\beta}_{1,i}-\hat{\beta}_{2,i}|<\delta-t_{1-\alpha,n-2}\big{(}\tfrac{\hat{\Omega}_{ii}}{n(n-2)}\big{)}^{1/2}\text{ for all $i=1,\ldots,p^{\prime}$},

\displaystyle|\hat{\beta}_{1,i}-\hat{\beta}_{2,i}|<\delta-t_{1-\alpha,n-2}\big{(}\tfrac{\hat{\Omega}_{ii}}{n(n-2)}\big{)}^{1/2}\text{ for all $i=1,\ldots,p^{\prime}$},

Y_{ℓ, i, j} = m_{ℓ} (d_{ℓ, i}, (β_{0}, \tilde{β}_{ℓ})) + η_{ℓ, i, j}, j = 1, \dots, n_{ℓ, i}, i = 1, \dots k_{ℓ}, ℓ = 1, 2.

Y_{ℓ, i, j} = m_{ℓ} (d_{ℓ, i}, (β_{0}, \tilde{β}_{ℓ})) + η_{ℓ, i, j}, j = 1, \dots, n_{ℓ, i}, i = 1, \dots k_{ℓ}, ℓ = 1, 2.

m (d, β) = β_{1} + \frac{β _{2} d ^{β_{3}}}{β _{4}^{β_{3}} + d ^{β_{3}}},

m (d, β) = β_{1} + \frac{β _{2} d ^{β_{3}}}{β _{4}^{β_{3}} + d ^{β_{3}}},

m_{1} (d, β_{1}) = β_{0, 1} + \frac{β _{0, 2} d ^{β_{0, 3}}}{β ~ _{1, 4}^{β_{0, 3}} + d ^{β_{0, 3}}} \mbox an d m_{2} (d, β_{2}) = β_{0, 1} + \frac{β _{0, 2} d ^{β_{0, 3}}}{β ~ _{2, 4}^{β_{0, 3}} + d ^{β_{0, 3}}} .

m_{1} (d, β_{1}) = β_{0, 1} + \frac{β _{0, 2} d ^{β_{0, 3}}}{β ~ _{1, 4}^{β_{0, 3}} + d ^{β_{0, 3}}} \mbox an d m_{2} (d, β_{2}) = β_{0, 1} + \frac{β _{0, 2} d ^{β_{0, 3}}}{β ~ _{2, 4}^{β_{0, 3}} + d ^{β_{0, 3}}} .

m_{1} (d, β_{1}) = β_{0, 1} + \frac{β _{0, 2} d ^{\tilde{β}_{1, 3}}}{β ~ _{1, 4}^{\tilde{β}_{1, 3}} + d ^{\tilde{β}_{1, 3}}} \mbox an d m_{2} (d, β_{2}) = β_{0, 1} + \frac{β _{0, 2} d ^{\tilde{β}_{2, 3}}}{β ~ _{2, 4}^{\tilde{β}_{2, 3}} + d ^{\tilde{β}_{2, 3}}} .

m_{1} (d, β_{1}) = β_{0, 1} + \frac{β _{0, 2} d ^{\tilde{β}_{1, 3}}}{β ~ _{1, 4}^{\tilde{β}_{1, 3}} + d ^{\tilde{β}_{1, 3}}} \mbox an d m_{2} (d, β_{2}) = β_{0, 1} + \frac{β _{0, 2} d ^{\tilde{β}_{2, 3}}}{β ~ _{2, 4}^{\tilde{β}_{2, 3}} + d ^{\tilde{β}_{2, 3}}} .

(\tilde{β}_{2, 3}, \tilde{β}_{2, 4}) = (0.81, 0.86), (\tilde{β}_{2, 3}, \tilde{β}_{2, 4}) = (1.4, 1.07), (\tilde{β}_{2, 3}, \tilde{β}_{2, 4}) = (2.15, 1.18),

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Equivalence of regression curves sharing common parameters

Kathrin Möllenhoff1, Frank Bretz2 , Holger Dette1

1 Department of Mathematics, Ruhr-Universität Bochum, Germany,

2Novartis Pharma AG, CH-4002 Basel, Switzerland

Abstract

In clinical trials the comparison of two different populations is a frequently addressed problem. Non-linear (parametric) regression models are commonly used to describe the relationship between covariates as the dose and a response variable in the two groups. In some situations it is reasonable to assume some model parameters to be the same, for instance the placebo effect or the maximum treatment effect. In this paper we develop a (parametric) bootstrap test to establish the similarity of two regression curves sharing some common parameters. We show by theoretical arguments and by means of a simulation study that the new test controls its level and achieves a reasonable power. Moreover, it is demonstrated that under the assumption of common parameters a considerable more powerful test can be constructed compared to the test which does not use this assumption. Finally, we illustrate potential applications of the new methodology by a clinical trial example.

Keywords and Phrases: Similarity of regression curves, equivalence testing, parametric bootstrap, nonlinear regression, dose finding studies

1 Introduction

Regression models are commonly used to describe the relationship between multiple covariates and a response variable. In certain applications, more than one regression model is available, such as when assessing the relationship between the covariates and the response variables in more than one population (e.g. in males and females). It is then often of interest to demonstrate the equivalence of the regression curves: If equivalence can be claimed, conclusions can be drawn from the pooled sample and a single regression model is sufficient to describe the data. This can be achieved by testing a suitable null hypothesis that the distance between the regression curves (measured in an appropriate sense) is smaller than a pre-specified equivalence margin at a controlled Type I error rate. Note that the problem of equivalence testing, as considered in this paper, is conceptually different from the more frequent problem of testing for equality of curves and is much less studied in the literature due to methodological difficulties.

The problem of testing for equality of regression models has been intensively discussed in the nonparametric context and we refer to the recent work of Feng et al., (2015), whichcontains a rather comprehensive list of references. In applied regression analysis, however, parametric models are usually preferred to a purely nonparametric approach as they admit a direct interpretation of the observed effects in terms of the model parameters. In addition, the available information of the observations is increased by applying more efficient estimation or test procedures, provided that the assumed model is valid. Despite its importance, the problem of establishing equivalence of two parametric regression models while controlling the Type I error rate has only recently found attention in the literature. Using the intersection-union test device from Berger, (1982), Liu et al., (2009) investigated the assessment of non-superiority, non-inferiority and equivalence when comparing two regression models over a restricted covariate region. Building upon this work, Gsteiger et al., (2011) derived equivalence tests based on simultaneous confidence bands for nonlinear regression models, with application to population pharmacokinetic analyses. Likewise, Bretz et al., (2016) assessed the similarity of dose response curves in two non-overlapping subgroups of patients. Alternatively, Dette et al., (2018) suggested directly estimating the distance between the regression curves and using a non-standard bootstrap test to decide for equivalence of the two curves if the estimate is less than a certain threshold. Expanding this approach, Moellenhoff et al., (2018) assessed the comparability of drug dissolution profiles via maximum deviation, whereas Hoffelder, (2018) demonstrated the equivalence of dissolution profiles using the Mahalanobis distance; see also Collignon et al., (2018).

In these papers, the authors assumed that the regression models have different parameters and can therefore be evaluated separately. In some applications, however, this assumption cannot be justified and it is more reasonable to assume that the regression models may have some common parameters. The total number of parameters to estimate is then reduced to the common and remaining parameters of each model, affecting the asymptotic behavior of the estimators. Consider, for example, the Phase II dose finding trial for a weight loss drug described in Bretz et al., (2016). This trial aimed at comparing the dose response relationship for two regimens administered to patients suffering from overweight or obesity: Three doses each for once daily (o.d.) and twice daily (b.i.d.) use of the medication, and placebo. It is reasonable to assume that the placebo response is the same under both the o.d. and the b.i.d. regimen. Since the regression models typically used for dose response modeling contain a parameter for the placebo response (Pinheiro et al., (2006)), they will thus share this common parameter for both the o.d. and the b.i.d. regimen. In some instances, it might even be reasonable to assume that the maximum efficacy for high doses is similar in both groups. Moreover, clinical trial sponsors may even decide to use the same placebo group for logistical reasons. The response of each patient on placebo is then used twice in the estimation of the o.d. and b.i.d. dose response models, further complicating the statistical problem.

In this paper, we investigate the equivalence of two parametric regression curves that share common parameters. In Section 2 we first introduce the regression models to be estimated under the assumption of common parameters. We then develop a non-standard bootstrap test which performs the resampling under the constraints of the interval hypotheses implied by the equivalence test problem. The new tests improves the procedure proposed in Dette et al., (2018) using the additional information of common parameter in both groups. We also discuss testing the equivalence of model parameters to assess whether the assumption of common parameters is plausible. In Section 3 we investigate the finite sample properties of the proposed bootstrap test proposed in terms of power and size. In Section 4 we illustrate the methods using a multi-regional clinical trial example where it is conceivable that the placebo and maximum treatment responses are the same across geographic regions but the onset of treatment differs due to intrinsic and extrinsic factors (Malinowski et al., (2008); ICH, (2017)). Technical details and proofs are deferred to an appendix.

2 Methodology

2.1 Models with common parameters

Let

[TABLE]

denote the observed response of the $j$ th subject at the $i$ th dose level $d_{\ell,i}$ under the $\ell$ th dose response model $m_{\ell}$ , where $\ell=1,2$ denotes the index of the two groups under consideration. We assume that the (non-linear) regression model $m_{\ell}$ is parametrized through a $p_{\ell}$ -dimensional vector $\beta_{\ell}$ , $\ell=1,2$ . Note that the regression models $m_{1}$ and $m_{2}$ may be different. Likewise, the parameters $\beta_{1}$ and $\beta_{2}$ may be different even if $m_{1}=m_{2}$ . We further assume that the error terms $\eta_{\ell,i,j}$ are independent and identically distributed with expectation [math] and variance $\sigma_{\ell}^{2}$ . The dose levels $d_{\ell,i}$ may be different in both groups but they are attained on the same (restricted) covariate region $\cal D$ . In this paper $\cal D$ is assumed to be the dose range, although the results can be generalized to include other covariates. Further, $n_{\ell}=\sum_{i=1}^{k_{\ell}}n_{\ell,i}$ denotes the sample size in group $\ell$ where we assume $n_{\ell,i}$ observations in the $i$ th dose level ( $i=1,\ldots,k_{\ell},\ \ell=1,2)$ . The sample sizes $n_{\ell}$ can be unequal and the total number of observations is denoted by $n=n_{1}+n_{2}$ .

In this paper we consider the situation, where the regression models have some common parameters. More precisely, we assume without loss of generality that these parameters are given by the first $p^{\prime}$ model parameters of the parameter $\beta_{\ell}$ in model (2.1) that is

[TABLE]

where $\beta_{0}\in\mathbb{R}^{p^{\prime}}$ denotes the vector of common parameters in both regression models and $\tilde{\beta}_{1}$ and $\tilde{\beta}_{2}$ denote the remaining parameters in the models $m_{1}$ and $m_{2}$ , respectively, which do not necessarily coincide. The case where the models $m_{1}$ and $m_{2}$ do not share any common parameters is included and corresponds to $\beta_{\ell}=\tilde{\beta}_{\ell}$ for $\ell=1,2$ (that is $p^{\prime}=0$ ). As a consequence the $p_{1}+p_{2}-p^{\prime}$ -dimensional vector of all parameters of the regression functions in model (2.1) under the assumption (2.2) is given by $\beta=(\beta_{0},\tilde{\beta}_{1},\tilde{\beta}_{2})$ . Throughout this paper we assume that $\beta\in B$ where $B\subset\mathbb{R}^{p_{1}+p_{2}-p^{\prime}}$ is a compact set.

These parameters are now estimated by least squares using the combined sample $\{Y_{\ell,i,j}:~{}j=1,\ldots,n_{\ell,i},i=1,\ldots,k_{\ell},\ell=1,2\}$ , that is

[TABLE]

2.2 Testing equivalence of regression curves

Following Liu et al., (2009) and Gsteiger et al., (2011) we consider the regression curves $m_{1}$ and $m_{2}$ to be equivalent if the maximum distance between the two curves is smaller than a given pre-specified constant, say $\varepsilon>0$ , that is,

[TABLE]

In clinical trial practice $\varepsilon$ is often referred to as a relevance threshold in the sense that if $d_{\infty}(\beta_{1},\beta_{2})<\varepsilon$ the difference between the two curves is believed not to be clinically relevant. In order to establish equivalence of the two curves $m_{1}$ and $m_{2}$ at a controlled type I error, we will develop a test for the hypotheses

[TABLE]

In the following we extend the bootstrap approach from Dette et al., (2018) to test the hypotheses (2.4) in the situation of common parameters. Note that the test procedure proposed below could also be applied to alternative measures of equivalence, such as the integrated deviation $\int_{\cal D}|m_{1}(t,\beta_{1})-m_{2}(t,\beta_{2})|dt$ .

Algorithm 2.1.

(parametric bootstrap for testing equivalence under the assumption of common parameters)

(1)

Calculate the ordinary least-square (OLS) parameter estimate (2.3) assuming a common parameter $\beta_{0}$ . The corresponding variance estimates are given by

[TABLE]

where $\hat{\beta}_{\ell}=(\hat{\beta}_{0},\hat{\tilde{\beta}}_{\ell}),\ \ell=1,2$ . Calculate the estimate

[TABLE]

for the maximal deviation between the two regression curves.

(2)

Define the constrained estimates

[TABLE]

where $\bar{\beta}_{1},\bar{\beta}_{2}$ minimize the objective function in (2.3) under the additional restriction

[TABLE]

Define ${\hat{\hat{d}}_{\infty}}=d_{\infty}({\hat{\hat{\beta}}_{1}},{\hat{\hat{\beta}}_{2}})$ and note that ${\hat{\hat{d}}_{\infty}}\geq\varepsilon$ .

The next two steps describe the (parametric) bootstrap procedure.

(3)

Generate data

[TABLE]

with independent and normally distributed errors $\eta_{\ell,i,j}^{*}\sim\mathcal{N}(0,\hat{\sigma}_{\ell}^{2})$ .

(4)

Calculate the OLS estimate $\hat{\beta}^{*}$ as in Step (1) and the test statistic

[TABLE]

where $\beta_{\ell}^{*}=(\beta_{0}^{*},\tilde{\beta}_{\ell}^{*}),\ \ell=1,2$ . The $\alpha-$ quantile of the distribution of the distribution of the statistic $\hat{d}^{*}_{\infty}$ is denoted by $q_{\alpha}^{*}$ and the null hypotheses in (2.4) is rejected, whenever

[TABLE]

In practice the $\hat{q}_{\alpha}^{*}$ can be calculated repeating steps (3) and (4), say $B$ times, in order to obtain replicates $\hat{d}^{*}_{\infty,1},\dots,\hat{d}^{*}_{\infty,B}$ of $\hat{d}^{*}_{\infty}$ . An estimate of $\hat{q}_{\alpha}^{*}$ then is defined by $\hat{q}_{\alpha}^{(B)}:=\hat{d}_{\infty}^{*(\lfloor B\alpha\rfloor)}$ , where $\hat{d}^{*(1)}_{\infty}\leq\ldots\leq\hat{d}^{*(B)}_{\infty}$ denotes the corresponding order statistic, and this estimate is used in (2.9) **

The following theorem states that this algorithm yields a valid test procedure. The proof is left to the Appendix 6.

Theorem 2.1.

The test defined by (2.9) is a consistent, asymptotic $\alpha$ -level test. That is

[TABLE]

whenever $d_{\infty}<\varepsilon$ , and

[TABLE]

if $d_{\infty}\geq\varepsilon$ .

Remark 2.2.

The results presented in this section remain correct in trials with a common placebo group, where $n_{0}$ observations are taken at dose level $d_{1}=0$ (corresponding to placebo), which are modelled by the random variables $Y_{0,1},\ldots,Y_{0,n_{0}}$ . For the sake of a simple presentation we consider location-scale type models, such that the common effect at the placebo can easily be modelled, but we note that more general models can be considered as well introducing additional constraints for the parameter.

To be precise, we assume that the models in (2.1) are given by

[TABLE]

where $m_{\ell}^{0}(0,\tilde{\beta}^{0}_{\ell})=0$ $(\ell=1,2)$ , such that the condition $m_{1}(0,\beta_{\ell})=m_{2}(0,\beta_{\ell})=\beta_{0,1}$ reflects the fact that there is only one placebo group (and as a consequence a common placebo parameter). Models of this type cover the most frequently used functional forms used in drug development and several examples can be found in Ting, (2006). Beside the location parameter $\beta_{0,1}$ there may be also other shared parameters, which we do not reflect in our notations for a better readability. The $\ell$ -th model is completely characterized by its parameter $\beta_{\ell}=(\beta_{0},\tilde{\beta_{\ell}})=(\beta_{0},\tilde{\beta}_{\ell,1},\tilde{\beta}^{0}_{\ell})$ , $\ell=1,2$ , and we obtain estimates of the model parameters by minimizing the sum of squares

[TABLE]

Theorem 2.1 remains valid in this situation and a proof can be found in the Appendix (see Section 6.4). **

2.3 Testing equivalence of model parameters

So far we assumed that the two regression models $m_{1}$ and $m_{2}$ share the common parameter $\beta_{0}$ . In practice it may be necessary to assess whether this assumption is plausible using an appropriate equivalence test for the shared model parameters. To be more precise, we recall the definition the parameters $\beta_{\ell}$ in model (2.1), i.e.

[TABLE]

and note that assumption (2.2) of $p^{\prime}$ common parameters in the models $m_{1}$ and $m_{2}$ can be represented as $(\beta_{1,1},\ldots,\beta_{1,p^{\prime}})=(\beta_{2,1},\ldots,\beta_{2,p^{\prime}})$ for $\ell=1,2$ . In order to investigate if this assumption holds at least approximately we construct a test for the hypotheses

[TABLE]

where $\delta$ denotes the equivalence margin. To be precise let $\hat{\beta}^{(\ell)}$ denote the least squares estimates in model $m_{\ell}$ for the sample $\{Y_{\ell,i,j}:~{}j=1,\ldots,n_{\ell,i},i=1,\ldots,k_{\ell}\}$ ( $\ell=1,2$ ), and assume that for large sample considerations the sample sizes $n_{\ell}$ and $n_{\ell,i}$ converge to infinity such that

[TABLE]

Under standard assumptions, which are listed in Section 6 it can be shown that the least squares estimate $\hat{\beta}^{(\ell)}$ of the parameter $\beta_{\ell}$ in model $m_{\ell}$ is approximately normal distributed, that is

[TABLE]

where the symbol $\stackrel{{\scriptstyle\cal D}}{{\longrightarrow}}$ means convergence in distribution and the matrix $\Sigma_{\ell}$ is defined by

[TABLE]

Here and throughout this paper we assume that the matrices $\Sigma_{1}$ and $\Sigma_{2}$ are non-singular. Consequently the difference $\sqrt{n}(\hat{\beta}^{(1)}-\hat{\beta}^{(2)})$ is also asymptotically normal distributed, and in particular it follows for the first $p^{\prime}$ components of the difference that

[TABLE]

where the matrix $\Omega$ is defined by

[TABLE]

$\Lambda_{\ell}^{-1}=\big{(}(\Sigma_{\ell}^{-1})_{ij})\big{)}_{i,j=1}^{p^{\prime}}$ denotes the upper-left $p^{\prime}\times p^{\prime}$ -block of the matrix $\Sigma_{\ell}^{-1}$ $(\ell=1,2)$ and $\lambda$ is defined in (2.16). Therefore we obtain the approximation

[TABLE]

where $\Omega$ is defined in (2.20). We can now apply the test $(2.2)$ proposed in Wang et al., (1999) by rejecting the null hypothesis $K_{0}$ in (2.14), whenever

[TABLE]

where $t_{1-\alpha,n-2}$ denotes the $1-\alpha$ quantile of the $t$ -distribution with $n-2$ degrees of freedom and $\hat{\Omega}_{ii}$ the $i$ th diagonal element of the matrix $\hat{\Omega}$ which is an estimate for the (unknown) covariance matrix $\Omega$ (this is obtained by replacing the unknown parameters $\beta_{\ell}$ , $\sigma^{2}_{\ell}$ and weights $\zeta_{\ell,i}$ in (2.18) by their corresponding estimates and $n_{\ell,i}/n_{\ell}$ , respectively).

3 Finite sample properties

We now investigate the finite sample properties of the bootstrap test proposed in Section 2.2 in terms of power and size using numerical simulations. The data is generated as follows:

(a)

We choose the functional form of the models $m_{1},m_{2}$ and specify their parameters $\beta_{1},\beta_{2}$ (including a common parameter $\beta_{0}$ ), which determine the true underlying models. Further we choose variances $\sigma_{\ell}^{2}$ and the actual dose levels $d_{\ell,i}$ , $\ell=1,2$ . 2. (b)

For each dose $d_{\ell,i}$ we calculate $n_{\ell,i}$ values for the response given by $m_{\ell}(d_{\ell,i},(\beta_{0},\tilde{\beta}_{\ell}))$ . By generating residual errors $\eta_{\ell,i,j}\sim N(0,\sigma_{\ell}^{2})$ we obtain the final response data

[TABLE]

The simulation results below were obtained using $1^{\prime}000$ simulation runs, where $B=500$ bootstrap replications were used to calculate quantiles of the bootstrap test.

In the following, we report the simulations results for power and size under three different scenarios. We consider the four-parameter sigmoid Emax model

[TABLE]

which is frequently used in practice when modeling dose response relationships (see for example Gabrielsson and Weiner, (2007) or Thomas et al., (2014)). In model (3.2) the parameter $\beta=(\beta_{1},\beta_{2},\beta_{3},\beta_{4})$ corresponds (in this order) to the placebo effect $E_{0}$ , the maximum effect $E_{max}$ , the Hill parameter $h$ determining the steepness of the dose-response curve and the dose $ED_{50}$ producing half of the maximum effect (Macdougall, (2006)). In what follows we add an index $\ell=0$ for a shared parameter or $\ell=1,2$ for the group under consideration.

Scenario 1: We assume the dose range $\mathcal{D}=[0,4]$ with identical dose levels $d_{\ell,i}=i-1$ , $i=1,2,3,4,5$ for both regression models $\ell=1,2$ . For each configuration of $\sigma_{\ell}^{2}=1,2,3$ we use (3.1) to simulate $n_{\ell,i}=6,18,30$ observations at each dose level $d_{\ell,i}$ , resulting in total sample sizes of $n_{\ell}=30,90,150$ , respectively. We first compare the two sigmoid Emax models

[TABLE]

assuming the shared parameters $(\beta_{0,1},\beta_{0,2},\beta_{0,3})$ . The only difference between the two models is in the $ED_{50}$ parameters $\tilde{\beta}_{1,4}$ and $\tilde{\beta}_{2,4}$ , which results in the need to estimate five parameters in total. We consider the reference sigmoid Emax model $m_{1}$ with the parameters $(\beta_{0,1},\beta_{0,2},\beta_{0,3})=(1,5,4)$ and $\tilde{\beta}_{1,4}=1$ . This reference model is compared to various specifications of the second model $m_{2}$ determined by $\tilde{\beta}_{2,4}=1.99$ , $1.77$ , $1.59$ , $1.43$ , $1.37$ , $1$ and common shared parameters $(\beta_{0,1},\beta_{0,2},\beta_{0,3})$ . The values for $\tilde{\beta}_{2,4}$ were chosen such that the maximum absolute distances $d_{\infty}=\max_{d\in{\cal D}}\left|m_{2}(\mathbf{\beta}_{2},d)-m_{1}(\mathbf{\beta}_{1},d)\right|$ are given by $2$ , $1.5$ , $1$ , $0.5$ , $0.25$ , [math] respectively. For $d_{\infty}>0$ these are attained at the dose levels $1.61,\ 1.52,\ 1.44,\ 1.37$ and $1.33$ ; see Figure 1 $a$ . For $d_{\infty}=0$ , that is $\tilde{\beta}_{1,4}=\tilde{\beta}_{2,4}$ , the maximum distance is attained at every point in $\mathcal{D}$ .

In Table 1 we summarize the simulated rejection probabilities of the bootstrap test (2.9) under the null hypothesis (2.4) with $d_{\infty}=2,1.5,1$ and $\varepsilon=1$ . We conclude that the bootstrap test controls its level in all cases under consideration. At the margin of the null hypothesis (i.e. $d_{\infty}=1$ ) the approximation of the level is very precise, even for sample sizes as small as $n_{\ell,i}=6$ .

We also investigated the relative residual mean squared errors (RRMSE) of the parameters estimates. Table 2 summarizes the simulation results only for $d_{\infty}=1$ (i.e. at the margin of the null hypothesis), as the results are similar for other choices of $d_{\infty}$ . We conclude that the RRMSE for estimating the Hill parameter $\beta_{0,3}$ is (by far) the largest. This phenomenon has also been observed by Mielke, (2016). We also observe that all estimation errors decrease with larger sample sizes and smaller variances. Table 2 also summarizes the RRMSE when fixing the Hill parameter at $\beta_{0,3}=4$ (see the numbers in brackets). In this case, four parameters need to be estimated in total and the estimation errors become slightly smaller. We also repeated the Type I error rate simulations when fixing the Hill parameter at $\beta_{0,3}=4$ . The the results are reported in Table 1 (numbers in brackets) and we conclude that the size is well controlled within the simulation error.

In Table 3 we summarize the power of the bootstrap when generating the data under the alternative $d_{\infty}=0.5,0.25,0$ and $\varepsilon=1$ . As expected, the power increases with larger sample sizes and smaller variances and is reasonably high across all configurations. Fixing the Hill parameter significantly improves the power which can be explained by the difficulty of estimating this parameter precisely, as discussed above.

Scenario 2: We maintain the basic settings from Scenario $1$ . We consider again two sigmoid Emax models

[TABLE]

but assume now that both, the placebo response $\beta_{0,1}$ and the maximum treatment effect $\beta_{0,2}$ , are the same. For the reference model we chose $(\beta_{0,1},\beta_{0,2})=(1,5)$ , $\tilde{\beta}_{1,3}=4.5$ and $\tilde{\beta}_{1,4}=1.3$ . We investigated the maximum distances $d_{\infty}=2,1.5,1,0.5,0.25,0$ , which resulted in the following parameter configurations for the second model:

[TABLE]

The maximum distances between the two curves are now attained at the dose levels $0.66,\ 0.75,\ 0.83,\ 0.9$ and $0.93$ ; see Figure 1 $b$ .

In Table 4 we summarize the simulated rejection probabilities under the null hypothesis (2.4) for $d_{\infty}=2,1.5,1$ and $\varepsilon=1$ . We conclude again that the bootstrap test controls the designated significance level in all cases under consideration. Especially at the margin $d_{\infty}=1$ the simulated Type I error rates are close to the nominal level $\alpha$ . These observations apply regardless of whether the Hill parameter is estimated or fixed at the true underlying values given in (3).

In Table 5 we summarize the simulated power of the bootstrap test under the alternative $d_{\infty}=0.5,0.25,0$ and $\varepsilon=1$ . As expected, the power decreases for increasing values of $d_{\infty}$ and for higher variances or smaller sample sizes. One noticeable exception occurs at $d_{\infty}=0$ , where in some cases the power is smaller than for $d_{\infty}=0.25$ . This effect can be explained theoretically when considering the proofs for the bootstrap test. In case of $d_{\infty}=0$ the set $\cal E=\cal E^{+}\cup\cal E^{-}$ containing all points where the maximum distance between the two curves is attained (see Appendix 6.3) consists of the entire dose range $\cal D$ . Therefore, the asymptotic distribution of the test statistic is not Gaussian but a maximum of Gaussian processes. This complex structure of the asymptotic distribution has an impact on the bootstrap procedure and explains the decrease in power for $d_{\infty}=0$ . This phenomenon can also be observed, although to a lesser degree, in Scenario $1$ . Finally, we observe higher power values when fixing the Hill parameter compared to the situation where it has to be estimated.

Scenario 3: We now investigate the operating characteristics of the bootstrap test assuming three, two, one and no shared parameters. We set $\varepsilon=1$ , $\alpha=0.05$ and compare again two sigmoid Emax models. The true placebo response is chosen as $\beta_{1,1}=\beta_{2,1}=0$ . The reference model $m_{1}$ is specified by $(\beta_{1,2},\beta_{1,3},\beta_{1,4})=(5,2,1.3)$ . The second model is specified by $(\beta_{2,2},\beta_{2,3},\beta_{1,4})=(5,2,\kappa)$ , where $\kappa\in(1.3,3]$ is chosen such that the maximum distances with respect to $m_{1}$ lie between [math] and $2$ ; see the resulting curves plotted in Figure 2 $(a)$ for $\kappa=1.5,1.7,2,2.5,3$ . Consequently the parameters specifying the two models only differ in $ED_{50}$ parameters $\tilde{\beta}_{1,4}$ and $\tilde{\beta}_{2,4}$ and the maximum distance is determined by the choice of $\kappa$ . As all other parameters are the same for the two models, we can compare the bootstrap test assuming three, two, one and no shared parameter. Note that we do not consider the case of identical models (i.e. $\kappa=1.3$ ) because of the discontinuity of power at $d_{\infty}=0$ described under Scenario 2. The dose range is given by $\mathcal{D}=[0,10]$ with $5$ different dose levels $d_{\ell,1}=0$ , $d_{\ell,2}=1$ , $d_{\ell,3}=2$ , $d_{\ell,4}=5$ , $d_{\ell,5}=10,\ell=1,2$ . We create $n_{\ell,i}=35$ observations at each dose level for each group according to (3.1), which results in a total sample size of $n=n_{1}+n_{2}=350$ . Finally, we choose $\sigma_{\ell}^{2}=2$ , $\ell=1,2$ .

In Figure 2 $(b)$ we plot the proportion of rejections in dependence of the true maximum absolute difference $d_{\infty}\in(0,2]$ . Under the null hypothesis $d_{\infty}\geq 1$ all four tests control their level, as the proportion of rejections is smaller than or equal to $\alpha=0.05$ within simulation errors. Looking at the region $d_{\infty}<1$ , we observe that the test assuming three shared parameters has the highest power among all four tests, followed by the test assuming two shared parameters. The difference between the tests assuming one and no shared parameter is rather small. Concluding, the more parameters can be assumed to be common for the two regression curves the higher is the power of the test. Note, however, that strictly speaking the hypotheses (2.4) are different when assuming three, two, one and no shared parameters and that the perceived power gain when assuming more shared parameters comes at the cost of making additional assumptions that need to be verified in practice, as illustrated with the clinical trial example in Section 4.

4 Clinical trial example

We now illustrate the proposed method with a multi-regional clinical trial example. The objective of this trial is to evaluate the dose response relationships in Caucasian and Japanese patients and assess their similarity. Based on data from previous clinical trials investigating a drug with a similar mode of action, it is reasonable to assume a similar response to placebo and a common maximum treatment effect in both populations, with the main difference expected to be in a different onset of treatment effect. Using the sigmoid Emax model (3.2), these consideration thus lead to different $ED_{50}$ and Hill parameters for the two dose response curves. Because the trial is still at its design stage, we simulate data based on the trial assumptions. To maintain confidentiality, we scale the actual doses to lie within the [0, 15] interval. These limitations do not change the utility of the calculations below.

We assume $60$ Japanese and $240$ Caucasian patients, resulting in $300$ patients overall. Patients from both populations are randomized to receive either placebo (dose level [math]) or one of three active dose levels, namely $1,\ 3,\ 15$ for the Japanese and $0.5,\ 9$ and $15$ for the Caucasian patients. Assuming equal allocation of patients within each population, we thus have 75, 60, 15, 15, 60, and 75 patients randomized to the dose levels $0,\ 0.5,1,3,\ 9$ and $15$ , respectively. The response variable is assumed to be normally distributed and larger values indicate a better outcome. Pharmacological and clinical considerations suggest the use of the (three-parameter) Emax model with the Hill parameter fixed at 1. Later on we relax this assumption as part of a sensitivity analysis. The R code for this example and all other calculations in this paper is available from the authors upon request.

In Figure 3 we display the fitted dose response models $m_{1}(d,\hat{\beta}_{1})$ and $m_{2}(d,\hat{\beta}_{2})$ for the Japanese and Caucasian patients, respectively, together with the individual observations, where $d\in[0,15]$ and the $y$ -axis is truncated to $[-1,6]$ for better readability. The parameter estimates from the two separate model fits are given by $\hat{\beta}_{1}=(-0.195,4.751,11.991)$ and $\hat{\beta}_{2}=(-0.002,5.676,33.887)$ . The observed differences for the placebo response and the maximum treatment effect are given by $|\hat{\beta}_{1,1}-\hat{\beta}_{2,1}|=0.193$ and $|\hat{\beta}_{1,2}-\hat{\beta}_{2,2}|=0.925$ , respectively, and thus relatively small, as it also transpires from the plots in Figure 3. To corroborate this empirical observation, we formally test whether the assumption of shared parameters is plausible by applying the equivalence test described in Section 2.3 on the data set under consideration. We choose the threshold $\delta=1.5$ and therefore test the null hypothesis $K_{0}:\max_{i=1,2}\left|\beta_{1,i}-\beta_{2,i}\right|\geq 1.5$ against the alternative $K_{1}:\max_{i=1,2}\left|\beta_{1,i}-\beta_{2,i}\right|<1.5.$ Applying the test (2.21) for $\alpha=0.05$ , we obtain $\hat{\Omega}_{11}=3127.91$ and $\hat{\Omega}_{22}=10748.27$ and therefore $\delta-t_{1-\alpha,n-2}\big{(}\tfrac{\hat{\Omega}_{11}}{n(n-2)}\big{)}^{1/2}=1.191$ and $\delta-t_{1-\alpha,n-2}\big{(}\tfrac{\hat{\Omega}_{22}}{n(n-2)}\big{)}^{1/2}=0.928$ , respectively. We can thus reject $K_{0}$ at the relatively stringent 5% level and conclude equivalence of the two parameters, which justifies using the bootstrap test (2.1) with shared parameters.

We now evaluate the similarity of the dose response curves for the Japanese and Caucasian patients, assuming the same placebo and maximum treatment effect. In order to compute the non-linear least squares estimates in model (2.1) with (2.2) we formulate the objective function of the minimization step as

[TABLE]

Here, $\beta_{0,1}$ denotes the (shared) placebo effect, $\beta_{0,2}$ the (shared) maximum treatment effect $E_{max}$ , and $\tilde{\beta}_{1,3}$ and $\tilde{\beta}_{2,3}$ the $ED_{50}$ parameters of the two models. Using the auglag function from the alabama package Varadhan, (2014) to solve the above optimization problem, we obtain the parameter estimates $\hat{\beta}_{0,1}=-0.064\ (0.074)$ , $\hat{\beta}_{0,2}=5.366\ (0.137)$ , $\hat{\tilde{\beta}}_{1,3}=19.400\ (2.634)$ and $\hat{\tilde{\beta}}_{2,3}=25.681\ (3.256)$ . In brackets we report the associated standard errors, which have to be calculated manually based on (5.5). The estimates for the population variances are $\hat{\sigma}_{1}^{2}=0.508$ and $\hat{\sigma}_{2}^{2}=0.455$ . The observed maximum difference between both curves over the investigated dose range $[0,15]$ is $\hat{d}_{\infty}=0.376$ , attained at dose $2.23$ . We apply the bootstrap test (2.9) using $B=1^{\prime}000$ bootstrap replications. Setting $\varepsilon=0.7$ for the equivalence margin in (2.4), we obtain the quantile $q_{0.05}=0.438$ for $\alpha=0.05$ . Thus, we reject the null hypothesis (2.4) at the 5% significance level and conclude that the dose response curves for the Japanese and Caucasian populations are similar, under the shared parameter assumption. Alternatively, we can calculate the $p$ -value $\frac{1}{B}\sum_{i=1}^{B}I(d_{\infty}^{*(i)}\leq\hat{d}_{\infty})=0.023$ for the bootstrap test and obtain the same test decision at level $\alpha=0.05$ . For illustration purposes we also apply the bootstrap test (2.9) but without shared parameters (yet under the assumption of a fixed Hill parameter). Accordingly, we obtain a considerably larger $p$ -value of $0.458$ , which supports our findings from Scenario $3$ in Section 3 about the loss in power when no shared parameters are assumed. In this case the observed maximum distance is $\hat{d}_{\infty}=0.706$ , attained at dose $1.42$ , and the quantile of the bootstrap distribution is $q_{0.05}=0.449$ .

Finally, we perform a sensitivity analysis to investigate the assumption of the Hill parameter being equal to 1. As part of this analysis we repeat the model fit and the bootstrap test using the sigmoid Emax model (3.2) where the Hill parameter is now part of the estimation. The parameter estimates (standard errors in brackets) are $\hat{\beta}_{0,1}=0.037\ (0.082)$ , $\hat{\beta}_{0,2}=4.544\ (0.218)$ , $\hat{\tilde{\beta}}_{1,3}=1.05\ (0.229)$ , $\hat{\tilde{\beta}}_{1,4}=13.542\ (2.095)$ , $\hat{\tilde{\beta}}_{2,3}=1.650\ (0.331)$ and $\hat{\tilde{\beta}}_{2,4}=16.558\ (4.521)$ . Now, the maximum distance between the curves is $\hat{d}_{\infty}=0.640$ , attained at dose $0.6$ . It turns out that the standard errors of the estimates are slightly higher which is in line with the results shown in the simulation studies in Section 3. Performing again the bootstrap test with two shared parameters results the quantile $q_{0.05}=0.429$ and the $p$ -value $0.285$ . Consequently, we cannot reject the null hypothesis in this case. In conclusion, fixing the Hill parameter to 1 and assuming both the placebo effect and the maximum treatment effect to be the same in both populations clearly results in the most powerful procedure. We can demonstrate equivalence at the significance level of $\alpha=0.05$ , whereas in case of estimating both models separately (i.e. no shared parameters) or including the Hill parameter in the estimation we obtain considerably larger $p$ -values.

5 Conclusion

In this paper we developed a new test for the equivalence of two regression curves when it is reasonable to assume that some model parameters are the same. Our approach is based on an estimate of the maximum deviation between the two curves, where critical values are obtained by a novel constraint bootstrap procedure. We demonstrated that the new test controls its level properly and is consistent.

We investigated the finite sample properties of the proposed procedure using extensive simulations and observed that the Type I error rate is controlled in all scenarios under consideration, even for sample sizes as small as $6$ patients per dose level. Further, we concluded that the test reaches a reasonable power that increases with larger sample sizes. In particular, we demonstrated that the power of tests for the equivalence of curves can be improved substantially by using the additional information of common parameters in the two regression curves. This effect could also be observed in the clinical trial example, which showed the power advantage of the bootstrap test (2.9) if the underlying assumptions are well justified. Relaxing those assumptions may lead to more robust conclusions, but only at a cost of a loss in power.

An interesting extension of the proposed methodology arises from the need to include covariates in clinical trial practice. Covariates can be continuous (e.g. age or body mass index), categorical (e.g. disease status or race), or binary (e.g. gender or smoking yes/no), possibly changing over time. These cases may have to be treated differently and we leave this problem for future research. Another area of research could be the assessment of similarity in two nested populations, thus relaxing the assumption of independence between the observations. In our multi-regional clinical trial example we compared the Japanese with Caucasian patients. It will be interesting and relevant to clinical trials to explore the development of the proposed methods when comparing the Japanese with an overall population that includes Japanese and Caucasian patients. Again, we leave this topic for future research.

Acknowledgments This work has been supported in part by the Collaborative Research Center “Statistical modeling of nonlinear dynamic processes” (SFB 823, Teilprojekt T1) of the German Research Foundation (DFG) Parts of this manuscript were written while Frank Bretz was on a sabbatical leave at University of Canterbury in Christchurch, New Zealand. He would like to thank Dr. Daniel Gerhard for his support.

6 Appendix: Proof of Theorem 2.1 and Remark 2.2

The proof of the theoretical results of this paper proceeds in several steps. First we state the assumptions under which the statements hold (Section 6.1). Second, we derive the asymptotic distribution of the parameter estimates in models with common parameters (Section 6.2). In Section 6.3 we derive a result on the weak convergence of a stochastic process, from which the proof of Theorem 2.1 and Remark 2.2 can be derived (see Section 6.4).

6.1 Assumptions

For the theoretical results of this paper we make the same assumptions as in Dette et al., (2018).

The errors $\eta_{\ell,i,j}$ are independent, have finite variance $\sigma_{\ell}^{2}$ and expectation zero. 2. 2.

The covariate region $\mathcal{D}\subset\mathbb{R}^{d}$ is compact and the number and location of dose levels $k_{\ell}$ does not depend on $n_{\ell},\ \ell=1,2$ . 3. 3.

All estimates of the parameters $\beta_{1},\beta_{2}$ are computed over compact sets $B_{1}\subset\mathbb{R}^{p_{1}}$ and $B_{2}\subset\mathbb{R}^{p_{2}}$ . 4. 4.

The regression functions $m_{1}$ and $m_{2}$ are twice continuously differentiable with respect to the parameters for all $b_{1},b_{2}$ in neighbourhoods of the true parameters $\beta_{1},\beta_{2}$ and all $d\in\mathcal{D}$ . The functions $(d,b_{\ell})\mapsto m_{\ell}(d,b_{\ell})$ and their first two derivatives are continuous on $\mathcal{D}\times B_{\ell}$ . 5. 5.

The gradients with respect to the parameters are uniformly bounded, that is

$\sup_{d\in\mathcal{D}}\left\|\tfrac{\partial}{\partial\beta_{\ell}}m_{\ell}(d,\beta_{\ell})\right\|<\infty,\ \ell=1,2.$ 6. 6.

Defining $\psi_{a,\ell}^{(n)}(b):=\sum_{i=1}^{k_{\ell}}\frac{n_{\ell,i}}{n_{\ell}}(m_{\ell}(d_{\ell,i},a)-m_{\ell}(d_{\ell,i},b))^{2},$ we assume that for any $u>0$ there exists a constant $v_{u,\ell}>0$ such that

[TABLE]

6.2 Asymptotic properties of the OLS

In this section we derive the asymptotic normality of the parameter estimates in models with common parameters. Observing the definition of the OLS $\hat{\beta}=(\hat{\beta}_{0},\hat{\tilde{\beta}}_{1},\hat{\tilde{\beta}}_{2})$ in (2.3) we obtain, by taking the partial derivatives, $\hat{\beta}$ by the necessary conditions

[TABLE]

Defining

[TABLE]

for $\ell=1,2$ we can write

[TABLE]

where $0=0_{p-p^{\prime}}$ denotes the zero vector in $\mathbb{R}^{p-p^{\prime}}$ . Therefore the equations in (5.1) can be summarized to

[TABLE]

A Taylor expansion now yields

[TABLE]

and therefore $\hat{\beta}$ can be linearized as

[TABLE]

Due to the strong law of large numbers it holds

[TABLE]

where the matrices $\tilde{\Sigma}_{1}$ and $\tilde{\Sigma}_{2}$ are defined by

[TABLE]

respectively. Therefore, using the representation in (5.3) and the result in (5.4) we obtain the asymptotic normality of the estimate, that is

[TABLE]

6.3 Weak convergence of a stochastic process

The essential step in the proof is a result regarding the weak convergence of the process

[TABLE]

We further define $\tilde{\Delta}(d,\beta)=\tilde{\Delta}(d,\beta_{0},\tilde{\beta}_{1},\tilde{\beta}_{2})=\tilde{\Delta}(d,\beta_{1},\beta_{2})=m_{1}(d,\beta_{1})-m_{2}(d,\beta_{2}),$ which yields $\tilde{p}_{n}(d)=\tilde{\Delta}(d,\hat{\beta})-\tilde{\Delta}(d,\beta).$

Proof.

At first we derive a linearization of $\tilde{p}_{n}$ by using the Taylor expansion $\tilde{\Delta}(d,\hat{\beta})=\tilde{\Delta}(d,\beta)+\big{(}\tfrac{\partial\tilde{\Delta}(d,\beta)}{\partial\beta}\big{)}^{T}(\hat{\beta}-\beta)^{T}+R(\beta),$ where $R(\beta)$ is a remainder term and due to (5.3) we have $R(\beta)=O_{\mathbb{P}}(\tfrac{1}{n})$ . Therefore it holds $\sqrt{n}\tilde{p}_{n}(d)=\sqrt{n}\big{(}\tfrac{\partial\tilde{\Delta}(d,\beta)}{\partial\beta}\big{)}^{T}(\hat{\beta}-\beta)^{T}+o_{\mathbb{P}}(1)$ uniformly with respect to $d\in\mathcal{D}$ (note that due to the assumptions $\tfrac{\partial\tilde{\Delta}(d,\beta)}{\partial\beta}$ is continuous in $d\in\mathcal{D}$ and $\mathcal{D}$ is a compact set) and with the result from (5.3) this can be written as

[TABLE]

Observing (5.3) we define

[TABLE]

and obtain

[TABLE]

We now define $f(d):=\big{(}\tfrac{\partial\tilde{\Delta}(d,\beta)}{\partial\beta}\big{)}^{T}\Sigma^{-1/2},$ and we consider the map

[TABLE]

Obviously, using the same arguments as before, $\tfrac{\partial\tilde{\Delta}(d,\beta)}{\partial\beta}$ is continuous and therefore the same holds for $\tilde{\Phi}$ . Consequently, (5.5) and the Continuous Mapping Theorem (see for example van der Vaart, (2000)[p.7 f.]) yield

[TABLE]

uniformly with respect to $d\in\mathcal{D}$ , where $Z\sim\mathcal{N}(0,I_{p_{1}+p_{2}-p^{\prime}})$ , which proves the assertion. ∎

6.4 Proof of Theorem 2.1 and Remark 2.2

The assertion of Theorem 2.1 now follows by the same arguments as given in the proof of Theorem 4 in Dette et al., (2018). In particular it can be shown that

[TABLE]

where $\mathcal{E}^{\pm}=\{d\in\mathcal{D}|\ m_{1}(d,\beta_{1})-m_{2}(d,\beta_{2})=\pm d_{\infty}\}$ denote the sets of extremal points, that is those points, where the unknown difference $m_{1}-m_{2}$ attains it maximum absolute deviation. The details are omitted for the sake of brevity.

Similarly, note that in the situation of a common placebo group as described in Remark 2.2 the estimates are obtained by minimizing the sum of squares in (2.13). As $m_{\ell}^{0}(0,\tilde{\beta}^{0}_{\ell})=0$ , $\ell=1,2$ , this function is the same as the one which is obtained allocating the observations at placebo arbitrarily to the two groups. More precisely, we can also write the sum of squares in (2.13) as

[TABLE]

where $Y_{1,1,j}=Y_{{0,j}}$ ( $j=1,\ldots,n_{1,1}$ ) and $Y_{2,1,j}=Y_{{0,j}}$ ( $j=n_{1,1}+1,\ldots,n_{1,1}+n_{2,1}=n_{0}$ ). This corresponds to the minimzation of the sum of squares in (2.3) with a common intercept $b_{0,1}$ , and consequently this situation can be treated in the same way as considering two different placebo groups with a common intercept $b_{0,1}$ . By the same arguments as given in Appendix 6.2 the corresponding estimates are asymptotically normal distributed (if $n_{1,1}/n\to c_{1}$ and $n_{2,1}/n\to c_{2}$ for some constants $c_{1},c_{2},\in(0,1)$ ), and the proof in Section 6.3 shows that Theorem 2.1 remains valid in the model with a common placebo group. As a consequence we obtain the claim in Remark 2.2

Bibliography20

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Berger, (1982) Berger, R. L. (1982). Multiparameter hypothesis testing and acceptance sampling. Technometrics , 24:295–300.
2Bretz et al., (2016) Bretz, F., M. K., Dette, H., Liu, W., and Trampisch, M. (2016). Assessing the similarity of dose response and target doses in two non-overlapping subgroups. ar Xiv:1607.05424 .
3Collignon et al., (2018) Collignon, O., Moellenhoff, K., and Dette, H. (2018). Equivalence analyses of dissolution profiles with the mahalanobis distance: a regulatory perspective and a comparison with a parametric maximum deviation-based approach. Biometrical Journal .
4Dette et al., (2018) Dette, H., Möllenhoff, K., Volgushev, S., and Bretz, F. (2018). Equivalence of regression curves. Journal of the American Statistical Association , 113:711–729.
5Feng et al., (2015) Feng, L., Zou, C., Wang, Z., and Zhu, L. (2015). Robust comparison of regression curves. TEST , 24(1):185–204.
6Gabrielsson and Weiner, (2007) Gabrielsson, J. and Weiner, D. (2007). Pharmacokinetic and Pharmacodynamic Data Analysis: Concepts and Applications . Swedish Pharmaceutical Press, Stockholm, 4th edition.
7Gsteiger et al., (2011) Gsteiger, S., Bretz, F., and Liu, W. (2011). Simultaneous confidence bands for nonlinear regression models with application to population pharmacokinetic analyses. Journal of Biopharmaceutical Statistics , 21(4):708–725.
8Hoffelder, (2018) Hoffelder, T. (2018). Equivalence analyses of dissolution profiles with the mahalanobis distance. Biometrical Journal .

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Equivalence of regression curves sharing common parameters

Abstract

1 Introduction

2 Methodology

2.1 Models with common parameters

2.2 Testing equivalence of regression curves

Algorithm 2.1**.**

Theorem 2.1**.**

Remark 2.2**.**

2.3 Testing equivalence of model parameters

3 Finite sample properties

4 Clinical trial example

5 Conclusion

6 Appendix: Proof of Theorem 2.1 and Remark 2.2

6.1 Assumptions

6.2 Asymptotic properties of the OLS

6.3 Weak convergence of a stochastic process

Proof.

6.4 Proof of Theorem 2.1 and Remark 2.2

Algorithm 2.1.

Theorem 2.1.

Remark 2.2.