Optimal Designs for Prediction in Two Treatment Groups Random Coefficient Regression Models
Maryna Prus

TL;DR
This paper develops optimal experimental designs for estimating fixed effects and predicting random effects in two-treatment group random coefficient regression models, enhancing efficiency in such statistical analyses.
Contribution
It introduces A- and D-optimality criteria tailored for these models, providing a systematic approach to design optimal experiments for fixed and random effect estimation.
Findings
Optimal designs improve estimation accuracy.
Illustrative example demonstrates design behavior.
Criteria facilitate efficient experimental planning.
Abstract
The subject of this work is two treatment groups random coefficient regression models, in which observational units receive some group-specific treatments. We provide A- and D-optimality criteria for the estimation of the fixed parameter and the prediction of the random effects. We illustrate the behavior of optimal designs by a simple example.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimal Experimental Design Methods · Statistical Methods and Inference
**Optimal Designs for Prediction in Two Treatment Groups Random Coefficient Regression Models
Maryna Prus*Maryna Prus: [email protected]
Otto-von-Guericke University Magdeburg, Institute for Mathematical Stochastics,
PF 4120, D-39016 Magdeburg, Germany
Abstract: The subject of this work is two treatment groups random coefficient regression models, in which observational units receive some group-specific treatments. We provide A- and D-optimal designs (optimal group sizes) for the estimation of fixed effects and the prediction of random effects. We illustrate the obtained results by a numerical example.
Keywords: Mixed models, estimation and prediction, optimal design, cluster randomization
1 Introduction
The subject of this paper is optimal designs in two treatment groups random coefficient regression (RCR) models, in which observational units receive some group-specific kinds of treatment. These models are typically used for cluster randomized trials. For some real data examples see e. g. Piepho and Möhring (2010).
Optimal designs for fixed effects models with multiple groups are well discussed in the literature (see e. g. Bailey (2008), ch. 3). In models with random coefficients, the estimation of population parameters (fixed effects) is usually of prior interest (see e. g Fedorov and Jones (2005), Kunert et al. (2010), Van Breukelen and Candel (2018)). Optimal designs for the prediction of random effects in models with known population parameters have been considered in detail in Gladitz and Pilz (1982). Prus and Schwabe (2016) provide analytical results for the models with unknown population mean under the assumption of the same design for all individuals. Multiple group models with fixed group sizes were briefly discussed in Prus (2015), ch. 6.
Here, we consider two groups models with unknown population parameters and group specific designs. We provide A- and D-optimality criteria for the estimation and the prediction of fixed and random effects, respectively. Our main focus is optimal designs for the prediction.
The paper is structured in the following way: In Section 2 the two groups RCR model will be introduced. Section 3 presents the best linear unbiased estimator for the population parameter and the best linear unbiased predictor for individual random effects. Section 4 provides analytical results for the designs, which are optimal for the estimation or for the prediction. The paper will be concluded by a short discussion in Section 5.
2 Two Treatment Groups RCR Model
In this work we consider RCR models with two treatment groups and , where observational units (people, plots, studies, etc.) receive group-specific kinds of treatment, and , respectively. Further we will use the term "individuals" instead of "observational units" for simplicity. The first group includes individuals and the second group individuals. The groups sizes and are to be optimized and the total number of individuals in the experiment is fixed. The -th observation at the -th individual is described for the first group by
[TABLE]
and for the second group by
[TABLE]
where is the number of observations per individual, which is assumed to be the same for both groups, and are the observational errors in the first and the second groups with zero expected value and the variances and , respectively. and are the individual response parameters.
As it has been already mentioned above, we optimize the group sizes and . Therefore, we define the individual parameters for all individuals for both groups: \mbox{\boldmath{\theta}}_{i}:=(\mu_{1i},\mu_{2i})^{\top}, . The parameters can be interpreted as follows: Let individual be in the second group. Then the parameter describes the response, which would be observed at individual if the individual had received treatment , and is the usual response parameter of the individual. The latter parametrization allows to identify the best kind of treatment for each individual (for future treatments), which can be useful in practical situations where only one treatment per individual is possible.
The individual parameters are assumed to have an unknown mean \mbox{E}(\mbox{\boldmath{\theta}}_{i})=(\mu_{1},\mu_{2})^{\top}=:\mbox{\boldmath{\theta}}_{0} and a covariance matrix \mbox{Cov}(\mbox{\boldmath{\theta}}_{i})=\textrm{diag}(\sigma_{1}^{2}\,u,\sigma_{2}^{2}\,v) for given dispersions and . All individual parameters \mbox{\boldmath{\theta}}_{i} and all observational errors and , , , are assumed to be uncorrelated.
Note that this model is not a particular case of the RCR models considered by Prus and Schwabe (2016). In contrast to that paper, here the expected values for the response parameters and in the first and the second groups are not the same (which is equivalent to different regression functions in the parametrization using \mbox{\boldmath{\theta}}_{i}) and group sizes are non-fixed. Therefore, the approach proposed by Prus and Schwabe (2016) cannot be used.
Further we focus on the following contrasts: the population parameter and the individual random parameters , . describes the difference between the mean parameters and in the first and in the second group, respectively, and may be interpreted as the difference for individual between the present response and the response, which could have been observed if the individual had received another treatment. We search for the designs (group sizes), which are optimal for the estimation of or for the prediction of .
3 Estimation and Prediction
In this section we concentrate on the estimation of the population parameter and the prediction of the individual parameters . We use the standard notation and for the mean response in the first and the second treatment group, respectively, and obtain the following best linear unbiased estimator (BLUE) for .
Theorem 1**.**
- a)
The BLUE for the population parameter is given by
[TABLE] 2. b)
The variance of the BLUE is given by
[TABLE]
Further we use the notation and for the mean individual response for individuals in the first and in the second treatment group, respectively. We obtain the next result for the best linear unbiased predictor (BLUP) for the individual response parameter .
Theorem 2**.**
The BLUP for the individual response parameter is given by
[TABLE]
The next theorem presents the mean squared error (MSE) matrix for the total vector \hat{\mbox{\boldmath{\alpha}}}:=\left(\hat{\alpha}_{1},...,\hat{\alpha}_{N}\right)^{\top} of all BLUPs for all individuals.
Theorem 3**.**
The MSE matrix of the vector \hat{\mbox{\boldmath{\alpha}}} of individual predictors is given by
[TABLE]
for
[TABLE]
where denotes the vector of length with all entries equal to , is the identity matrix and denotes the Kronecker product,
[TABLE]
and
[TABLE]
Proofs of Theorems 1-3 are deferred to Appendix A.
4 Experimental Design
We define the experimental (exact) design for the RCR model with two treatment groups and as follows:
[TABLE]
For analytical purposes, we generalize this to the definition of an approximate design:
[TABLE]
where and are the allocation rates for the first and the second groups, respectively, and only the condition has to be satisfied. Then only the optimal allocation rate to the first group has to be determined for finding an optimal design.
Further we search for the allocation rates, which minimize variance (4) of the BLUE and MSE matrix (6) of the BLUP \hat{\mbox{\boldmath{\alpha}}} and concentrate on the A- (average) and D- (determinant) optimality criteria.
4.1 Optimal designs for estimation of population parameter
For the estimation of the population parameter both A- and D-criteria may be considered to be equal to variance (4) of the BLUE . We rewrite the variance of the estimator in terms of the approximate design and receive the following optimality criterion (neglecting the constant factor ):
[TABLE]
Criterion function (7) can be minimized directly. The optimal allocation rate for the estimation of the population parameter is is given by
[TABLE]
Note that the optimal allocation rate to the first group increases with increasing observational error variance and the dispersion of random effects for the first group and decreases with variance parameters and for the second group. Note also that if the observational error variance is the same for both groups (), is larger than for and smaller than for .
4.2 Optimal designs for prediction of individual response parameters
We define the -criterion for the prediction of the individual response parameters \mbox{\boldmath{\alpha}}=(\alpha_{1},...,\alpha_{N})^{\top} as the trace of MSE matrix (6):
[TABLE]
We extend this definition for approximate designs and receive the following result (neglecting the constant factor ).
Theorem 4**.**
The A-criterion for the prediction of the individual response parameters \mbox{\boldmath{\alpha}}=(\alpha_{1},...,\alpha_{N})^{\top} is given by
[TABLE]
where
[TABLE]
For this criterion no explicit formulas for optimal allocation rates can be provided. For given dispersion matrix of random effects (given values of an ), the problem of optimal designs can be solved numerically. In this work we are however interested in the behavior of optimal designs with respect to the variance parameters. Therefore, we consider some special cases, which illustrate this behavior.
Special case 1: and
If the variances and of the observational errors as well as the dispersions and (and consequently the variances and ) of the random effects are the same for both groups, A-criterion (10) simplifies to
[TABLE]
where
[TABLE]
for (neglecting the factor and the observational errors variance). We obtain for this criterion the optimal allocation rate , which is also optimal for estimation in the fixed-effects model ().
Special case 2:
If only the variances and of the observational errors are the same for both groups, the A-criterion for the prediction simplifies to
[TABLE]
where
[TABLE]
(neglecting the observational errors variance). The behavior of the optimal allocation rate will be considered for this case in a numerical example later.
The D-criterion for the prediction of \mbox{\boldmath{\alpha}}=(\alpha_{1},...,\alpha_{N})^{\top} can be defined as the logarithm of the determinant of MSE matrix (6):
[TABLE]
For approximate designs we obtain the next result.
Theorem 5**.**
The D-criterion for the prediction of the individual response parameters \mbox{\boldmath{\alpha}}=(\alpha_{1},...,\alpha_{N})^{\top} is given by
[TABLE]
where
[TABLE]
Proof.
We compute the determinant of MSE matrix (6) using the formula for block-matrices
[TABLE]
Then we rewrite the result in terms of the approximate design and receive criterion (14). ∎
Also for this criterion no finite analytical solutions for optimal designs can be provided. We consider the same special cases as for the A-criterion.
Special case 1: and
If the variances of the observational errors and the variances of the random effects are the same for the first and the second treatment groups, the D-criterion for the prediction is given by
[TABLE]
where for . Then we obtain the optimal allocation rate , which is also optimal for the fixed-effects model.
Special case 2:
If the variances of the observational errors are the same for both groups and the dispersions and of random effects may be different, we receive the following D-criterion for the prediction:
[TABLE]
If we additionally assume different dispersions of random effects (), we obtain the next result for the optimal designs.
Theorem 6**.**
If the variances of the observational errors are the same and the dispersions of the random effects are different for the first and the second treatment groups, the D-optimal allocation rate for the prediction of the individual response parameters \mbox{\boldmath{\alpha}}=(\alpha_{1},...,\alpha_{N})^{\top} is given by
[TABLE]
where
[TABLE]
Note that the optimal allocation rate to the first group increases with and decreases with . It can be easily proved that is larger than if and smaller than if .
For further considerations we rewrite the optimal allocation rate (17) as a function of the ratio of the variances of random effects in the first and the second groups and the variance parameter :
[TABLE]
Than it is easy to verify that increases with for () and decreases for .
4.3 Numerical example
In this section we illustrate the obtained results for the prediction of the individual response parameters by a numerical example. We consider the two groups RCR model with individuals, observations per individual and the same variance of observational errors for both treatment groups: (special case 2). We fix the ratio of the variances of random effects in the first and the second groups by , and . Figures 1 and 2 illustrate the behavior of the optimal allocation rates for the A- and D-criteria in dependence of the rescaled random effects variance in the first group , which is monotonic in and has been used instead the of random effects variance itself to cover all values of the variance by the finite interval .
As we can observe on the graphics, the optimal allocation rate to the first group increases with the rescaled variance from for to for the A-criterion and to for the D-criterion for if . If , the optimal allocation rate decreases from to and for the A- and D-criterion, respectively. For the model coincides with that considered in special case 1 and the optimal design remains the same (==0.5) for all values of .
Figures 3 and 4 exhibit the efficiencies of the balanced design for the prediction in the two groups model for the A- and D-criteria. For computing the A- and D-efficiencies, we use the formulas
[TABLE]
and
[TABLE]
respectively.
As we can observe, the efficiency of the balanced design decreases with increasing values of from for to and if and to and if for the A- and D-criteria, respectively. For the balanced design is optimal for the prediction, which explains the efficiency equal to for all values of the variance.
5 Discussion
In this work we have considered RCR models with two treatment groups. We have obtained the A- and D-optimality criteria for the estimation of the population parameter and the prediction of the individual response. For a particular case of the same observational error variance for both groups, we illustrate the behavior of the optimal designs by a numerical example. The optimal allocation rate to the first treatment group turns out to be larger than if the variance of individual random effects in the first group is larger than in the second group. Otherwise, the optimal allocation rate is smaller than . The efficiency of the balanced design, which assigns equal group sizes, is relatively high only for small values of the variances of random effects. The efficiency decreases fast with increasing variance.
For simplicity, we have assumed a diagonal covariance matrix of random effects. For more general covariance structure further considerations are needed. We have also assumed the same number of observations for all individuals. Optimal designs for models with different numbers of observations for different individuals may be one of the next steps in the research. Moreover, optimal designs for RCR models with more than two groups can be investigated in the future. Furthermore, some research on more robust design criteria (for example, minimax or maximin efficiency), which are not sensible with respect to variance parameters, may be an interesting extension of this work.
Appendix A Proofs of Theorems 1-4
The two treatment groups RCR model described by formulas (1) and (2) may be recognized as a special case of the general linear mixed model
[TABLE]
with specific design matrices and for fixed and random effects, respectively. are the observational errors, denotes the fixed effects vector and are the random effects. The random effects and the observational errors are assumed to have zero mean and to be all uncorrelated with corresponding full rank covariance matrices \mbox{Cov}\,(\mbox{\boldmath{\gamma}})=\mathbf{G} and \mbox{Cov}\,(\mbox{\boldmath{\varepsilon}})=\mathbf{R}.
In model (20) the BLUE for and the BLUP for are solutions of the mixed model equations
[TABLE]
if the fixed effects design matrix has full column rank (see e. g. Henderson et al. (1959) and Christensen (2002)). According to Henderson (1975), the joint MSE matrix for both \hat{\mbox{\boldmath{\beta}}} and \hat{\mbox{\boldmath{\gamma}}} is given by
[TABLE]
To make use of the theoretical results available for the general linear mixed model, we rewrite the two groups RCR model in form (20):
[TABLE]
where \mbox{\boldmath{\beta}}=\mbox{\boldmath{\theta}}_{0}, \mbox{\boldmath{\gamma}}=\mbox{\boldmath{\theta}}-\left(\mathbf{1}_{N}\otimes\mathbf{I}_{2}\right)\mbox{\boldmath{\beta}}, \mbox{\boldmath{\theta}}=(\theta_{1},\dots,\theta_{N}) and denotes the -th unit vector. The covariance matrices of the random effects and the observational errors in model (23) are given by and , respectively.
Using formula (21) we obtain the BLUEs and for the fixed effects and the BLUPs
[TABLE]
and
[TABLE]
for the random effects. Then the BLUE and the BLUP for the contrasts and can be computed as and and result to formulas (3) and (5), respectively. Variance (4) of the estimator can be determined directly.
Using formula (22) we obtain the following joint MSE matrix for both \hat{\mbox{\boldmath{\beta}}} and \hat{\mbox{\boldmath{\gamma}}}:
[TABLE]
where
[TABLE]
[TABLE]
and
[TABLE]
for
[TABLE]
and
[TABLE]
The MSE matrix of the prediction \hat{\mbox{\boldmath{\theta}}} can be written in terms of joint MSE matrix (26):
[TABLE]
Using this formula we obtain
[TABLE]
where
[TABLE]
[TABLE]
and
[TABLE]
Then we present the MSE matrix of the predictor \hat{\mbox{\boldmath{\alpha}}} in form
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Bailey (2008) Bailey, R. A. (2008). Design of Comparative Experiments . Cambridge University Press.
- 2Christensen (2002) Christensen, R. (2002). Plane Answers to Complex Questions: The Theory of Linear Models . Springer, New York.
- 3Fedorov and Jones (2005) Fedorov, V. and Jones, B. (2005). The design of multicentre trials. Statistical Methods in Medical Research , 14 , 205–248.
- 4Gladitz and Pilz (1982) Gladitz, J. and Pilz, J. (1982). Construction of optimal designs in random coefficient regression models. Mathematische Operationsforschung und Statistik, Series Statistics , 13 , 371–385.
- 5Henderson (1975) Henderson, C. R. (1975). Best linear unbiased estimation and prediction under a selection model. Biometrics , 31 , 423–477.
- 6Henderson et al. (1959) Henderson, C. R., Kempthorne, O., Searle, S. R., and von Krosigk, C. M. (1959). The estimation of environmental and genetic trends from records subject to culling. Biometrics , 15 , 192–218.
- 7Kunert et al. (2010) Kunert, J., Martin, R. J., and Eccleston, J. (2010). Optimal block designs comparing treatments with a control when the errors are correlated. Journal of Statistical Planning and Inference , 140 , 2719–2738.
- 8Piepho and Möhring (2010) Piepho, H. P. and Möhring, J. (2010). Generation means analysis using mixed models. Crop Science , 50 , 1674–1680.
