Admissibility of invariant tests for means with covariates
Ming-Tien Tsai

TL;DR
This paper investigates the admissibility of Rao's invariant tests for mean vectors with covariates in a multinormal setting, showing Rao's U-test is admissible when incorporating specific information, unlike Hotelling's T-squared test.
Contribution
It proves the admissibility of Rao's U-test in a multinormal model with covariates, contrasting with the inadmissibility of Hotelling's T-squared test.
Findings
Rao's U-test is admissible when fully incorporating covariate information.
Hotelling's T-squared test is shown to be inadmissible in this setting.
The union-intersection principle is key to the admissibility results.
Abstract
For a multinormal distribution with a -dimensional mean vector and an arbitrary unknown dispersion matrix , Rao ([9], [10]) proposed two tests for the problem of testing , where . These tests are referred to as Rao's -test (likelihood ratio test) and Rao's -test (union-intersection test), respectively. This work is inspired by the well-known work of Marden and Perlman [6] who claimed that Hotelling's -test is admissible while Rao's -test is inadmissible. Both Rao's -test and Hotelling's -test can be constructed by applying the union-intersection principle that incorporates the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Admissibility of invariant tests for means with covariates
Ming-Tien Tsai
Institute of Statistical Science, Academia Sinica, Taipei 11529, R.O.C.
For a multinormal distribution with a -dimensional mean vector and an arbitrary unknown dispersion matrix , Rao ([9], [10]) proposed two tests for the problem of testing H_{0}:{\mbox{\boldmath\theta}}_{1}={\bf 0},{\mbox{\boldmath\theta}}_{2}={\bf 0},{\mbox{\boldmath\Sigma}}~{}\hbox{unspecified},~{}\hbox{versus}~{}H_{1}:{\mbox{\boldmath\theta}}_{1}\neq{\bf 0},{\mbox{\boldmath\theta}}_{2}={\bf 0},{\mbox{\boldmath\Sigma}}~{}\hbox{unspecified}, where {\mbox{\boldmath\theta}}^{{}^{\prime}}=({\mbox{\boldmath\theta}}^{{}^{\prime}}_{1},{\mbox{\boldmath\theta}}^{{}^{\prime}}_{2}). These tests are referred to as Rao’s -test (likelihood ratio test) and Rao’s -test (union-intersection test), respectively. This work is inspired by the well-known work of Marden and Perlman [6] who claimed that Hotelling’s -test is admissible while Rao’s -test is inadmissible. Both Rao’s -test and Hotelling’s -test can be constructed by applying the union-intersection principle that incorporates the information {\mbox{\boldmath\theta}}_{2}={\bf 0} for Rao’s -test statistic but does not incorporate it for Hotelling’s -test statistic. Rao’s -test is believed to exhibit some optimal properties. Rao’s -test is shown to be admissible by fully incorporating the information {\mbox{\boldmath\theta}}_{2}={\bf 0}, but Hotelling’s -test is inadmissible.
AMS subject classification. 62C15, 62H15.
Keywords: -admissibility; -admissibility; Generalized inverse of matrix, Matrix-concave, Matrix-convex.
Abbreviated title: Admissibility of invariant tests for means with covariates.
1. Introduction
Let {} be independent and identically distributed random vectors (i.i.d.r.v.) with a -variate normal distribution with mean vector and dispersion matrix , where is assumed to be positive definite (p.d.). Partition and as
[TABLE]
where {\mbox{\boldmath\theta}}_{1}:p_{1}\times 1,{\mbox{\boldmath\theta}}_{2}:p_{2}\times 1,{\mbox{\boldmath\Sigma}}_{11}:p_{1}\times p_{1},{\mbox{\boldmath\Sigma}}_{22}:p_{2}\times p_{2},p_{1}+p_{2}=p, The problem of interest is to test
[TABLE]
For every , let
[TABLE]
and express Hotelling’s -statistic as
[TABLE]
Partition and similarly as in (1.1), and define
[TABLE]
For the problem (1.2), Rao ([9], [10]) proposed two test statistics which are of the forms
[TABLE]
and
[TABLE]
respectively. In the literature, these two test statistics are called Rao’s W and U statistics, respectively. The test statistics is derived by the likelihood ratio principle. Marden and Perlman [6] showed that for problem (1.2) both Rao’s -test and Rao’s -test are similar and unbiased.
The invariance of problem (1.2) under a group of linear transformations, where is the group of nonsingular matrices of the form
[TABLE]
with and , can be exploited so that the group acts on the sample space via {\bf g}:({\mbox{\bar{\bf X}}},~{}{\bf S})\to({\bf g}{\mbox{\bar{\bf X}}},~{}{\bf g}{\bf S}{\bf g}^{{}^{\prime}}), and on the parameter space via {\bf g}:({\mbox{\boldmath\theta}},~{}{\mbox{\boldmath\Sigma}})\to({\bf g}{\mbox{\boldmath\theta}},~{}{\bf g}{\mbox{\boldmath\Sigma}}{\bf g}^{{}^{\prime}}). Let {\mbox{\boldmath\theta}}_{1:2} and {\mbox{\boldmath\Sigma}}_{11:2} be defined similarly as in (1.5) and (1.6) but such that and replace and , respectively. Adopting the notions of Marden and Perlman [6], the maximal invariant statistic is the pair with defined as in (1.7) and M=n(n-1){\mbox{\bar{\bf X}}}^{\prime}_{2}{\bf S}^{-1}_{22}{\mbox{\bar{\bf X}}}_{2}, and correspondingly, the maximal invariant parameter is the pair with {\Delta}_{1}=n{\mbox{\boldmath\theta}}^{{}^{\prime}}_{1:2}{\mbox{\boldmath\Sigma}}^{-1}_{11:2}{\mbox{\boldmath\theta}}_{1:2} and {\Delta}_{2}=n{\mbox{\boldmath\theta}}^{{}^{\prime}}_{2}{\mbox{\boldmath\Sigma}}^{-1}_{22}{\mbox{\boldmath\theta}}_{2}.
Using only -invariant tests, Marden and Perlman ([6], p. 27) concluded that the problem (1.2) reduces to that of testing
[TABLE]
based on , where {\Delta}=n{\mbox{\boldmath\theta}}^{{}^{\prime}}_{1}{\mbox{\boldmath\Sigma}}^{-1}_{11:2}{\mbox{\boldmath\theta}}_{1}. Marden and Perlman [6] established the necessary and sufficient conditions for the admissibility of problem (1.10). They considered the homeomorphic transformations of , and inferred that Rao’s -test is inadmissible when only -invariant tests are applied to problem (1.2), but the overall Hotelling -test is admissible. However, their conclusions are against our statistical intuition. Also, their simulation studies (see Tables 4.1a-4.1c) indicate a totally different story. As such, this work attempts to clarify these phenomena.
First, note that if only the -invariant tests are applied, then problem (1.2) does not reduce to that of testing problem (1.10) but should reduce to that of testing
[TABLE]
Moreover, problems (1.11) and (1.10) are not identical. Problem (1.11) is easily seen to imply problem (1.10), but not vice versa. Problem can be regarded as the union of problem and the following subproblems: (i). (ii). and (iii). . Problem provides more insight than does problem into both in the null hypothesis and in the alternative hypothesis. In fact, problem is a two-dimensional testing problem in which is the nuisance parameter, while problem is a one-dimensional testing problem. Problem is a subproblem of problem , so intuitively the optimal tests for problem (which is equivalent to the hypothesis testing problem (1.2)) are not necessarily optimal for problem , and vice versa. Marden and Perlman [6] also clearly made this point (for details see the last three lines of page 28 of Marden and Perlman [6]). Therefore, based on the optimal criterions set up for the two-dimensional testing problem (1.10), to infer problem (1.11), which is only a one-dimensional testing problem, the conclusions drawn may provide misleading messages. To ensure the information \mbox{\boldmath\mu}_{2}={\bf 0} (i.e., ) being incorporated for problem , Marden and Perlman [6] further made an assumption that is an ancillary statistic. Because the density function of the ancillary statistic does not depend on the parameter both under the null hypothesis and under the alternative hypothesis, and hence the whole statistical inference should depend on the conditional density function of given , which is a noncentral -type distribution, in their set up. However, for the case of Hotelling’s -test, Marden and Perlman ([6], page 50) adopted the exponential family for the problem (1.10) [not for the problem (1.11)] set up for their statistical inference.
Let be the general linear group. The problem of testing H^{u}_{0}:{\mbox{\boldmath\theta}}={\bf 0} versus H^{u}_{1}:{\mbox{\boldmath\theta}}\neq{\bf 0} is -invariant. When only -invariant tests are performed, this problem reduces to that of testing , where . For this -invariant testing problem, Hotelling’s -test is well-known to be the uniformly most powerful test (Simaika [14]), and so is admissible. Schwartz [13] employed the Birnbaum-Stein method (Birnbaum [2], Stein [15]) to study the admissibility of fully -invariant tests in the multivariate analysis of variance setting. Problem (1.2) is not -invariant, although it is -invariant. Therefore, studying the power domination problems of Hotelling’s -test, Rao’s -test and Rao’s -test for problem (1.2) via the fully -invariant approach may not be helpful. The group is amenable and meets the conditions of the Hunt-Stein theorem (Lehmann [5]). Therefore, any minimax questions in problem (1.2) can be reduced by the group .
Notably, problems (1.2) and (1.11) are the problems of testing against restricted alternatives. However, problem (1.10) is not such a problem. Therefore, by utilizing problem (1.10) to draw inferences concerning problem (1.2), we may overlook the intrinsic nature of the restricted alternative (because , which is determined directly from the basic assumption \mbox{\boldmath\theta}_{2}={\bf 0}) when applying results in the literature or developing new theories.
The exponential structure of the distribution of (\mbox{\bar{\bf X}},{\bf S}) is incorporated to generalize the Birnbaum-Stein method for problem (1.2). As a result, Section 2 presents two main results: the acceptance region of Rao’s -test is convex, and Rao’s -test is admissible. Section 3 applies Eaton’s [3] results to show that Hotelling’s -test is inadmissible for problem (1.2). The discussion regarding the Rao -test is given in Section 4. Some general remarks are given in the final section. The Appendix provides six lemmas to show the convexity of the acceptance region of Rao’s -test.
2. Admissibility of Rao’s -test
Stein [15] proved in detail that Hotelling’s -test is admissible for the problem of testing H^{u}_{0}:{\mbox{\boldmath\theta}}={\bf 0} against the global alternative H^{u}_{1}:{\mbox{\boldmath\theta}}\neq{\bf 0}. This proof can also be found in Anderson’s book ([1], p. 188-190). The main purpose of this section is to incorporate the Birnbaum-Stein method to demonstrate that Rao’s -test is admissible for problem (1.2). Whether the acceptance region of Rao’s -test is a convex set must first be determined. Let
[TABLE]
for a suitable k, and
[TABLE]
which is positive semi-definite (p.s.d.) and of rank . Also let
[TABLE]
Let be the Moore-Penrose generalized inverse of . Then,
[TABLE]
which is shown to be the Moore-Penrose generalized inverse of in Lemma 2 of the Appendix, is easily established. For the notions related to the Moore-Penrose generalized inverse of matrices and matrix-convex (matrix-concave) functions, refer to Rao and Mitra [11] and Marshall and Olkin [7], respectively. The Appendix develops six lemmas related to the Moore-Penrose generalized inverse of matrices, matrix-convex and matrix-concave.
Anderson ([1], problem 17 of page 193) claimed that the acceptance region of Hotelling -test
[TABLE]
Notably, by (2.1) and (2.2) the accepted region of Rao’s -test can be rewritten as
[TABLE]
where is defined in (2.2). Let denote that the matrix is p.s.d. throughout this paper. The lemmas developed in the Appendix are used to generalize Anderson’s result (2.5) to the following theorem.
Theorem 1. Let be the set of all positive definite matrices. Then {\mathcal{A}}_{U}=\{\,(\mbox{\bar{\bf X}},{\bf S})\mid n(n-1)\mbox{\bar{\bf X}}_{1:2}^{\prime}{\bf S}_{11:2}^{-1}\mbox{\bar{\bf X}}_{1:2}\leq k\,\} is convex on .
Proof. By (2.6), {\mathcal{A}}_{U}=\{\,(\mbox{\bar{\bf X}},{\bf S})\mid n(n-1)\mbox{\bar{\bf X}}^{\prime}{\bf B}^{+}({\bf S})\mbox{\bar{\bf X}}\leq k\,\}. Let be defined as in (2.3), then by Lemma 2 in the Appendix, it is the Moore-Penrose generalized inverse of . Lemma shows that is matrix concave on , that is,
[TABLE]
Therefore, by Lemma 5,
[TABLE]
By the inequality (2.8) and Lemma 6,
[TABLE]
For the definitions of -admissible and -admissible, we may refer to the page 306 of Lehmann [6].
Remark. For testing against global alternative, (i.e., H_{0}:{\mbox{\boldmath\theta}}={\bf 0} against H_{1}:{\mbox{\boldmath\theta}}\neq{\bf 0}), since the Hotelling’s -test statistic is the monotone function of , where , Stein worked on the space . Theorem 5.6.6 of Anderson [1] does not require that it is necessary to work on the space . Note that for the testing H_{0}:{\mbox{\boldmath\theta}}_{1}={\bf 0},{\mbox{\boldmath\theta}}_{2}={\bf 0} against the restricted alternative H_{1}:{\mbox{\boldmath\theta}}_{1}\neq{\bf 0},{\mbox{\boldmath\theta}}_{2}={\bf 0}, the Rao’s -test statistic is no longer to be a monotone function of any more. To overcome the difficulty, we work on the space instead. Let {\mathcal{H}}^{*}_{c}=\{\,(\mbox{\bar{\bf X}},{\bf S})\mid n{\mbox{\boldmath\theta}}^{{}^{\prime}}{\mbox{\boldmath\Sigma}}^{-1}\mbox{\bar{\bf X}}-\frac{1}{2}\operatorname{tr}({\mbox{\boldmath\Sigma}}^{-1}{\bf S})>c\,\}. It is easy to note that is a half-space, and hence the assumption that the acceptance region is disjoint with the half-space on the space holds. Let
[TABLE]
Note that (\mbox{\bar{\bf X}},{\bf S})\in{\mathcal{H}}^{a}_{c} implies that (\mbox{\bar{\bf X}},{\bf S})\in{\mathcal{H}}^{*}_{c}. Thus,
[TABLE]
And hence, the intersection of and is also empty. As such, we may work the proof of Theorem 2 on the space .
Theorem 2. For the problem (1.2), Rao’s -test is -admissible.
Proof. The likelihood function of , , is
[TABLE]
Let , where \boldsymbol{w}^{(1)}=\mbox{\boldmath\Sigma}^{-1}\mbox{\boldmath\theta} and , where (\sigma^{ij})=\mbox{\boldmath\Sigma}^{-1}. Let \boldsymbol{w}^{(1)^{\prime}}=\mbox{\boldmath\nu}^{\prime}=(\mbox{\boldmath\nu}_{1}^{\prime},\mbox{\boldmath\nu}_{2}^{\prime})^{\prime}. By Theorem 1, is convex on . Consider the other condition of theorem 5.6.5. (Anderson [1]), is assumed to be disjoint with the subspace
[TABLE]
where is symmetric, for some .
Theorem 8 presented by Lehmann ([5], page 307) can be applied if can be demonstrated for , which can be accomplished with the following two steps: (I) {\bf I}+\lambda\mbox{\boldmath\Lambda} is p.d., and (II) \mbox{\boldmath\theta}_{0}+\lambda\mbox{\boldmath\theta}\in H_{1} for .
(I). That {\bf I}+\lambda\mbox{\boldmath\Lambda} is p.d. is shown if can be shown to be p.s.d., for . Suppose that is not p.s.d., then by arguments similar to those in Anderson ([1], p. 189-190) it can be written
[TABLE]
where is nonsingular. Let \mbox{\bar{\bf X}}=(1/{\gamma}){\bf X}_{0} and
[TABLE]
Then
[TABLE]
which is greater than c for sufficiently large . Thus, the subspace reduces to
[TABLE]
for sufficiently large . Obviously, as approaches infinity. Now, let
[TABLE]
with . Then, (2.16) reduces to
[TABLE]
for sufficiently large . It can be easily seen that for sufficiently large . Furthermore,
[TABLE]
Thus, if is not p.s.d., then
[TABLE]
which leads to a contradiction. Therefore, is p.s.d..
To proceed with step (II), note that {\bf I}+{\mbox{\boldmath\Lambda}} is p.d., and without loss of generality, its inverse can be denoted by , so ({\bf I}+{\mbox{\boldmath\Lambda}})^{-1}={\mbox{\boldmath\Sigma}}. Some more notation is needed. Let
[TABLE]
and write
[TABLE]
Notably, \widetilde{\mbox{\boldmath\Sigma}}=\widetilde{\mbox{\boldmath\Sigma}}^{\prime}. Let {\bf Z}=\boldsymbol{g}\mbox{\bar{\bf X}} and . Then,
[TABLE]
[TABLE]
and
[TABLE]
[TABLE]
Thus, the subspace becomes
[TABLE]
Equation (2.25) indicates that the hypothesis testing problem (1.2) H_{0}:{\mbox{\boldmath\theta}}_{1}={\bf 0},{\mbox{\boldmath\theta}}_{2}={\bf 0} versus H_{1}:{\mbox{\boldmath\theta}}_{1}\neq{\bf 0},{\mbox{\boldmath\theta}}_{2}={\bf 0} is equivalent to the hypothesis testing problem
[TABLE]
And hence to show that \mbox{\boldmath\theta}_{0}+\lambda\mbox{\boldmath\theta}\in H_{1}, for problem (1.2) is equivalent to showing that \widetilde{}\mbox{\boldmath\nu}_{0}+\lambda\widetilde{}\mbox{\boldmath\nu}\in H_{1}, for problem (2.27). Next, step (II) is considered.
(II). To show that \widetilde{}\mbox{\boldmath\nu}_{0}+\lambda\widetilde{}\mbox{\boldmath\nu}\in H_{1} for , the aim is to demonstrate that \widetilde{\mbox{\boldmath\nu}}_{1}\neq{\bf 0} and \widetilde{\mbox{\boldmath\nu}}_{2}={\bf 0}. If the statement that \widetilde{\mbox{\boldmath\nu}}_{1}\neq{\bf 0} and \widetilde{\mbox{\boldmath\nu}}_{2}={\bf 0} is not true, then, there are three cases (i) \widetilde{\mbox{\boldmath\nu}}_{1}\neq{\bf 0}, \widetilde{\mbox{\boldmath\nu}}_{2}\neq{\bf 0}, (ii) \widetilde{\mbox{\boldmath\nu}}_{1}={\bf 0}, \widetilde{\mbox{\boldmath\nu}}_{2}\neq{\bf 0} and (iii) \widetilde{\mbox{\boldmath\nu}}_{1}={\bf 0}, \widetilde{\mbox{\boldmath\nu}}_{2}={\bf 0}. We assume that (i), (ii) and (iii) are true, and then show that those assumptions to lead to contradictions. To proceed, it is enough to consider the situation that {\bf S}={\mbox{\boldmath\Sigma}}.
Note that given {\bf S}={\mbox{\boldmath\Sigma}}, then
[TABLE]
Accordingly, both sets and are reduced to p-dimensional sets (using the notation loosely)
[TABLE]
and
[TABLE]
respectively. Notably, given that {\bf S}=\mbox{\boldmath\Sigma}, the assumption that the sets and are disjoint still holds.
First, (i) \widetilde{\mbox{\boldmath\nu}}_{1}\neq{\bf 0}, \widetilde{\mbox{\boldmath\nu}}_{2}\neq{\bf 0} is assumed. In the problem of testing : \widetilde{\mbox{\boldmath\nu}}_{1}={\bf 0} versus : \widetilde{\mbox{\boldmath\nu}}_{1}\neq{\bf 0}, whenever \widetilde{\mbox{\boldmath\nu}}_{1}\neq{\bf 0}, then there exists a constant which does not depend on and \mbox{\boldmath\Sigma}_{11:2} such that
[TABLE]
for any , where
[TABLE]
and
[TABLE]
This is equivalent to the existence of a constant such that
[TABLE]
for any . Rewrite as
[TABLE]
Consider
[TABLE]
Then, from equation (2.34),
[TABLE]
where
[TABLE]
Hence,
[TABLE]
Let
[TABLE]
\mbox{\boldmath\Sigma}_{22}\widetilde{\mbox{\boldmath\nu}}_{2}\in{\mathcal{B}}(\mbox{\boldmath\Sigma}_{22}\widetilde{\mbox{\boldmath\nu}}_{2}), so {\mathcal{B}}(\mbox{\boldmath\Sigma}_{22}\widetilde{\mbox{\boldmath\nu}}_{2})\neq\varnothing; and {\bf Z}_{2}\in{\mathcal{B}}(\mbox{\boldmath\Sigma}_{22}\widetilde{\mbox{\boldmath\nu}}_{2}) implies that . Let
[TABLE]
Now, (2.39) implies
[TABLE]
Notably,
[TABLE]
Thus,
[TABLE]
Notably,
[TABLE]
Namely, if \widetilde{\mbox{\boldmath\nu}}_{1}\neq{\bf 0} and \widetilde{\mbox{\boldmath\nu}}_{2}\neq{\bf 0}, then . This implies that , and leads to a contradiction.
Next, (ii) \widetilde{\mbox{\boldmath\nu}}_{1}={\bf 0}, \widetilde{\mbox{\boldmath\nu}}_{2}\neq{\bf 0} is assumed. Then, the set in equation (2.35) reduces to
[TABLE]
In passing, the condition n\widetilde{\mbox{\boldmath\nu}}_{2}^{\prime}\mbox{\boldmath\Sigma}_{22}\widetilde{\mbox{\boldmath\nu}}_{2}-p-2c>0 is needed to ensure that is not an empty set. Consider {\bf Z}_{2}=\mbox{\boldmath\Sigma}_{22}\widetilde{\mbox{\boldmath\nu}}_{2}; notably, (a) is not an empty set and (b) , which implies that . This leads to a contradiction.
Finally, in case (iii) \widetilde{\mbox{\boldmath\nu}}_{1}={\bf 0} and \widetilde{\mbox{\boldmath\nu}}_{2}={\bf 0}. Then, the set in equation (2.35) reduces to
[TABLE]
In this case, is required to ensure that is not an empty set. That can be easily seen, so . This leads to a contradiction.
The discussions of (i), (ii) and (iii) can be taken together to imply that
[TABLE]
Therefore, Rao’s -test satisfies the conditions of Theorem 8 of Lehmann ([5], pages 307). Marden and Perlman [6] have shown that Rao’s -test is similar and unbiased, and the theorem follows by Corollary 2 of Lehmann ([5], page 308).
3. Inadmissibility of Hotelling’s -test
Marden and Perlman ([6], p. 49) pointed out that, “by utilizing the exponential structure of the distribution of (\mbox{\bar{\bf X}},{\bf S}), the method of Stein [15] and Schwartz [13] can be applied to reveal the overall test is admissible for problem (1.2). Based on the logarithm of the joint density of (\mbox{\bar{\bf X}},{\bf S}), Marden and Perlman ([6], pages 49-50) claimed that, according to the theorem of Stein [15], the set (in our notation)
[TABLE]
is an admissible acceptance region in problem (1.2) for any subset {\mbox{\boldmath\Theta}}_{2}\subset{\mbox{\boldmath\Theta}}_{1}, where {\mbox{\boldmath\Theta}}_{1}=\{\,(\mbox{\boldmath\theta},\mbox{\boldmath\Sigma})\mid{\mbox{\boldmath\theta}}_{1}\neq{\bf 0},{\mbox{\boldmath\theta}}_{2}={\bf 0},{\mbox{\boldmath\Sigma}}~{}\hbox{p.d.}\}. Note that the notion in (3.1) is essentially the same as the one presented in Marden and Perlman ([6], page 50), but omits the terms for which the parameters and statistics can be separated. The method presented in (3.1) is easier to handle. However, Marden and Perlman [6] did not offer an analytical proof for their assertion. Note that the problems considered by Stein [15] and Schwartz [13] are fully -invariant. For the -invariant models considered by Stein [15], {\mbox{\boldmath\Theta}}_{1}=\{\,(\mbox{\boldmath\theta},\mbox{\boldmath\Sigma})\mid{\mbox{\boldmath\theta}}\neq{\bf 0},{\mbox{\boldmath\Sigma}}~{}\hbox{p.d.}\} for the problem of testing H^{u}_{0}:{\mbox{\boldmath\theta}}={\bf 0} against the global alternative H^{u}_{1}:{\mbox{\boldmath\theta}}\neq{\bf 0}. Note that and are independent, and and are orthogonal. Take {\mbox{\boldmath\Theta}}_{2}=\{\,(\mbox{\boldmath\theta},\mbox{\boldmath\Sigma})\mid{\mbox{\boldmath\theta}}\neq{\bf 0},{\mbox{\boldmath\theta}}^{{}^{\prime}}{\mbox{\boldmath\Sigma}}^{-1}{\mbox{\boldmath\theta}}=1,{\mbox{\boldmath\Sigma}}~{}\hbox{p.d.}\}, substituting the estimator of into and adopting the notation defined in Section 2, yields
[TABLE]
[TABLE]
where \widetilde{\bf X}=({\mbox{\bar{\bf X}}}^{{}^{\prime}}_{1:2},{\mbox{\bar{\bf X}}}^{{}^{\prime}}_{2})^{{}^{\prime}},\widetilde{{\bf S}}=\hbox{diag}({\bf S}_{11.2},{\bf S}_{22}) and . The set (3.2) is equivalent to the acceptance region of Hotelling’s -test.
Note that problem (1.2) is not -invariant, although it is -invariant. Marden and Perlman [6] transformed set (3.1) into a -invariant set to work out set (3.1) when {\mbox{\boldmath\Theta}}_{2}={\mbox{\boldmath\Theta}}_{1}, and reached the conclusion that the -invariant set that corresponds to set (3.1) is equivalent to the acceptance region of Hotelling’s -test. However, in their derivations (Marden and Perlman [6], p. 50) the restriction ( defined in Section 1) had to be imposed, thus corresponding to the assumed condition that {\mbox{\boldmath\theta}}_{2}={\bf 0} in set (3.1) was overlooked in their new -invariant set. Rather than focusing only on the -invariant set, this work directly determines the form of set (3.1) when {\mbox{\boldmath\Theta}}_{2}=\{\,(\mbox{\boldmath\theta},\mbox{\boldmath\Sigma})\mid{\mbox{\boldmath\theta}}_{1}\neq{\bf 0},{\mbox{\boldmath\theta}}_{2}={\bf 0},{\mbox{\boldmath\theta}}_{1:2}^{{}^{\prime}}{\mbox{\boldmath\Sigma}}_{11:2}^{-1}{\mbox{\boldmath\theta}}_{1:2}=1,{\mbox{\boldmath\Sigma}}~{}\hbox{p.d.}\}. Similar to arguments above, (3.1) then becomes
[TABLE]
Notably, the set (3.3) is also a -invariant set, but it is not equivalent to the acceptance region of Hotelling’s -test
[TABLE]
for a suitable . Note that, (3.3) is obtained by using the information that {\mbox{\boldmath\theta}}_{2}={\bf 0}, but (3.2) is obtained without using that information.
Due to the fact that and are independent, and and are orthogonal; based on the above discussions, an admissible acceptance region for the problem of testing H^{u}_{0}:{\mbox{\boldmath\theta}}={\bf 0} against the global alternative H^{u}_{1}:{\mbox{\boldmath\theta}}\neq{\bf 0} can be simply taken as
[TABLE]
for a suitable . For problem (1.2), another admissible acceptance region is of the form
[TABLE]
which is the acceptance region of Rao’s -test. Instead, based on the unproved assertion (3.1), we have provided an analytical proof that Rao’s -test is admissible for problem (1.2) by using the Birnbaum-Stein method in Section 2.
Notably, for each , and can be obtained by maximizing [n(n-1)]^{1/2}{\bf b}^{\prime}({\mbox{\bar{\bf X}}}^{{}^{\prime}}_{1:2},{\mbox{\bar{\bf X}}}^{{}^{\prime}}_{2})^{{}^{\prime}} under the condition that is constant over the sets and , respectively. Thus, both Hotelling’s -test statistic and Rao’s -test statistic can be constructed by applying the union-intersection (UI) principle of Roy [12] for the problem of testing H^{u}_{0}:{\mbox{\boldmath\theta}}={\bf 0} against the global alternative H^{u}_{1}:{\mbox{\boldmath\theta}}\neq{\bf 0}, and the problem (1.2) of testing H_{0}:{\mbox{\boldmath\theta}}_{1}={\bf 0},{\mbox{\boldmath\theta}}_{2}={\bf 0} against the alternative H_{1}:{\mbox{\boldmath\theta}}_{1}\neq{\bf 0},{\mbox{\boldmath\theta}}_{2}={\bf 0}, respectively. Therefore, for problem (1.2) Rao’s test based on may be regarded as a UI test. Although the Rao -test statistic is constructed by incorporating the information of {\mbox{\boldmath\theta}}_{2}={\bf 0}~{}(\widetilde{\mbox{\boldmath\nu}}_{2}={\bf 0}), the Hotelling -test statistic is not thus determined. Therefore, Hotelling’s -test may be reasonably thought to be dominated by Rao’s -test for problem (1.2). This assertion can be numerically confirmed by the results of Tables 4.1a, 4.1b and 4.1c of Marden and Perlman [6]. The Birnbaum-Stein method fails to determine whether Hotelling’s -test is inadmissible for problem (1.2). This shortcoming is overcomed herein by applying Eaton’s [3] basic results to an essentially complete class of test functions for problem (1.2). Let be Eaton’s essentially complete class of tests, so for any test , there exists a test such that is at least as good as .
Theorem 3. For the problem (1.2), Hotelling’s -test is inadmissible.
Proof. Following Eaton [3], the following is defined.
[TABLE]
Let be the smallest closed convex cone that contains . Then the dual cone of is defined as
[TABLE]
Notably, that is contained in some half-space is not a necessary condition but a sufficient condition that ensures the dual cone is a non-empty set. Note that although is unknown, but it is fixed. By (3.5) and (3.6), thus we have
[TABLE]
which is contained in a half-space of . Similarly, its dual cone is
[TABLE]
where
[TABLE]
Therefore, , which is an unbounded set.
The acceptance region of Hotelling’s -test is given by
[TABLE]
where is the upper % point of the null hypothesis distribution of (which is linked to a F-distribution). For fixed , is an ellipsoidal set with origin , and is bounded, whereas , as shown above, is still unbounded. Therefore, the proposition 2.1 of Eaton that the dual cone should be a subset of the acceptance region of Hotelling’s -test (Eaton [3], section 4, p. 1887) is not tenable, and thus Hotelling’s -test is not a member of an essentially complete class.
4. Whither Rao’s -test?
In passing, both the Hotelling -test statistic and the Rao -test statistic can be obtained by applying the likelihood ratio principle. The Rao -test statistic is constructed by incorporating the information that {\mbox{\boldmath\theta}}_{2}={\bf 0}, but the Hotelling -test statistic is not thus obtained. For problem (1.10), Marden and Perlman [6] adopted the generalized Bayes approach to show that Rao’s -test is admissible when and is inadmissible when . Section 1 stated that restricting problem (1.2) to -invariant tests does not reduce it to problem (1.10), but should reduce it to problem (1.11), and that problem (1.11) and problem (1.10) differ. Thus, the optimal criteria established for problem (1.10) to draw inferences for problem (1.2) may lead to conclusions that convey misleading messages. The generalized Bayes approach of Marden and Perlman [6] can be adopted to characterize in parallel the sufficient and necessary conditions for the admissibility of the problem (1.11). A situation in which the Rao -test can be further demonstrated to be a generalized Bayes test, and the corresponding optimality conditions for the problem (1.11) can be satisfied, can lead to completion of the task. Birnbaum [2], in the context of complete class type theorems, noted that for testing H^{u}_{0}:{\mbox{\boldmath\theta}}={\bf 0} versus H^{u}_{1}:{\mbox{\boldmath\theta}}\neq{\bf 0}, a test is admissible if and only if it is a generalized Bayes test. Some admissible tests in the literature are not the Bayes tests for other hypothesis testing problems (Oosterhoff [8], p.82). For problem (1.2), the set of proper Bayes tests and their weak limits might only constitute a proper subset of an essentially complete class of tests. On the other hand, the Birnbaum-Stein method stipulated the convexity assumption for the acceptance regions of tests. However, for problem (1.2) the acceptance region of Rao’s -test is a hyperbolic type set, which is no longer convex. Therefore, the Birnbaum-Stein method fails to be applicable to Rao’s -test. A future study will investigate the problem of the optimality of Rao’s -test.
5. Some remarks
The Hotelling -test enjoys many optimal properties of the Neyman-Pearson hypothesis testing theory when testing against the global alternative. These include similarity, unbiasedness, power monotonicity, most stringency, uniformly most powerful invariant and alpha-admissibility etc.. However, it is still open to debate whether the Hotelling -test is minimax. For the hypothesis testing problem (1.2) we show that the Hotelling -test is not a member of an essentially complete class (Eaton [3]), and hence it is no longer admissible. Moreover, we adopt the Birnbaum-Stein method (Stein [15]) to demonstrate that the Rao -test is admissible for the hypothesis testing problem (1.2).
Consider the hypotheses
[TABLE]
where denotes a closed convex cone containing a p-dimensional open set. Denote the positive orthant space by {\cal O}_{p}^{+}=\{{\mbox{\boldmath\theta}}\in R^{p}|~{}{\mbox{\boldmath\theta}}\geq{\bf 0}\}. Notice that when is a proper set contained in a halfspace, under a suitable linear transformation the problem in (5.1) can be reduced to the problem for testing against the positive orthant space with another unknown positive definite covariance matrix. When is a specific halfspace, then it can be transformed into another halfspace by a non-singular linear transformation. Hence, without loss of generality it is sufficient to study the cases in which is the positive orthant space and is the halfspace {\cal H}_{p}^{*}=\{{\mbox{\boldmath\theta}}\in R^{p}|~{}{\theta}_{p}\geq 0\}. Note that the hypothesis testing problem (1.2) is a special case of the hypothesis testing problem (5.1). Therefore, we will further study whether the property of -admissibility of the UIT and LRT for the problem of testing against the closed convex cone holds in the near future.
Appendix
The following six lemmas are established to prove Theorem 1.
Lemma 1. * is convex on *.
Proof. For any two given matrices and , let \mbox{\boldmath\Psi}^{+}(\alpha)={\bf B}^{+}(\alpha{\bf S}+(1-\alpha){\bf T}), where . Then,
[TABLE]
Write
[TABLE]
and
[TABLE]
Since is p.s.d., thus
[TABLE]
Therefore,
[TABLE]
where denotes the largest eigenvalue of . Suppose that is p.d., is p.s.d. and write , then note that implies that . Thus, . Note that,
[TABLE]
where
[TABLE]
Next, compare the matrices
[TABLE]
and
[TABLE]
that is,
[TABLE]
and
[TABLE]
Consider the new matrix , where
[TABLE]
The matrix can be rewritten as {\bf L}={\bf P}\mbox{\boldmath\Lambda}{\bf Q}^{\prime}, where denotes the diagonal matrix of eigenvalues of and , , the group of orthogonal matrices. Thus, {\bf L}{\bf L}^{\prime}={\bf P}\mbox{\boldmath\Lambda}^{2}{\bf P}^{\prime}. Therefore, \lambda_{max}({\bf L}{\bf L}^{\prime})=\lambda_{max}({\bf P}\mbox{\boldmath\Lambda}^{2}{\bf P}^{\prime})=\lambda_{max}(\mbox{\boldmath\Lambda}^{2}{\bf P}^{\prime}{\bf P})=\lambda_{max}(\mbox{\boldmath\Lambda})^{2}=[\lambda_{max}(\mbox{\boldmath\Lambda})]^{2}=[\lambda_{max}({\bf L})]^{2}. Notably,
[TABLE]
the last inequality follows from the inequality (A.1). Thus, . Furthermore, notice that
[TABLE]
Thus,
[TABLE]
Therefore,
[TABLE]
and hence
[TABLE]
Lemma 2. * is the Moore-Penrose generalized inverse of *.
Proof. It can be easily shown that satisfies the following conditions:
- (i)
, 2. (ii)
, 3. (iii)
, 4. (iv)
{\bf B}^{+}({\bf S}){\bf B}({\bf S})=\big{(}{\bf B}^{+}({\bf S}){\bf B}({\bf S})\big{)}^{\prime}.
Lemma 3. For any two given matrices and , let \mbox{\boldmath\Psi}^{+}(\alpha)={\bf B}^{+}(\alpha{\bf S}+(1-\alpha){\bf T}), where . Then \frac{\mathop{d}^{2}\mbox{\boldmath\Psi}^{+}(\alpha)}{\mathop{d}\alpha^{2}}\succeq\left.\frac{\mathop{d}^{2}\mbox{\boldmath\Psi}^{+}(\alpha)}{\mathop{d}\alpha^{2}}\right|_{\alpha=\alpha_{0}}, , where is the stationary point of \mbox{\boldmath\Psi}^{+}(\alpha). Let \mbox{\boldmath\Psi}(\alpha) be the Moore-Penrose generalized inverse of \mbox{\boldmath\Psi}^{+}(\alpha), then \frac{\mathop{d}^{2}\mbox{\boldmath\Psi}(\alpha)}{\mathop{d}\alpha^{2}}\preceq\left.\frac{\mathop{d}^{2}\mbox{\boldmath\Psi}(\alpha)}{\mathop{d}\alpha^{2}}\right|_{\alpha=\alpha_{0}}, .
Proof. It is easily to see that \mbox{\boldmath\Psi}^{+}(\alpha) is continuous and hence differentiable, . First note that \mbox{\boldmath\Psi}^{+}(\alpha) is neither a linear function nor a quadratic function of . From the proof of Lemma 1, we note that \mbox{\boldmath\Psi}^{+}(\alpha) is not a monotone function of . By Lemma 1, \mbox{\boldmath\Psi}^{+}(\alpha) is convex in . Thus, the stationary point of \mbox{\boldmath\Psi}^{+}(\alpha) exists and unique. Suppose be the stationary (critical) point, then \mbox{\boldmath\Psi}^{+}(\alpha)\succeq\mbox{\boldmath\Psi}^{+}(\alpha_{0})\succeq{\bf 0}, . This implies that \mbox{\boldmath\Psi}^{+}(\alpha)\succeq(1-\alpha)^{2}\mbox{\boldmath\Psi}^{+}(\alpha_{0})=(\alpha^{2}-2\alpha+1)\mbox{\boldmath\Psi}^{+}(\alpha_{0}), . Note that \mbox{\boldmath\Psi}^{+}(\alpha_{0}) is p.s.d., thus there exists a quadratic convex function {\mbox{\boldmath\Upsilon}}^{+}(\alpha) such that \mbox{\boldmath\Psi}^{+}(\alpha)\succeq{\mbox{\boldmath\Upsilon}}^{+}(\alpha), . Hence for sufficiently small , there exists a quadratic convex function {\mbox{\boldmath\Upsilon}}_{0}^{+}(\alpha) such that \mbox{\boldmath\Psi}^{+}(\alpha)\succeq{\mbox{\boldmath\Upsilon}}_{0}^{+}(\alpha), , and \mbox{\boldmath\Psi}^{+}(\alpha_{0})={\mbox{\boldmath\Upsilon}}_{0}^{+}(\alpha_{0}), \mbox{\boldmath\Psi}^{+}(\alpha_{0}+h)={\mbox{\boldmath\Upsilon}}_{0}^{+}(\alpha_{0}+h) and \mbox{\boldmath\Psi}^{+}(\alpha_{0}+2h)={\mbox{\boldmath\Upsilon}}_{0}^{+}(\alpha_{0}+2h). Notably,
[TABLE]
For any fixed , there exists a quadratic convex function {\mbox{\boldmath\Upsilon}}_{\star}^{+}(\alpha) such that {\mbox{\boldmath\Upsilon}}_{\star}^{+}(\alpha)\succeq{\mbox{\boldmath\Upsilon}}_{0}^{+}(\alpha), and {\mbox{\boldmath\Upsilon}}_{\star}^{+}(\alpha)=\mbox{\boldmath\Psi}^{+}(\alpha), {\mbox{\boldmath\Upsilon}}_{\star}^{+}(\alpha+h)=\mbox{\boldmath\Psi}^{+}(\alpha+h) and {\mbox{\boldmath\Upsilon}}_{\star}^{+}(\alpha+2h)=\mbox{\boldmath\Psi}^{+}(\alpha+2h) for arbitrary small . Thus,
[TABLE]
Therefore,
[TABLE]
Similarly, \mbox{\boldmath\Psi}(\alpha) is the Moore-Penrose generalized inverse of \mbox{\boldmath\Psi}^{+}(\alpha), and so \mbox{\boldmath\Psi}(\alpha_{0})\succeq\mbox{\boldmath\Psi}(\alpha)\succeq{\bf 0}, . This implies that (1+2\alpha-\alpha^{2})\mbox{\boldmath\Psi}(\alpha_{0})\succeq\mbox{\boldmath\Psi}(\alpha), . Thus, there exists a quadratic concave function {\mbox{\boldmath\Upsilon}}(\alpha) such that {\mbox{\boldmath\Upsilon}}(\alpha)\succeq\mbox{\boldmath\Psi}(\alpha), . Parallel arguments as in the case \mbox{\boldmath\Psi}^{+}(\alpha), we may also conclude that
[TABLE]
Lemma 4. * is concave on *.
Proof. Rewrite in (2.2) as
[TABLE]
where
[TABLE]
Note that
\raisebox{-.3pt}{\normalsize1}⃝
\raisebox{-.3pt}{\normalsize2}⃝
Let \mbox{\boldmath\Psi}^{+}(\alpha)={\bf B}^{+}(\alpha{\bf S}+(1-\alpha){\bf T})={\bf M}{\bf N}^{-1}{\bf M}, where , and . Then, its Moore-Penrose generalized inverse is of the form
[TABLE]
Notably,
[TABLE]
Thus,
[TABLE]
and
[TABLE]
By the results of (A.2) and (A.3), then
[TABLE]
Since \mbox{\boldmath\Psi}^{+}(\alpha)\mbox{\boldmath\Psi}(\alpha)={\bf M}({\bf M}^{\prime}{\bf M})^{-1}{\bf M}^{\prime}, thus
[TABLE]
Therefore,
[TABLE]
Notably,
- (i)
[TABLE] 2. (ii)
[TABLE] 3. (iii)
[TABLE] 4. (iv)
[TABLE] 5. (v)
[TABLE] 6. (vi)
[TABLE]
[TABLE]
Thus, by the results (A.4) and (i)-(vi) and some straightforward manipulations,
[TABLE]
Also, note that
[TABLE]
Thus, the stationary point of \mbox{\boldmath\Psi}^{+}(\alpha) satisfies the following equation
[TABLE]
which implies that
[TABLE]
Namely,
[TABLE]
Thus,
[TABLE]
Furthermore, is convex on , thus \frac{\mathop{\mathrm{d}}^{2}\mbox{\boldmath\Psi}^{+}(\alpha)}{\mathop{\mathrm{d}}\alpha^{2}}\succeq{\bf 0}, that is,
[TABLE]
Substitute these results into (A.5), then
[TABLE]
Thus,
[TABLE]
By Lemma 3, then
[TABLE]
Therefore, is concave on .
Lemma 5. Let , , , be p.s.d. of rank (). Also let with elements being the non-zero eigenvalues of , where denotes the Moore-Penrose generalized inverse of . Then there exists a nonsingular matrix such that
[TABLE]
Proof. By Theorem A.4.1 of Anderson [1],
[TABLE]
where is a nonsingular matrix. Let , then is a p.s.d. with rank . Write
[TABLE]
and take
[TABLE]
where , such that . Thus, there exists a matrix , the group of orthogonal matrices such that
[TABLE]
that is,
[TABLE]
Let , thus
[TABLE]
and
[TABLE]
Lemma 6. Let be an p.s.d. matrix and be a vector. Let and denotes , where denotes the Moore-Penrose generalized inverse of . If is a generalized inverse of , then is convex on .
Proof. Since is continuous in , it suffices to show that
[TABLE]
Write and take
[TABLE]
By Lemma 5, {\bf A}_{1}={\bf G}\left[\begin{array}[]{cc}{\bf I}_{r}&{\bf 0}\\ {\bf 0}&{\bf 0}\end{array}\right]{\bf G}^{\prime} and {\bf A}_{2}={\bf G}\left[\begin{array}[]{cc}{\bf D}_{r}&{\bf 0}\\ {\bf 0}&{\bf 0}\end{array}\right]{\bf G}^{\prime}, where is nonsingular and with being the non-zero eigenvalues of . Write {\bf A}_{1}^{\frac{1}{2}}={\bf G}\left[\begin{array}[]{cc}{\bf I}_{r}&{\bf 0}\\ {\bf 0}&{\bf 0}\end{array}\right] and {\bf A}_{2}^{\frac{1}{2}}={\bf G}\left[\begin{array}[]{cc}{\bf D}_{r}&{\bf 0}\\ {\bf 0}&{\bf 0}\end{array}\right], then
[TABLE]
Notably,
[TABLE]
Moreover,
[TABLE]
and
[TABLE]
Thus,
[TABLE]
and hence the lemma follows.
Acknowledgments
The author is grateful to Professor C. R. Rao for his helpful comments. The work was partly supported by Grants from National Science Council of the Republic of China under Contract No. NSC 93-2118-M-001-027.
References
T.W. Anderson, An Introduction to Multivariate Statistical Analysis, 2nd edition. New York: Wiley, 1984. 2. 2.
A. Birnbaum, Characterization of complete classes of tests of some multiparametric hypotheses, with application to likelihood ratio tests, Ann. Math. Statist. 26 (1955) 21-36. 3. 3.
M.L. Eaton, A complete class theorem for multidimensional one sided alternatives, Ann. Math. Statist. 41 (1970) 1884-1888. 4. 4.
J. Kiefer, R. Schwartz, Admissible Bayes character of -, -, and other fully invariant tests for classical multivariate normal problems, Ann. Math. Statist. 36 (1965) 747-770. 5. 5.
E.L. Lehmann, Testing Statistical Hypotheses, 2nd edition. New York: Wiley, 1986. 6. 6.
J. Marden, M.D. Perlman, Invariant tests for means with covariates, Ann. Statist. 8 (1980) 25-63. 7. 7.
A.W. Marshall, I. Olkin, Inequalities: Theory of Majorization and Its Applications, New York: Academic Press, 1979. 8. 8.
J. Oosterhoff, Combination of One-sided Statistical Tests, Amsterdam: Mathematisch Centrum, 1969. 9. 9.
C.R. Rao, Tests with discriminant functions in multivariate analysis, Sankhy 7 (1946) 407-413. 10. 10.
C.R. Rao, On some problems arising out of discrimination with multiple characters, Sankhy 9 (1949) 343-366. 11. 11.
C.R. Rao, S.K. Mitra, Generalized Inverse of Matrices and its Applications, New York: Wiley, 1971. 12. 12.
S.N. Roy, On a heuristic method of test construction and its use in multivariate analysis, Ann. Math. Statist. 24 (1953) 220-238. 13. 13.
R. Schwartz, Admissible tests in multivariate analysis of variance, Ann. Math. Statist. 38 (1967) 698-710. 14. 14.
J.B. Simaika, On an optimum property of two important statistical tests, Biometrika 32 (1941) 70-80. 15. 15.
C. Stein, The admissibility of Hotelling’s -test, Ann. Math. Statist. 27 (1956) 616-623.
