Independence Properties of the Truncated Multivariate Elliptical   Distributions

Michael Levine; Donald Richards; and Jianxi Su

arXiv:1904.06412·math.ST·June 4, 2019

Independence Properties of the Truncated Multivariate Elliptical Distributions

Michael Levine, Donald Richards, and Jianxi Su

PDF

Open Access

TL;DR

This paper characterizes the independence properties of truncated multivariate elliptical distributions, especially the truncated multivariate normal, and applies these findings to test independence in educational data.

Contribution

It provides a novel characterization of independence in truncated multivariate elliptical distributions, extending known results for the normal case.

Findings

01

Mutual independence of sub-vectors implies the joint distribution is truncated multivariate normal.

02

The paper verifies regularity conditions for applying Wilks' theorem in a practical test.

03

Application to educational data demonstrates the usefulness of the independence criterion.

Abstract

Truncated multivariate distributions arise extensively in econometric modelling when non-negative random variables are intrinsic to the data-generation process. More broadly, truncated multivariate distributions have appeared in censored and truncated regression models, simultaneous equations modelling, multivariate regression, and applications going back to the now-classic papers of Amemiya (1974) and Heckman (1976). In some applications of truncated multivariate distributions, there arises the problem of characterizing the distribution through correlation and independence properties of sub-vectors. In this paper, we characterize the truncated multivariate normal random vectors for which two complementary sub-vectors are mutually independent. Further, we characterize the multivariate truncated elliptical distributions, proving that if two complementary sub-vectors are mutually…

Equations126

f (w; μ, Σ, c) = C exp [- \frac{1}{2} (w - μ)^{⊤} Σ^{- 1} (w - μ)], w \geq c,

f (w; μ, Σ, c) = C exp [- \frac{1}{2} (w - μ)^{⊤} Σ^{- 1} (w - μ)], w \geq c,

C^{- 1} = \int_{w \geq c} exp [- \frac{1}{2} (w - μ)^{⊤} Σ^{- 1} (w - μ)] d w .

C^{- 1} = \int_{w \geq c} exp [- \frac{1}{2} (w - μ)^{⊤} Σ^{- 1} (w - μ)] d w .

W = (W_{1} W_{2}), μ = (μ_{1} μ_{2}), c = (c_{1} c_{2}),

W = (W_{1} W_{2}), μ = (μ_{1} μ_{2}), c = (c_{1} c_{2}),

Σ = (Σ_{11} Σ_{21} Σ_{12} Σ_{22})

Σ = (Σ_{11} Σ_{21} Σ_{12} Σ_{22})

f(\boldsymbol{w};\boldsymbol{\mu},{\boldsymbol{\Sigma}},\boldsymbol{c})=g\big{(}(\boldsymbol{w}-\boldsymbol{\mu})^{\boldsymbol{\top}}{\boldsymbol{\Sigma}}^{-1}(\boldsymbol{w}-\boldsymbol{\mu})\big{)},\qquad\boldsymbol{w}\geq\boldsymbol{c},

f(\boldsymbol{w};\boldsymbol{\mu},{\boldsymbol{\Sigma}},\boldsymbol{c})=g\big{(}(\boldsymbol{w}-\boldsymbol{\mu})^{\boldsymbol{\top}}{\boldsymbol{\Sigma}}^{-1}(\boldsymbol{w}-\boldsymbol{\mu})\big{)},\qquad\boldsymbol{w}\geq\boldsymbol{c},

Σ = (1 ρ ρ 1),

Σ = (1 ρ ρ 1),

(X_{1} X_{2}) = d R Σ^{1/2} (U_{1} U_{2}) + μ,

(X_{1} X_{2}) = d R Σ^{1/2} (U_{1} U_{2}) + μ,

U_{1}^{*} = U_{1}, U_{2}^{*} = ρ U_{1} + (1 - ρ^{2})^{1/2} U_{2};

U_{1}^{*} = U_{1}, U_{2}^{*} = ρ U_{1} + (1 - ρ^{2})^{1/2} U_{2};

Cov (W_{1}, W_{2})

Cov (W_{1}, W_{2})

= E (R^{2}) E [U_{1}^{*} U_{2}^{*} ∣ U_{1}^{*} > 0, U_{2}^{*} > 0] - (E R)^{2} i = 1 \prod 2 E [U_{i}^{*} ∣ U_{1}^{*} > 0, U_{2}^{*} > 0] .

U_{1}^{*} = cos Ψ, U_{2}^{*} = ρ cos Ψ + (1 - ρ^{2})^{1/2} sin Ψ,

U_{1}^{*} = cos Ψ, U_{2}^{*} = ρ cos Ψ + (1 - ρ^{2})^{1/2} sin Ψ,

E [U_{1}^{*} U_{2}^{*}

E [U_{1}^{*} U_{2}^{*}

\displaystyle={\mathbb{E}}\big{[}\rho\cos^{2}\Psi+(1-\rho^{2})^{1/2}\sin\Psi\cos\Psi\,\big{|}\,\Psi\in(\psi^{*},\pi/2)\big{]}

= (\frac{1}{2} π - ψ^{*})^{- 1} [\frac{1}{4} ρ (π - sin 2 ψ^{*}) - \frac{1}{2} ρ ψ + \frac{1}{4} (1 - ρ^{2})^{1/2} (1 + cos 2 ψ^{*})] \equiv h_{1} (ρ) .

E [U_{1}^{*} ∣ U_{1}^{*} > 0, U_{2}^{*} > 0]

E [U_{1}^{*} ∣ U_{1}^{*} > 0, U_{2}^{*} > 0]

= (\frac{1}{2} π - ψ^{*})^{- 1} [1 - sin ψ^{*}] \equiv h_{2} (ρ),

E [U_{2}^{*} ∣ U_{1}^{*} > 0, U_{2}^{*} > 0]

E [U_{2}^{*} ∣ U_{1}^{*} > 0, U_{2}^{*} > 0]

= (\frac{1}{2} π - ψ^{*})^{- 1} [ρ (1 - sin ψ^{*}) + (1 - ρ^{2})^{1/2} cos ψ^{*}] \equiv h_{3} (ρ) .

Cov (W_{1}, W_{2}) = E [R^{2}] h_{1} (ρ) - [E R]^{2} h_{2} (ρ) h_{3} (ρ) .

Cov (W_{1}, W_{2}) = E [R^{2}] h_{1} (ρ) - [E R]^{2} h_{2} (ρ) h_{3} (ρ) .

\frac{E [ R ^{2} ]}{[ E R ] ^{2}} = \frac{4Γ ( τ /2 ) Γ (( τ - 2 ) /2 )}{π [ Γ (( τ - 1 ) /2 ) ] ^{2}},

\frac{E [ R ^{2} ]}{[ E R ] ^{2}} = \frac{4Γ ( τ /2 ) Γ (( τ - 2 ) /2 )}{π [ Γ (( τ - 1 ) /2 ) ] ^{2}},

lo g (Γ ((τ - 1) /2) < [lo g (Γ (τ /2)) + lo g (Γ ((τ - 2) /2)] /2,

lo g (Γ ((τ - 1) /2) < [lo g (Γ (τ /2)) + lo g (Γ ((τ - 2) /2)] /2,

h_{1} (- 1/ 2) = \frac{2 ( 4 - π )}{4 π}, h_{2} (- 1/ 2) = h_{3} (- 1/ 2) = \frac{4 ( 2 - 2 )}{2 π} .

h_{1} (- 1/ 2) = \frac{2 ( 4 - π )}{4 π}, h_{2} (- 1/ 2) = h_{3} (- 1/ 2) = \frac{4 ( 2 - 2 )}{2 π} .

b := \frac{E [ R ^{2} ]}{[ E R ] ^{2}} = \frac{h _{2} ( - 1/ 2 ) h _{3} ( - 1/ 2 )}{h _{1} ( - 1/ 2 )} = \frac{16 ( 3 2 - 4 )}{π ( 4 - π )} \approx 1.44,

b := \frac{E [ R ^{2} ]}{[ E R ] ^{2}} = \frac{h _{2} ( - 1/ 2 ) h _{3} ( - 1/ 2 )}{h _{1} ( - 1/ 2 )} = \frac{16 ( 3 2 - 4 )}{π ( 4 - π )} \approx 1.44,

{w \in R^{p} : w \geq c} \equiv {w_{1} \in R^{p_{1}}, w_{2} \in R^{p_{2}} : w_{1} \geq c_{1}, w_{2} \geq c_{2}}

{w \in R^{p} : w \geq c} \equiv {w_{1} \in R^{p_{1}}, w_{2} \in R^{p_{2}} : w_{1} \geq c_{1}, w_{2} \geq c_{2}}

(

(

= (w_{1} - μ_{1})^{⊤} Σ_{11}^{- 1} (w_{1} - μ_{1})

\displaystyle\quad+\big{(}\boldsymbol{w}_{2}-\boldsymbol{\mu}_{2}-{\boldsymbol{\Sigma}}_{21}{\boldsymbol{\Sigma}}_{11}^{-1}(\boldsymbol{w}_{1}-\boldsymbol{\mu}_{1})\big{)}^{\boldsymbol{\top}}{\boldsymbol{\Sigma}}_{22\cdot 1}^{-1}\big{(}\boldsymbol{w}_{2}-\boldsymbol{\mu}_{2}-{\boldsymbol{\Sigma}}_{21}{\boldsymbol{\Sigma}}_{11}^{-1}(\boldsymbol{w}_{1}-\boldsymbol{\mu}_{1})\big{)},

\int_{\boldsymbol{w}_{2}\geq\boldsymbol{0}}\exp\big{[}-\tfrac{1}{2}\big{(}\boldsymbol{w}_{2}-\boldsymbol{\mu}_{2}-{\boldsymbol{\Sigma}}_{21}{\boldsymbol{\Sigma}}_{11}^{-1}(\boldsymbol{w}_{1}-\boldsymbol{\mu}_{1})\big{)}^{\boldsymbol{\top}}\\ \times{\boldsymbol{\Sigma}}_{22\cdot 1}^{-1}\big{(}\boldsymbol{w}_{2}-\boldsymbol{\mu}_{2}-{\boldsymbol{\Sigma}}_{21}{\boldsymbol{\Sigma}}_{11}^{-1}(\boldsymbol{w}_{1}-\boldsymbol{\mu}_{1})\big{)}\big{]}\hskip 1.0pt{\rm{d}}\boldsymbol{w}_{2}.

\int_{\boldsymbol{w}_{2}\geq\boldsymbol{0}}\exp\big{[}-\tfrac{1}{2}\big{(}\boldsymbol{w}_{2}-\boldsymbol{\mu}_{2}-{\boldsymbol{\Sigma}}_{21}{\boldsymbol{\Sigma}}_{11}^{-1}(\boldsymbol{w}_{1}-\boldsymbol{\mu}_{1})\big{)}^{\boldsymbol{\top}}\\ \times{\boldsymbol{\Sigma}}_{22\cdot 1}^{-1}\big{(}\boldsymbol{w}_{2}-\boldsymbol{\mu}_{2}-{\boldsymbol{\Sigma}}_{21}{\boldsymbol{\Sigma}}_{11}^{-1}(\boldsymbol{w}_{1}-\boldsymbol{\mu}_{1})\big{)}\big{]}\hskip 1.0pt{\rm{d}}\boldsymbol{w}_{2}.

(2 π)^{p_{2} /2} (det Σ_{22 \cdot 1})^{1/2} P (V \geq 0),

(2 π)^{p_{2} /2} (det Σ_{22 \cdot 1})^{1/2} P (V \geq 0),

P (V \geq 0)

P (V \geq 0)

\displaystyle=P\big{(}\boldsymbol{V}_{0}\geq-\boldsymbol{\mu}_{2}-{\boldsymbol{\Sigma}}_{21}{\boldsymbol{\Sigma}}_{11}^{-1}(\boldsymbol{w}_{1}-\boldsymbol{\mu}_{1})\big{)}.

P (V \geq 0)

P (V \geq 0)

\displaystyle=P(\boldsymbol{V}_{0}\leq\boldsymbol{\mu}_{2}+{\boldsymbol{\Sigma}}_{21}{\boldsymbol{\Sigma}}_{11}^{-1}(\boldsymbol{w}_{1}-\boldsymbol{\mu}_{1})\big{)},

f_{{\boldsymbol{W}}_{1}}(\boldsymbol{w}_{1})=C\,(2\pi)^{p_{2}/2}\,(\det{\boldsymbol{\Sigma}}_{22\cdot 1})^{1/2}\exp\big{[}-\tfrac{1}{2}(\boldsymbol{w}_{1}-\boldsymbol{\mu}_{1})^{\boldsymbol{\top}}{\boldsymbol{\Sigma}}_{11}^{-1}(\boldsymbol{w}_{1}-\boldsymbol{\mu}_{1})\big{]}\\ \times\Phi_{p_{2}}\big{(}\boldsymbol{\mu}_{2}+{\boldsymbol{\Sigma}}_{21}{\boldsymbol{\Sigma}}_{11}^{-1}(\boldsymbol{w}_{1}-\boldsymbol{\mu}_{1}),{\boldsymbol{\Sigma}}_{22\cdot 1}\big{)},

f_{{\boldsymbol{W}}_{1}}(\boldsymbol{w}_{1})=C\,(2\pi)^{p_{2}/2}\,(\det{\boldsymbol{\Sigma}}_{22\cdot 1})^{1/2}\exp\big{[}-\tfrac{1}{2}(\boldsymbol{w}_{1}-\boldsymbol{\mu}_{1})^{\boldsymbol{\top}}{\boldsymbol{\Sigma}}_{11}^{-1}(\boldsymbol{w}_{1}-\boldsymbol{\mu}_{1})\big{]}\\ \times\Phi_{p_{2}}\big{(}\boldsymbol{\mu}_{2}+{\boldsymbol{\Sigma}}_{21}{\boldsymbol{\Sigma}}_{11}^{-1}(\boldsymbol{w}_{1}-\boldsymbol{\mu}_{1}),{\boldsymbol{\Sigma}}_{22\cdot 1}\big{)},

f_{W_{2} ∣ W_{1} = w_{1}} (w_{2})

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Distribution Estimation and Applications · Advanced Statistical Methods and Models · Statistical Methods and Bayesian Inference

Full text

Independence Properties of the Truncated Multivariate Elliptical Distributions

Michael Levine, Donald Richards, and Jianxi Su Department of Statistics, Purdue University, West Lafayette, IN 47907, U.S.A.Department of Statistics, Pennsylvania State University, University Park, PA 16802, U.S.A.Department of Statistics, Purdue University, West Lafayette, IN 47907, U.S.A. ∗Corresponding author; e-mail address: [email protected]

Abstract

Truncated multivariate distributions arise extensively in econometric modelling, when non-negative random variables are intrinsic to the data-generation process. and more broadly in censored and truncated regression models, simultaneous equations modelling, multivariate regression, and other areas. In some applications, there arises the problem of characterizing truncated multivariate distributions through correlation and independence properties of sub-vectors. In this paper, we characterize the truncated multivariate normal random vectors for which two complementary sub-vectors are mutually independent. Further, we characterize the multivariate truncated elliptical distributions, proving that if two complementary sub-vectors are mutually independent then the distribution of the joint vector is truncated multivariate normal, as is the distribution of each sub-vector. As an application, we apply the independence criterion to test the hypothesis of independence of the entrance examination scores and subsequent course averages achieved by a sample of university students; to do so, we verify the regularity conditions underpinning a classical theorem of Wilks on the asymptotic null distribution of the likelihood ratio test statistic.

Key words and phrases. Truncated elliptical distributions, multivariate normal distributions, correlation, independence

2010 Mathematics Subject Classification. Primary: 62H20, 60E05. Secondary: 62E10.

Running head: Truncated elliptical distributions.

1 Introduction

The truncated multivariate normal distributions are a family of distributions that have appeared in simultaneous equations modelling and multivariate regression [2], economics [11], econometric models for auction theory [14], and other areas. Consequently, there exists a wide literature on the properties of these distributions.

To define the truncated multivariate normal distributions, we recall the component-wise partial ordering on $p$ -dimensional Euclidean space, $\mathbb{R}^{p}$ : For column vectors $\boldsymbol{u}=(u_{1},\ldots,u_{p})^{\boldsymbol{\top}}$ and $\boldsymbol{v}=(v_{1},\ldots,v_{p})^{\boldsymbol{\top}}$ in $\mathbb{R}^{p}$ we write $\boldsymbol{u}\geq\boldsymbol{v}$ if $u_{j}\geq v_{j}$ for all $j=1,\ldots,p$ .

Let $\boldsymbol{\mu}\in\mathbb{R}^{p}$ and let ${\boldsymbol{\Sigma}}$ be a $p\times p$ positive definite matrix. For $\boldsymbol{c}\in\mathbb{R}^{p}$ , we say that the random vector ${\boldsymbol{W}}\in\mathbb{R}^{p}$ has a truncated multivariate normal distribution, with truncation point $\boldsymbol{c}$ , if the probability density function of ${\boldsymbol{W}}$ is

[TABLE]

where $C$ , the normalizing constant, is given by

[TABLE]

We write ${\boldsymbol{W}}\sim N_{p}(\boldsymbol{\mu},{\boldsymbol{\Sigma}},\boldsymbol{c})$ whenever ${\boldsymbol{W}}$ has the density function (1.1). Further, we denote the usual (untruncated) multivariate normal distribution by $N_{p}(\boldsymbol{\mu},{\boldsymbol{\Sigma}})$ .

Suppose that ${\boldsymbol{W}}$ , $\boldsymbol{\mu}$ , and $\boldsymbol{c}$ are partitioned into sub-vectors,

[TABLE]

where ${\boldsymbol{W}}_{j}$ , $\boldsymbol{\mu}_{j}$ , and $\boldsymbol{c}_{j}$ each are of dimension $p_{j}$ , $j=1,2$ , with $p_{1}+p_{2}=p$ . Further, we partition ${\boldsymbol{\Sigma}}$ so that

[TABLE]

where ${\boldsymbol{\Sigma}}_{jk}$ is of order $p_{j}\times p_{k}$ , for $j,k=1,2$ . In a study of the correlation and independence properties of sub-vectors of truncated distributions, we show in Section 2 that the uncorrelatedness of ${\boldsymbol{W}}_{1}$ and ${\boldsymbol{W}}_{2}$ cannot be characterized by the condition that ${\boldsymbol{\Sigma}}_{12}=\boldsymbol{0}$ . Going beyond the study of the correlation properties of ${\boldsymbol{W}}$ , we prove in Section 3 that the condition ${\boldsymbol{\Sigma}}_{12}=\boldsymbol{0}$ is necessary and sufficient for ${\boldsymbol{W}}_{1}$ and ${\boldsymbol{W}}_{2}$ to be mutually independent; in particular, no restrictions are required on $\boldsymbol{\mu}$ or $\boldsymbol{c}$ .

More general than the truncated multivariate normal distributions are their elliptical counterparts. For $\boldsymbol{c}$ and $\boldsymbol{\mu}$ in $\mathbb{R}^{p}$ , and a positive definite matrix ${\boldsymbol{\Sigma}}$ , a random vector ${\boldsymbol{W}}\in\mathbb{R}^{p}$ is said to have a truncated elliptical distribution, with truncation point $\boldsymbol{c}$ , if its probability density function is of the form

[TABLE]

for a non-constant generator $g:[0,\infty)\to[0,\infty)$ . We write $({\boldsymbol{W}}_{1},{\boldsymbol{W}}_{2})\sim E_{p}(\boldsymbol{\mu},\boldsymbol{\Sigma},g,\boldsymbol{c})$ with the untruncated counterpart being denoted by $E_{p}(\boldsymbol{\mu},\boldsymbol{\Sigma},g)$ . Examples of truncated elliptically contoured distributions are the truncated multivariate Student’s $t$ -distributions [12, 16]. We prove in Section 4 that if $({\boldsymbol{W}}_{1},{\boldsymbol{W}}_{2})\sim E_{p}(\boldsymbol{\mu},\boldsymbol{\Sigma},g,\boldsymbol{c})$ then, under certain regularity conditions on the generator $g$ , a necessary and sufficient condition that ${\boldsymbol{W}}_{1}$ and ${\boldsymbol{W}}_{2}$ be independent is that ${\boldsymbol{\Sigma}}_{12}=\boldsymbol{0}$ . Here again, no conditions are required on $\boldsymbol{\mu}$ or $\boldsymbol{c}$ ; moreover, we verify that the stated regularity conditions on $g$ are mild since they hold for many familiar elliptical distributions.

In Section 5, we consider for illustrative purposes an application of the criterion derived in Section 3 to testing the hypothesis of independence of the sub-vectors ${\boldsymbol{W}}_{1}$ and ${\boldsymbol{W}}_{2}$ . We obtain from the classical theorem of Wilks [13] the asymptotic null distribution of the likelihood ratio test statistic, and we provide an application to a data set given by Cohen [9] on the entrance examination scores and subsequent course averages achieved by a large sample of university students.

2 Correlation properties of truncated elliptical distributions

In this section we show, first, that the correlation structure of a multivariate elliptical distribution does not describe the correlation structure of its truncated version. More precisely, even if a particular multivariate elliptical distribution possesses an identity correlation matrix, this fact is not equivalent to the lack of correlation between components of the truncated version of that multivariate elliptical distribution.

We will demonstrate our claim using the bivariate case. Starting with elliptically distributed random variables $(X_{1},X_{2})^{\boldsymbol{\top}}\sim E_{2}(\boldsymbol{\mu},\boldsymbol{\Sigma},g)$ , set

[TABLE]

without loss of generality, where $|\rho|<1$ . Let $(W_{1},W_{2})=(X_{1},X_{2})|\{X_{1}\geq c_{1},X_{2}\geq c_{2}\}$ be the version of $(X_{1},X_{2})$ that is truncated at $\boldsymbol{c}=(c_{1},c_{2})^{\boldsymbol{\top}}$ . For simplicity, consider the case in which $\boldsymbol{c}=\boldsymbol{\mu}$ , so that $(W_{1},W_{2})^{\boldsymbol{\top}}\sim E_{2}(\boldsymbol{\mu},\boldsymbol{\Sigma},g,\boldsymbol{c}).$ We will now show that uncorrelatedness between $W_{1}$ and $W_{2}$ is not equivalent to $\rho=0$ .

At the outset, let us recall from [7] a stochastic representation for elliptically distributed random variables:

[TABLE]

where $(U_{1},U_{2})^{\boldsymbol{\top}}$ is distributed uniformly over the unit circle, and the generating random variable $R$ has the density function $f(r)=2\pi rg(r^{2}),$ $r>0$ . Define

[TABLE]

then, $\boldsymbol{\Sigma}^{1/2}\ (U_{1},U_{2})^{\boldsymbol{\top}}=(U_{1}^{*},U^{*}_{2})^{\boldsymbol{\top}}$ , and

[TABLE]

To calculate these conditional expectations, we transform $(U_{1}^{*},U_{2}^{*})$ to polar coordinates,

[TABLE]

where the random variable $\Psi$ is uniformly distributed on the interval $(-\pi,\pi)$ . Letting $\psi^{*}=\tan^{-1}(-(1-\rho^{2})^{-1/2}\rho)$ , we obtain

[TABLE]

Similarly,

[TABLE]

and

[TABLE]

In summary, we have obtained

[TABLE]

Note that $\rho=0$ implies $\psi^{*}=0$ . Hence, $h_{1}(0)=1/\pi$ and $h_{2}(0)=h_{3}(0)=2/\pi$ .

We remark that uncorrelatedness cannot be characterized for all elliptical truncated distributions through the condition $\rho=0$ . Consider, for instance, the truncated bivariate Student’s $t$ -distribution with degrees-of-freedom $\tau>0$ , where the associated generating variable $R$ has the density function that is proportional to $(1+\tau^{-1}r^{2})^{-(\tau+2)/2}$ , $r>0$ ; this density corresponds to the generalized beta distribution of the second kind [17]. It is straightforward to deduce that

[TABLE]

$\tau>2$ . Noting that the gamma function $\Gamma(\cdot)$ is strictly log-convex [4], we have

[TABLE]

$\tau>2$ , equivalently ${{\mathbb{E}}[R^{2}]}/{[{\mathbb{E}}\,R]^{2}}>4/\pi$ . By Equation (2.1), ${\mathrm{Cov}}(W_{1},W_{2})>0$ ; hence, for the truncated bivariate Student’s $t$ -distributions with truncation points equal to the means, the condition $\rho=0$ implies that $W_{1}$ and $W_{2}$ are positively correlated.

We remark that for the above example, uncorrelatedness holds in a limiting sense as $\tau\rightarrow\infty$ ; in that case, ${{\mathbb{E}}[R^{2}]}/{[{\mathbb{E}}\,R]^{2}}\rightarrow\pi/4$ and hence ${\mathrm{Cov}}(W_{1},W_{2})\rightarrow 0$ . This limiting case corresponds to the truncated bivariate normal distributions, which we treat in the next section.

On the other hand, for given $\rho\neq 0$ , we can apply Equation (2.1) to construct a plethora of truncated elliptical distributions that are uncorrelated. For the sake of illustration, suppose that $\rho=-1/\sqrt{2}$ ; then $\psi^{*}=\pi/4$ and

[TABLE]

Therefore, for any truncated elliptical distributions whose generating variable satisfies

[TABLE]

the variables $W_{1}$ and $W_{2}$ are uncorrelated. For example, if $R$ follows a gamma distribution with shape parameter $(b-1)^{-1}\approx 2.27$ and any positive scale parameter, then Equation (2.2) can be satisfied.

We have now shown that even in the bivariate case and for the special case in which the truncation vector $\boldsymbol{c}$ equals the mean $\boldsymbol{\mu}$ , the truncated elliptical distributions do not inherit the correlation property of the untruncated elliptical distributions. On the one hand, it is possible that $\rho=0$ can lead to positively correlated $W_{1}$ and $W_{2}$ , as we have seen from the example on the truncated Student’s $t$ -distributions. On the other hand, there exist elliptical distributions with $\rho<0$ such that the components of their truncated versions are uncorrelated.

3 The multivariate normal case

Throughout the rest of the paper, we denote by $\boldsymbol{0}$ any zero matrix or vector, irrespective of the dimension. In this section, we prove that the independence property of multivariate normal distributions can be carried over to their truncated counterparts.

Theorem 3.1.

Suppose that the random vector ${\boldsymbol{W}}\sim N_{p}(\boldsymbol{\mu},{\boldsymbol{\Sigma}},\boldsymbol{c})$ is decomposed as in (1.2). Then ${\boldsymbol{W}}_{1}$ and ${\boldsymbol{W}}_{2}$ are mutually independent if and only if ${\boldsymbol{\Sigma}}_{12}=\boldsymbol{0}$ .

We remark that this result was stated in [15, p. 214]. However, an inspection of the purported proof [15, p. 218] reveals that the ‘if’ part of the result solely was established, so the converse assertion has remained open. Unlike the classical untruncated normal distribution, the matrix ${\boldsymbol{\Sigma}}$ is not the covariance matrix of ${\boldsymbol{W}}$ , so it is surprising that the independence of ${\boldsymbol{W}}_{1}$ and ${\boldsymbol{W}}_{2}$ is characterized by the condition ${\boldsymbol{\Sigma}}_{12}=\boldsymbol{0}$ .

Proof of Theorem 3.1. First, we note that

[TABLE]

Now suppose that ${\boldsymbol{\Sigma}}_{12}=\boldsymbol{0}$ . Then it is evident from (1.1), (1.3), and (3.1) that the density of ${\boldsymbol{W}}$ reduces to a product of two terms corresponding to the distributions $N_{p_{1}}(\boldsymbol{\mu}_{1},{\boldsymbol{\Sigma}}_{11},\boldsymbol{c}_{1})$ and $N_{p_{2}}(\boldsymbol{\mu}_{2},{\boldsymbol{\Sigma}}_{22},\boldsymbol{c}_{2})$ . Consequently, ${\boldsymbol{W}}_{1}$ and ${\boldsymbol{W}}_{2}$ are mutually independent, and ${\boldsymbol{W}}_{j}\sim N_{p_{j}}(\boldsymbol{\mu}_{j},{\boldsymbol{\Sigma}}_{jj},\boldsymbol{c}_{j})$ , $j=1,2$ .

Conversely, suppose that ${\boldsymbol{W}}_{1}$ and ${\boldsymbol{W}}_{2}$ are mutually independent. For ${\boldsymbol{W}}\sim N_{p}(\boldsymbol{\mu},{\boldsymbol{\Sigma}},\boldsymbol{c})$ , it is evident that ${\boldsymbol{W}}-\boldsymbol{c}\sim N_{p}(\boldsymbol{\mu}-\boldsymbol{c},{\boldsymbol{\Sigma}},\boldsymbol{0})$ . Since ${\boldsymbol{W}}_{1}$ and ${\boldsymbol{W}}_{2}$ are mutually independent if and only if ${\boldsymbol{W}}_{1}-\boldsymbol{c}_{1}$ and ${\boldsymbol{W}}_{2}-\boldsymbol{c}_{2}$ are mutually independent then we can assume, with no loss of generality, that $\boldsymbol{c}=\boldsymbol{0}$ .

Thus, for ${\boldsymbol{W}}\sim N_{p}(\boldsymbol{\mu},{\boldsymbol{\Sigma}},\boldsymbol{0})$ , suppose that ${\boldsymbol{W}}_{1}$ is independent of ${\boldsymbol{W}}_{2}$ . By a well-known quadratic form decomposition (Anderson [3, p. 638]), we have

[TABLE]

where ${\boldsymbol{\Sigma}}_{22\cdot 1}={\boldsymbol{\Sigma}}_{22}-{\boldsymbol{\Sigma}}_{21}{\boldsymbol{\Sigma}}_{11}^{-1}{\boldsymbol{\Sigma}}_{12}$ . Applying this decomposition to the density function (1.1), we find that in order to calculate the marginal density of ${\boldsymbol{W}}_{1}$ it is necessary to consider the integral

[TABLE]

For fixed $\boldsymbol{w}_{1}$ , suppose that $\boldsymbol{V}$ is a $p_{2}$ -dimensional multivariate normal random vector with $\boldsymbol{V}\sim N_{p_{2}}\big{(}\boldsymbol{\mu}_{2}+{\boldsymbol{\Sigma}}_{21}{\boldsymbol{\Sigma}}_{11}^{-1}(\boldsymbol{w}_{1}-\boldsymbol{\mu}_{1}),{\boldsymbol{\Sigma}}_{22\cdot 1}\big{)}$ . Then the integral (3.3) equals

[TABLE]

Let $\boldsymbol{V}_{0}=\boldsymbol{V}-\boldsymbol{\mu}_{2}-{\boldsymbol{\Sigma}}_{21}{\boldsymbol{\Sigma}}_{11}^{-1}(\boldsymbol{w}_{1}-\boldsymbol{\mu}_{1})\sim N_{p_{2}}(\boldsymbol{0},{\boldsymbol{\Sigma}}_{22\cdot 1})$ ; then,

[TABLE]

Since $\boldsymbol{V}_{0}$ has the same distribution as $-\boldsymbol{V}_{0}$ then it follows that

[TABLE]

and we denote this probability by $\Phi_{p_{2}}\big{(}\boldsymbol{\mu}_{2}+{\boldsymbol{\Sigma}}_{21}{\boldsymbol{\Sigma}}_{11}^{-1}(\boldsymbol{w}_{1}-\boldsymbol{\mu}_{1}),{\boldsymbol{\Sigma}}_{22\cdot 1}\big{)}$ .

Therefore, the marginal density function of ${\boldsymbol{W}}_{1}$ is

[TABLE]

$\boldsymbol{w}_{1}\geq\boldsymbol{0}$ . It now follows from (1.1), (3.4), and the quadratic form decomposition (3.2), that the conditional density function of ${\boldsymbol{W}}_{2}$ , given ${\boldsymbol{W}}_{1}=\boldsymbol{w}_{1}$ , is

[TABLE]

$\boldsymbol{w}_{1}\geq\boldsymbol{0}$ , $\boldsymbol{w}_{2}\geq\boldsymbol{0}$ .

Since ${\boldsymbol{W}}_{1}$ is independent of ${\boldsymbol{W}}_{2}$ then $f_{{\boldsymbol{W}}_{2}|{\boldsymbol{W}}_{1}=\boldsymbol{w}_{1}}$ is constant in $\boldsymbol{w}_{1}$ . Therefore,

[TABLE]

$\boldsymbol{w}_{1}\geq\boldsymbol{0}$ , $\boldsymbol{w}_{2}\geq\boldsymbol{0}$ , so we obtain

[TABLE]

$\boldsymbol{w}_{1}\geq\boldsymbol{0}$ , $\boldsymbol{w}_{2}\geq\boldsymbol{0}$ . Cancelling common terms, we obtain

[TABLE]

Note that the left-hand side contains no term in $\boldsymbol{w}_{2}$ , whereas the right-hand side does. Therefore, for all $\boldsymbol{w}_{1}$ , the coefficient of $\boldsymbol{w}_{2}$ on the right-hand side necessarily is the zero vector; this can be proved by taking the logarithm of both sides and then calculating the gradient with respect to $\boldsymbol{w}_{2}$ .

Hence, $\boldsymbol{w}_{1}^{\boldsymbol{\top}}{\boldsymbol{\Sigma}}_{11}^{-1}{\boldsymbol{\Sigma}}_{12}{\boldsymbol{\Sigma}}_{22\cdot 1}^{-1}\equiv\boldsymbol{0}$ . Since this holds for all $\boldsymbol{w}_{1}\geq\boldsymbol{0}$ then we obtain ${\boldsymbol{\Sigma}}_{11}^{-1}{\boldsymbol{\Sigma}}_{12}{\boldsymbol{\Sigma}}_{22\cdot 1}^{-1}=\boldsymbol{0}$ . As ${\boldsymbol{\Sigma}}_{11}$ and ${\boldsymbol{\Sigma}}_{22\cdot 1}$ are non-singular, it follows that ${\boldsymbol{\Sigma}}_{12}=\boldsymbol{0}$ . $\quad\qed$

Remark 3.2.

We remark that since the condition ${\boldsymbol{\Sigma}}_{12}=\boldsymbol{0}$ , which is necessary and sufficient for ${\boldsymbol{W}}_{1}$ and ${\boldsymbol{W}}_{2}$ to be mutually independent, requires no restrictions on $\boldsymbol{c}$ , then the same result holds if we let $\boldsymbol{c}_{2}\to-\infty$ . Consequently, Theorem 3.1 remains valid if ${\boldsymbol{W}}_{1}$ is truncated and ${\boldsymbol{W}}_{2}$ is untruncated.

4 The elliptical case

In the elliptical case, as in the normal case, we may assume with no loss of generality, that the truncation point is $\boldsymbol{c}=\boldsymbol{0}$ . Suppose that ${\boldsymbol{W}}=({\boldsymbol{W}}_{1},{\boldsymbol{W}}_{2})$ has a truncated elliptical distribution with density function (1.4). Let

[TABLE]

so the joint p.d.f. of ${\boldsymbol{W}}$ is $g(Q(\boldsymbol{w}_{1},\boldsymbol{w}_{2}))$ . In characterizing the distribution of ${\boldsymbol{W}}$ through the independence of ${\boldsymbol{W}}_{1}$ and ${\boldsymbol{W}}_{2}$ , we will require the following regularity conditions on the generator $g$ :

(R1)

$g(t)>0$ for all $t\geq 0$ , $g$ is everywhere differentiable on $(0,\infty)$ , and its derivative $g^{\prime}$ is continuous. 2. (R2)

The support of $g^{\prime}$ , i.e., ${\rm supp}(g^{\prime})=\{t>0:g^{\prime}(t)\neq 0\}$ , is dense in $(0,\infty)$ . 3. (R3)

As $t\rightarrow\infty$ , $\hskip 1.0pt{\rm{d}}(\log g(t^{2}))/\hskip 1.0pt{\rm{d}}t$ either tends to zero or diverges.

We remark that these conditions appear to be mild as almost all of the commonly-used elliptical density functions that are described in [7, Chapter 3] satisfy ((R1))-((R3)), an exception being the Kotz distribution with power parameter in the exponential term equal to $1/2.$

Now we establish as a consequence of Theorem 3.1 a result that, under the regularity conditions ((R1))-((R3)), a truncated multivariate elliptical distribution whose component vectors are independent can only be a truncated multivariate normal distribution.

Corollary 4.1.

Suppose that the generator $g$ satisfies the regularity conditions ((R1))-((R3)). Then ${\boldsymbol{W}}_{1}$ and ${\boldsymbol{W}}_{2}$ are independent if and only if ${\boldsymbol{W}}$ has a truncated multivariate normal distribution with ${\boldsymbol{\Sigma}}_{12}=\boldsymbol{0}$ .

Proof.

If ${\boldsymbol{W}}$ has a truncated multivariate normal distribution with ${\boldsymbol{\Sigma}}_{12}=\boldsymbol{0}$ then we have seen before that ${\boldsymbol{W}}_{1}$ and ${\boldsymbol{W}}_{2}$ are mutually independent, so we need only show the converse.

By integration, we obtain the marginal density function of ${\boldsymbol{W}}_{2}$ as

[TABLE]

and then the conditional density of ${\boldsymbol{W}}_{1}$ , given ${\boldsymbol{W}}_{2}=\boldsymbol{w}_{2}$ , is

[TABLE]

Note that ${\boldsymbol{W}}_{1}$ and ${\boldsymbol{W}}_{2}$ are independent if and only if the conditional density function, (4.1), of ${\boldsymbol{W}}_{1}$ , given ${\boldsymbol{W}}_{2}=\boldsymbol{w}_{2}$ , is constant in $\boldsymbol{w}_{2}$ . By taking logarithms in (4.1) and then applying the gradient operator $\nabla_{\boldsymbol{w}_{2}}=(\partial/\partial w_{p_{1}+1},\ldots,\partial/\partial w_{p})^{\boldsymbol{\top}}$ , we find that a necessary and sufficient condition for ${\boldsymbol{W}}_{1}$ and ${\boldsymbol{W}}_{2}$ to be independent is that

[TABLE]

for all $\boldsymbol{w}_{1}\geq\boldsymbol{0}$ , $\boldsymbol{w}_{2}\geq\boldsymbol{0}$ . By (3.2),

[TABLE]

substituting this result in (4.2), we find that a necessary and sufficient condition for independence is

[TABLE]

$\boldsymbol{w}_{1}\geq\boldsymbol{0}$ , $\boldsymbol{w}_{2}\geq\boldsymbol{0}$ . Cancelling ${\boldsymbol{\Sigma}}_{22\cdot 1}^{-1}$ on both sides of the latter equation, we obtain

[TABLE]

$\boldsymbol{w}_{1}\geq\boldsymbol{0}$ , $\boldsymbol{w}_{2}\geq\boldsymbol{0}$ .

Let $\boldsymbol{\eta}\geq\boldsymbol{0}$ be such that $\boldsymbol{\eta}\neq\boldsymbol{\mu}_{2}-{\boldsymbol{\Sigma}}_{21}{\boldsymbol{\Sigma}}_{11}^{-1}\boldsymbol{\mu}_{1}$ . Evaluating both sides of (4.3) at $\boldsymbol{w}_{2}=\boldsymbol{\eta}$ , we obtain

[TABLE]

equivalently,

[TABLE]

for all $\boldsymbol{w}_{1}\geq\boldsymbol{0}$ , where $\boldsymbol{c}_{2}=\boldsymbol{\mu}_{2}-\boldsymbol{\eta}-{\boldsymbol{\Sigma}}_{21}{\boldsymbol{\Sigma}}^{-1}_{11}\boldsymbol{\mu}_{1}$ and $\boldsymbol{c}_{1}$ is a $p_{2}\times 1$ constant vector.

We also have $\|\boldsymbol{c}_{1}\|<\infty$ ; otherwise, the left-hand side of (4.4) is infinite for all $\boldsymbol{w}_{1}\geq\boldsymbol{0}$ , and then it follows that $|g^{\prime}(Q(\boldsymbol{w}_{1},\boldsymbol{\eta}))|$ is infinite for all $\boldsymbol{w}_{1}\geq\boldsymbol{0}$ . This implies that $g$ is unbounded everywhere, which is not possible since $g$ generates a density function.

Suppose that $\boldsymbol{c}_{1}=\boldsymbol{0}$ ; then, by (4.5), $g^{\prime}(Q(\boldsymbol{w}_{1},\boldsymbol{\eta}))=0$ or ${\boldsymbol{\Sigma}}_{21}{\boldsymbol{\Sigma}}^{-1}_{11}\boldsymbol{w}_{1}+\boldsymbol{c}_{2}=\boldsymbol{0}$ for all $\boldsymbol{w}_{1}\geq\boldsymbol{0}$ . If $g^{\prime}(Q(\boldsymbol{w}_{1},\boldsymbol{\eta}))=0$ for all $\boldsymbol{w}_{1}\geq\boldsymbol{0}$ then it follows that $g$ is a constant function; however, by ((R2)), the support of $g^{\prime}$ is dense, therefore $g$ cannot generate a density. Also, by construction, $\boldsymbol{c}_{2}\neq\boldsymbol{0}$ , so ${\boldsymbol{\Sigma}}_{21}{\boldsymbol{\Sigma}}^{-1}_{11}\boldsymbol{w}_{1}+\boldsymbol{c}_{2}\neq\boldsymbol{0}$ for all $\boldsymbol{w}_{1}\geq\boldsymbol{0}$ . Therefore, we have shown by contradiction that $\boldsymbol{c}_{1}\neq\boldsymbol{0}$ .

Now suppose that ${\boldsymbol{\Sigma}}_{12}\neq\boldsymbol{0}$ . Since ${\boldsymbol{\Sigma}}_{11}$ is positive definite then ${\boldsymbol{\Sigma}}_{11}^{-1}{\boldsymbol{\Sigma}}_{12}{\boldsymbol{\Sigma}}_{21}{\boldsymbol{\Sigma}}_{11}^{-1}$ is positive semidefinite and has the same rank as ${\boldsymbol{\Sigma}}_{12}$ . Since ${\boldsymbol{\Sigma}}_{12}\neq\boldsymbol{0}$ then that rank is at least $1$ , so at least one diagonal entry of ${\boldsymbol{\Sigma}}_{11}^{-1}{\boldsymbol{\Sigma}}_{12}{\boldsymbol{\Sigma}}_{21}{\boldsymbol{\Sigma}}_{11}^{-1}$ is positive; without loss of generality, we assume that the first diagonal entry, $({\boldsymbol{\Sigma}}_{11}^{-1}{\boldsymbol{\Sigma}}_{12}{\boldsymbol{\Sigma}}_{21}{\boldsymbol{\Sigma}}_{11}^{-1})_{11}$ , is positive. Letting $\boldsymbol{e}_{1}=(1,0,\ldots,0)^{\boldsymbol{\top}}$ , we obtain

[TABLE]

consequently, $\|v{\boldsymbol{\Sigma}}_{21}{\boldsymbol{\Sigma}}_{11}^{-1}\,\boldsymbol{e}_{1}\|\to\infty$ as $v\to\infty$ .

By (3.2),

[TABLE]

therefore, as $v\to\infty$ , we obtain $Q(v\boldsymbol{e}_{1},\boldsymbol{\eta})\sim\kappa^{2}v^{2}$ where

[TABLE]

the $(1,1)$ th entry of ${\boldsymbol{\Sigma}}^{-1}$ . Letting $v\to\infty$ in (4.5), we obtain

[TABLE]

By the regularity condition ((R3)), $vg^{\prime}(v^{2})/g(v^{2})$ tends to zero or diverges as $v\to\infty$ . If $vg^{\prime}(v^{2})/g(v^{2})\to 0$ then the right-hand side of Equation (4) tends to zero as $v\to\infty$ , so we obtain $\boldsymbol{c}_{1}=\boldsymbol{0}$ , which contradicts the fact that $\boldsymbol{c}_{1}\neq\boldsymbol{0}$ . On the other hand, if $vg^{\prime}(v^{2})/g(v^{2})$ diverges as $v\to\infty$ , then the right-hand side of Equation (4) diverges, which contradicts the fact that $\|\boldsymbol{c}_{1}\|<\infty$ . Since the assumption that ${\boldsymbol{\Sigma}}_{12}\neq\boldsymbol{0}$ leads in either case to a contradiction then it follows that ${\boldsymbol{\Sigma}}_{12}=\boldsymbol{0}$ .

Since ${\boldsymbol{\Sigma}}_{12}=\boldsymbol{0}$ then Equation (4.5) reduces to

[TABLE]

equivalently, $g^{\prime}(t)=c_{3}g(t)$ , hence $g(t)=c_{3}\exp(-c_{4}t)$ , for some constants $c_{3}$ and $c_{4}$ . Therefore, ${\boldsymbol{W}}$ has a truncated multivariate normal distribution with ${\boldsymbol{\Sigma}}_{12}=\boldsymbol{0}$ . ∎

5 Testing the independence of the components of a truncated multivariate normal vector

As an application of our results, we perform a likelihood ratio test for independence between $W_{1}$ and $W_{2}$ , the components of a bivariate truncated normal random vector. For $j,k=1,2$ , denote by $\sigma_{jk}$ the $(j,k)$ th element of ${\boldsymbol{\Sigma}}$ ; let $\sigma_{j}=\sigma_{jj}^{1/2}$ ; and set $\rho={\sigma_{12}}/(\sigma_{1}\sigma_{2})$ . By Theorem 3.1, testing for independence between $W_{1}$ and $W_{2}$ is equivalent to testing the null hypothesis, $H_{0}:\rho=0$ , vs. the alternative hypothesis, $H_{a}:\rho\neq 0$ . For illustrative purposes, we apply the test to a data set, considered by Cohen [9, p. 192], consisting of the entrance examination scores, $W_{1}$ , and subsequent course averages, $W_{2}$ , achieved by $n=529$ university students. The data are viewed as generated randomly from a bivariate truncated normal distribution, with the cutoff value for $W_{1}$ being $159.5$ , the minimum qualifying score on the entrance examination, and with the cutoff value for $W_{2}$ being $c_{2}=0$ since all course averages are nonnegative, respectively. With these constraints, $n=517$ students were admitted.

Corresponding to $H_{a}$ , we denote the unrestricted (or alternative) parameter space by ${\boldsymbol{\Theta}}=\{{\boldsymbol{\theta}}=(\mu_{1},\mu_{2},\sigma_{1},\sigma_{2},\rho)^{\prime}:\mu_{1}\in\mathbb{R},\mu_{2}\in\mathbb{R},\sigma_{1}>0,\sigma_{2}>0,-1<\rho<1\}$ ; wherever necessary, we may also denote the respective individual components of ${\boldsymbol{\theta}}$ by $\theta_{i}$ , $i=1,\ldots,5$ . The restricted (or null) parameter space, as determined by $H_{0}$ , then is ${\boldsymbol{\Theta}}_{0}=\{\mu_{1},\mu_{2}\in\mathbb{R},\sigma_{1},\sigma_{2}>0,\rho=0\}$ , and the likelihood ratio test statistic for testing $H_{0}$ vs. $H_{a}$ is

[TABLE]

For $\boldsymbol{w}=(w_{1},w_{2})$ , we write the joint probability density function of ${\boldsymbol{W}}$ in the form

[TABLE]

$w_{1}\geq c_{1}$ , $w_{2}\geq c_{2}$ , where

[TABLE]

and

[TABLE]

is the normalizing constant.

For a random sample $(W_{1,1},W_{2,1})^{\prime},\ldots,(W_{1,n},W_{2,n})^{\prime}$ from the distribution (5.1), the corresponding log-likelihood function can be written, up to additive constants that do not depend on the parameter ${\boldsymbol{\theta}}$ , as

[TABLE]

The asymptotic null distribution of the likelihood ratio statistic is derived from a classical theorem of Wilks [18] (cf., Casella and Berger [8, pp. 489, 516], Hogg, et al. [13, p. 361]). First, we verify that the regularity conditions underlying Wilks’ theorem are valid for the truncated normal distribution:

The density $f(\boldsymbol{w};{\boldsymbol{\theta}})$ is identifiable, i.e., if $f(\boldsymbol{w};{\boldsymbol{\theta}}_{1})=f(\boldsymbol{w};{\boldsymbol{\theta}}_{2})$ for all $\boldsymbol{w}\geq\boldsymbol{c}$ then ${\boldsymbol{\theta}}_{1}={\boldsymbol{\theta}}_{2}$ : To prove this result, note that

[TABLE]

then it follows from (5.1) that the truncated bivariate normal distribution is an exponential family with natural (or canonical) sufficient statistic

[TABLE]

and corresponding canonical parameter vector

[TABLE]

It is now evident that the components of the natural sufficient statistic and of the canonical parameter vector are linearly independent over $\mathbb{R}^{5}$ . Further, the exponential family is minimal, meaning that it is five-dimensional and cannot be reduced to a lower-dimensional model. Consequently, by Barndorff-Nielsen [5, pp. 112–113, Lemma 8.1 and Corollary 8.2], the model (5.1) is identifiable.

2.

The support of the distribution remains the same for all values of ${\boldsymbol{\theta}}$ : This condition is clearly satisfied since the density $f(\boldsymbol{w};{\boldsymbol{\theta}})$ has support $(c_{1},\infty)\times(c_{2},\infty)$ , which does not depend on ${\boldsymbol{\theta}}$ .

3.

There exists an open subset ${\boldsymbol{\Omega}}_{0}\subset{\boldsymbol{\Theta}}$ such that the “true value” of the parameter ${\boldsymbol{\theta}}$ is in ${\boldsymbol{\Omega}}_{0}$ , and all third-order partial derivatives of $f(\boldsymbol{w};{\boldsymbol{\theta}})$ with respect to $\boldsymbol{w}$ exist for all ${\boldsymbol{\theta}}\in{\boldsymbol{\Omega}}_{0}$ : This condition is satisfied since ${\boldsymbol{\Theta}}$ is an open subset of $\mathbb{R}^{5}$ and we can construct ${\boldsymbol{\Omega}}_{0}$ consisting of the union of sufficiently small open univariate balls around the true value of each of the parameters $\theta_{1},\ldots,\theta_{5}$ . Further, the differentiability property follows from (5.1).

4.

The integral $\int f(\boldsymbol{w}|{\boldsymbol{\theta}})\hskip 1.0pt{\rm{d}}\boldsymbol{w}$ is twice-differentiable with respect to ${\boldsymbol{\theta}}$ : According to the usual Leibniz rule, partial derivatives and integrals may be interchanged whenever the same derivatives of the density function $f(\boldsymbol{w}|{\boldsymbol{\theta}})$ are continuous and integrable for all ${\boldsymbol{\theta}}\in{\boldsymbol{\Theta}}$ and all $w_{1}\geq c_{1}$ , $w_{2}\geq c_{2}$ ; see Burkill and Burkill [6, p. 289, Theorem 8.72]. In the case of (5.1), the conditions for Leibniz’ rule follows from the finiteness of the moments of any positive order for that distribution. We also note that Barndorff-Nielsen [5, p. 114, Theorem 8.1] shows that differentiation with respect to $\theta$ , to any order, of the integral is allowed under the integral sign.

5.

The information matrix $I({\boldsymbol{\theta}})$ of the density function $f(\boldsymbol{w};{\boldsymbol{\theta}})$ is positive definite: As shown earlier, $f(\boldsymbol{w};{\boldsymbol{\theta}})$ is a non-curved minimal exponential model. By a well-known result for exponential families [5, Section 9.3], the covariance matrix of $\boldsymbol{V}$ , denoted by ${\mathrm{Cov}}(\boldsymbol{V})$ , is a full-rank matrix and therefore is positive definite. Since the information matrix is

[TABLE]

then it follows that $I({\boldsymbol{\theta}})$ also is of full rank.

6.

All third-order partial derivatives of $\log f(\boldsymbol{w};{\boldsymbol{\theta}})$ are bounded by functions of $\boldsymbol{w}$ that have finite expectations: By straightforward differentiation with respect to $\theta_{j}$ , $\theta_{k}$ , and $\theta_{l}$ , $1\leq j,k,l\leq 5$ , we obtain

[TABLE]

where $P_{jkl}(\boldsymbol{w})$ is a polynomial in $w_{1},w_{2}$ . Since all polynomial moments of the truncated bivariate normal distribution are finite then $\mathbb{E}\,P_{jkl}({\boldsymbol{W}})<\infty$ for all $j,k,l$ .

Having shown that the regularity conditions underpinning Wilks’ theorem are satisfied in our setting, we deduce that, under $H_{0}$ , $-2\log\Lambda\rightarrow\chi^{2}_{1}$ in distribution as $n\rightarrow\infty$ .

To apply to the data of Cohen [9, loc. cit.] the likelihood ratio statistic for testing $H_{0}$ vs. $H_{a}$ , we calculated the maximum likelihood estimates of the parameters of the bivariate truncated normal distribution using the R package of Wilhelm and Manjunath [19]; alternatively, the calculations can be done using the procedures described by Cohen [9, pp. 186–190]. We obtained from the R package the estimates,

[TABLE]

The resulting observed value of the test statistic $-2\log\Lambda$ was $84.905$ , and the corresponding P-value was found to be approximately $3.130423*10^{-20}$ . Consequently, the null hypothesis $H_{0}$ is rejected at any practical level of significance.

6 Conclusions

We have shown that the mutual independence of the components of a multivariate truncated elliptical distribution is equivalent to ${\boldsymbol{\Sigma}}_{12}=\boldsymbol{0}$ subject to additional regularity conditions on the generator function $g$ . If these regularity conditions are satisfied then they imply that the underlying distribution is the truncated multivariate normal distribution. These results suggest two problems for future research. The first problem concerns the existence of multivariate truncated elliptical distributions, other than the truncated normal, for which ${\boldsymbol{\Sigma}}_{12}=\boldsymbol{0}$ is equivalent to independence of its components. This problem leads naturally to a search for regularity conditions weaker than the ones that we have used in Corollary 4.1. The second direction is to characterize the property of uncorrelatedness for the multivariate truncated elliptical distributions; explicitly, the goal will be to obtain explicit criteria, in terms of the correlation matrix of the underlying multivariate elliptical distribution and its generator function, that are equivalent to zero correlation between components of its truncated analogs, ${\boldsymbol{W}}_{1}$ and ${\boldsymbol{W}}_{2}$ . We plan to study both of these directions in future research.

Bibliography19

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1]
2[2] Amemiya, T. (1974). Multivariate regression and simultaneous equations models when the dependent variables are truncated normal. Econometrica , 42 , 999-1012.
3[3] Anderson, T. W. (2003). An Introduction to Multivariate Statistical Analysis , third edition. Wiley, New York, NY.
4[4] Artin, E. (1964). The Gamma Function . Holt-Rinehart-Winston, New York, NY.
5[5] Barndorff-Nielsen, O. E. (2014). Information and Exponential Families in Statistical Theory , second printing. Wiley, Chichester.
6[6] Burkill, J. C., and Burkill, H. (2002). A Second Course in Mathematical Analysis . Cambridge University Press, New York, NY.
7[7] Fang, K.-T., Kotz, S., and Ng, K. W. (1990). Symmetric Multivariate and Related Distributions , Chapman & Hall, New York, NY.
8[8] Casella, G., and Berger, R. L. (2002). Statistical Inference , second edition, Duxbury Press, Pacific Grove, CA.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Independence Properties of the Truncated Multivariate Elliptical Distributions

Abstract

1 Introduction

2 Correlation properties of truncated elliptical distributions

3 The multivariate normal case

Theorem 3.1**.**

Remark 3.2**.**

4 The elliptical case

Corollary 4.1**.**

Proof.

5 Testing the independence of the components of a truncated multivariate normal vector

6 Conclusions

Theorem 3.1.

Remark 3.2.

Corollary 4.1.