On Minimax Detection of Gaussian Stochastic Sequences with Imprecisely Known Means and Covariance Matrices

Marat V. Burnashev

arXiv:2302.13254·cs.IT·February 28, 2023


TL;DR

This paper investigates the minimax detection of Gaussian sequences with uncertain means and covariances, identifying conditions under which composite hypotheses can be simplified without loss of detection performance.

Contribution

It characterizes the maximal set of means and covariance matrices allowing composite hypothesis testing to be replaced by simple hypothesis testing without affecting the detection exponent.

Findings

1. Complete description of the maximal set of parameters
2. Conditions for equivalence between composite and simple hypothesis testing
3. Analysis of detection exponent under uncertainty

Abstract

We consider the problem of detecting (testing) Gaussian stochastic sequences (signals) with imprecisely known means and covariance matrices. The alternative is independent identically distributed zero-mean Gaussian random variables with unit variances. For a given false alarm (1st-kind error) probability, the quality of minimax detection is given by the best miss probability (2nd-kind error probability) exponent over a growing observation horizon. We explore the maximal set of means and covariance matrices (composite hypothesis) such that its minimax testing can be replaced with testing a single particular pair consisting of a mean and a covariance matrix (simple hypothesis) without degrading the detection exponent. We completely describe this maximal set.

Equations

$$\begin{aligned}\mathcal{H}_{0}&:\ \boldsymbol{y}_{n}=\boldsymbol{\xi}_{n},\quad \boldsymbol{\xi}_{n}\sim\mathcal{N}(\boldsymbol{0},\boldsymbol{I}_{n}),\\ \mathcal{H}_{1}&:\ \boldsymbol{y}_{n}=\boldsymbol{\eta}_{n},\quad \boldsymbol{\eta}_{n}\sim\mathcal{N}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}),\end{aligned}\tag{1}$$

$$\begin{aligned}\mathcal{H}_{0}&:\ \boldsymbol{y}_{n}=\boldsymbol{\xi}_{n},\quad \boldsymbol{\xi}_{n}\sim\mathcal{N}(\boldsymbol{0},\boldsymbol{I}_{n}),\\ \mathcal{H}_{1}&:\ \boldsymbol{y}_{n}=\boldsymbol{\eta}_{n},\quad \boldsymbol{\eta}_{n}\sim\mathcal{N}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}),\quad \boldsymbol{a}_{n}\in\mathcal{A}_{n},\ \boldsymbol{M}_{\!n}\in\mathcal{M}_{n},\end{aligned}\tag{2}$$

$$\boldsymbol{B}_{n}=(\boldsymbol{b}_{n},\boldsymbol{V}_{\!n}),\quad \boldsymbol{b}_{n}\in\mathcal{A}_{n},\quad \boldsymbol{V}_{\!n}\in\mathcal{M}_{n},\qquad \mathcal{F}_{n}=\{\boldsymbol{B}_{n}\}=(\mathcal{A}_{n},\mathcal{M}_{n}).$$

$$\boldsymbol{y}_{n}\in\mathcal{D}\ \Rightarrow\ \mathcal{H}_{0},\qquad \boldsymbol{y}_{n}\notin\mathcal{D}\ \Rightarrow\ \mathcal{H}_{1},\tag{3}$$

$$\alpha(\mathcal{D})=\mathbf{P}(\boldsymbol{y}_{n}\notin\mathcal{D}\mid\mathcal{H}_{0})\tag{4}$$

$$\beta(\mathcal{D},\mathcal{A}_{n},\mathcal{M}_{n})=\sup_{\boldsymbol{a}_{n}\in\mathcal{A}_{n}}\ \sup_{\boldsymbol{M}_{\!n}\in\mathcal{M}_{n}}\mathbf{P}(\boldsymbol{y}_{n}\in\mathcal{D}\mid\boldsymbol{M}_{\!n},\boldsymbol{a}_{n}).\tag{5}$$

$$\beta(\alpha,\mathcal{A}_{n},\mathcal{M}_{n})=\inf_{\mathcal{D}:\,\alpha(\mathcal{D})\leq\alpha}\beta(\mathcal{D},\mathcal{A}_{n},\mathcal{M}_{n}),\tag{6}$$

$$\sup_{\boldsymbol{M}_{\!n}\in\mathcal{M}_{n}}\beta(\alpha,\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})\leq\beta(\alpha,\boldsymbol{a}_{n},\mathcal{M}_{n}).\tag{7}$$

$$\sup_{\boldsymbol{a}_{n}\in\mathcal{A}_{n}}\ \sup_{\boldsymbol{M}_{\!n}\in\mathcal{M}_{n}}\beta(\alpha,\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})\leq\beta(\alpha,\mathcal{A}_{n},\mathcal{M}_{n}).\tag{8}$$

$$\lim_{n\to\infty}\frac{1}{n}\ln\beta(\alpha,\mathcal{A}_{n},\mathcal{M}_{n})=\lim_{n\to\infty}\frac{1}{n}\ln\beta(\alpha,\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}).\tag{9}$$

$$\lim_{n\to\infty}\frac{1}{n}\ln\beta(\mathcal{F}_{0}(\boldsymbol{F}_{\!n}))=\lim_{n\to\infty}\frac{1}{n}\ln\beta(\boldsymbol{F}_{\!n}).\tag{10}$$

$$\ln p_{\boldsymbol{I}_{n}}(\boldsymbol{y}_{n})=-\frac{1}{2}\bigl[n\ln(2\pi)+(\boldsymbol{y}_{n},\boldsymbol{y}_{n})\bigr],\qquad \ln p_{\boldsymbol{F}_{\!n}}(\boldsymbol{y}_{n})=-\frac{1}{2}\bigl[n\ln(2\pi)+\ln|\boldsymbol{M}_{\!n}|+(\boldsymbol{y}_{n}-\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}^{-1}(\boldsymbol{y}_{n}-\boldsymbol{a}_{n}))\bigr].\tag{11}$$

$$r_{\boldsymbol{F}_{\!n}}(\boldsymbol{y}_{n})=\ln\frac{p_{\boldsymbol{I}_{n}}}{p_{\boldsymbol{F}_{\!n}}}(\boldsymbol{y}_{n})=\frac{1}{2}\Bigl[\ln|\boldsymbol{M}_{\!n}|+(\boldsymbol{y}_{n},(\boldsymbol{M}_{\!n}^{-1}-\boldsymbol{I}_{n})\boldsymbol{y}_{n})-2(\boldsymbol{y}_{n},\boldsymbol{M}_{\!n}^{-1}\boldsymbol{a}_{n})+(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}^{-1}\boldsymbol{a}_{n})\Bigr].\tag{12}$$

$$\mathcal{D}_{\rm LR}(\boldsymbol{F}_{\!n},\alpha)=\{\boldsymbol{y}_{n}\in\mathbb{R}^{n}:\ r_{\boldsymbol{F}_{\!n}}(\boldsymbol{y}_{n})\geq\gamma\},\tag{13}$$

$$\alpha=\mathbf{P}_{\boldsymbol{I}_{n}}\Bigl\{\bigl[\ln|\boldsymbol{M}_{\!n}|+(\boldsymbol{\xi}_{n},(\boldsymbol{M}_{\!n}^{-1}-\boldsymbol{I}_{n})\boldsymbol{\xi}_{n})-2(\boldsymbol{\xi}_{n},\boldsymbol{M}_{\!n}^{-1}\boldsymbol{a}_{n})+(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}^{-1}\boldsymbol{a}_{n})\bigr]\leq 2\gamma\Bigr\}.\tag{14}$$

$$\lim_{n\to\infty}\frac{1}{n}\ln\sup_{(\boldsymbol{b}_{n},\boldsymbol{V}_{\!n})\in\mathcal{F}_{n}^{\rm LR}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})}\beta(\boldsymbol{b}_{n},\boldsymbol{V}_{\!n})=\lim_{n\to\infty}\frac{1}{n}\ln\beta(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}),\tag{15}$$

$$D(\mathbf{P}\,\|\,\mathbf{Q})=\mathbf{E}_{\mathbf{P}}\ln\frac{d\mathbf{P}}{d\mathbf{Q}}(\boldsymbol{x})\geq 0,\tag{16}$$

$$\begin{aligned}D(\mathbf{P}_{\boldsymbol{I}_{n}}\,\|\,\mathbf{Q}_{\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}})&=\frac{1}{2}\Bigl[\ln|\boldsymbol{M}_{\!n}|+(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}^{-1}\boldsymbol{a}_{n})+\mathbf{E}_{\boldsymbol{\xi}_{n}}(\boldsymbol{\xi}_{n},(\boldsymbol{M}_{\!n}^{-1}-\boldsymbol{I}_{n})\boldsymbol{\xi}_{n})\Bigr]\\ &=\frac{1}{2}\Biggl[\,\sum_{i=1}^{n}\Bigl(\ln\lambda_{i}+\frac{1}{\lambda_{i}}-1\Bigr)+(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}^{-1}\boldsymbol{a}_{n})\Biggr],\end{aligned}\tag{17}$$

$$\lim_{n\to\infty}\frac{1}{n}\sum_{i=1}^{n}\Bigl(\ln\lambda_{i}(\boldsymbol{M}_{\!n})+\frac{1}{\lambda_{i}(\boldsymbol{M}_{\!n})}-1\Bigr)\tag{18}$$

$$\lim_{n\to\infty}\frac{1}{n}\sup_{\boldsymbol{M}_{\!n}\in\mathcal{M}_{n}}\sum_{i=1}^{n}\Bigl|\frac{1}{\lambda_{i}(\boldsymbol{M}_{\!n})}-1\Bigr|^{1+\delta}<\infty.\tag{19}$$

$$f_{\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}}(\boldsymbol{b}_{n},\boldsymbol{V}_{\!n})=\frac{|\boldsymbol{M}_{\!n}|\,e^{-K}}{|\boldsymbol{V}_{\!n}|\,|\boldsymbol{B}_{n}|},\tag{20}$$

$$\boldsymbol{B}_{n}=\boldsymbol{I}_{n}+\boldsymbol{V}_{\!n}^{-1}-\boldsymbol{M}_{\!n}^{-1},\qquad \boldsymbol{d}=\boldsymbol{B}_{n}^{-1}(\boldsymbol{V}_{\!n}^{-1}\boldsymbol{b}_{n}-\boldsymbol{M}_{\!n}^{-1}\boldsymbol{a}_{n}),\qquad K=(\boldsymbol{b}_{n},\boldsymbol{V}_{\!n}^{-1}\boldsymbol{b}_{n})-(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}^{-1}\boldsymbol{a}_{n})-(\boldsymbol{d},\boldsymbol{B}_{n}\boldsymbol{d}).\tag{21}$$

$$\mathcal{F}_{0}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})=\Bigl\{(\boldsymbol{b}_{n},\boldsymbol{V}_{\!n}):\ \boldsymbol{I}_{n}+\boldsymbol{V}_{\!n}^{-1}-\boldsymbol{M}_{\!n}^{-1}>\mathbf{0},\ f_{\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}}(\boldsymbol{b}_{n},\boldsymbol{V}_{\!n})\leq e^{o(n)}\Bigr\},\tag{22}$$

$$\mathcal{F}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})=\mathcal{F}^{\rm LR}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})=\mathcal{F}_{0}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}),\tag{23}$$

$$a\boldsymbol{V}_{\!n}^{(1)}+(1-a)\boldsymbol{V}_{\!n}^{(2)}\in\mathcal{F}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}),\quad\text{for any } 0\leq a\leq 1.$$

$$\mu_{i}=1+\frac{1}{\nu_{i}}-\frac{1}{\lambda_{i}},\quad i=1,\ldots,n.\tag{24}$$

$$K=\sum_{i=1}^{n}\biggl[\frac{b_{i,n}^{2}}{\nu_{i}}-\frac{a_{i,n}^{2}}{\lambda_{i}}-\frac{1}{\mu_{i}}\Bigl(\frac{b_{i,n}}{\nu_{i}}-\frac{a_{i,n}}{\lambda_{i}}\Bigr)^{2}\biggr].\tag{25}$$


Full text

Problems of Information Transmission, vol. 58, no. 3, pp. 70–84, 2022.

M. V. Burnashev

On Minimax Detection of Gaussian Stochastic Sequences with Imprecisely Known Means and Covariance Matrices

This work was supported by the Russian Foundation for Basic Research under Grant 19-01-00364.

Key words and phrases: Minimax testing of hypotheses, error exponent, type-I error probability, type-II error probability, Stein’s exponent.

1 Introduction and the Main Results

1.1 Problem Setting

One of the traditional problems of testing simple hypotheses $\mathcal{H}_{0}$ and $\mathcal{H}_{1}$ concerning a Gaussian signal vector $\boldsymbol{\eta}_{n}$ in a Gaussian noise background $\boldsymbol{\xi}_{n}$ (i.e., the problem of signal detection in noise), based on observations $\boldsymbol{y}_{n}^{T}=\boldsymbol{y}_{n}^{\prime}=(y_{1},\ldots,y_{n})\in\mathbb{R}^{n}$, has the form

$$\begin{aligned}\mathcal{H}_{0}&:\ \boldsymbol{y}_{n}=\boldsymbol{\xi}_{n},\quad \boldsymbol{\xi}_{n}\sim\mathcal{N}(\boldsymbol{0},\boldsymbol{I}_{n}),\\ \mathcal{H}_{1}&:\ \boldsymbol{y}_{n}=\boldsymbol{\eta}_{n},\quad \boldsymbol{\eta}_{n}\sim\mathcal{N}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}),\end{aligned}\tag{1}$$

where the sample $\boldsymbol{\xi}_{n}^{T}=(\xi_{1},\ldots,\xi_{n})$ represents “noise” and consists of independent identically distributed Gaussian random variables with zero means and unit variances, and $\boldsymbol{I}_{n}$ is the identity covariance matrix. The stochastic “signal” $\boldsymbol{\eta}_{n}$ is a Gaussian random vector with known mean $\boldsymbol{a}_{n}$ and known covariance matrix $\boldsymbol{M}_{\!n}$.

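However, in practice the mean $\boldsymbol{a}_{n}$ and the matrix $\boldsymbol{M}_{\!n}$ are usually not known precisely, so the observation model (1) in reality takes the form

$$\begin{aligned}\mathcal{H}_{0}&:\ \boldsymbol{y}_{n}=\boldsymbol{\xi}_{n},\quad \boldsymbol{\xi}_{n}\sim\mathcal{N}(\boldsymbol{0},\boldsymbol{I}_{n}),\\ \mathcal{H}_{1}&:\ \boldsymbol{y}_{n}=\boldsymbol{\eta}_{n},\quad \boldsymbol{\eta}_{n}\sim\mathcal{N}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}),\quad \boldsymbol{a}_{n}\in\mathcal{A}_{n},\ \boldsymbol{M}_{\!n}\in\mathcal{M}_{n},\end{aligned}\tag{2}$$

where $\mathcal{A}_{n}$ is a given set of possible means $\boldsymbol{a}_{n}$, and $\mathcal{M}_{n}$ is a given set of possible covariance matrices $\boldsymbol{M}_{\!n}$ (possibly depending on $\boldsymbol{a}_{n}$). We denote for convenience

$$\boldsymbol{B}_{n}=(\boldsymbol{b}_{n},\boldsymbol{V}_{\!n}),\quad \boldsymbol{b}_{n}\in\mathcal{A}_{n},\quad \boldsymbol{V}_{\!n}\in\mathcal{M}_{n},\qquad \mathcal{F}_{n}=\{\boldsymbol{B}_{n}\}=(\mathcal{A}_{n},\mathcal{M}_{n}).$$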

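Further, for the model (2) we consider the problem of minimax testing [1, 2, 3] of the simple hypothesis $\mathcal{H}_{0}$ against the composite alternative $\mathcal{H}_{1}$, based on observations $\boldsymbol{y}_{n}^{T}=\boldsymbol{y}_{n}^{\prime}=(y_{1},\ldots,y_{n})\in\mathbb{R}^{n}$. If a set $\mathcal{D}\subseteq\mathbb{R}^{n}$ is chosen for deciding in favor of $\mathcal{H}_{0}$, such that

$$\boldsymbol{y}_{n}\in\mathcal{D}\ \Rightarrow\ \mathcal{H}_{0},\qquad \boldsymbol{y}_{n}\notin\mathcal{D}\ \Rightarrow\ \mathcal{H}_{1},\tag{3}$$

then the 1st-kind error probability (“false alarm”) $\alpha(\mathcal{D})$ and the 2nd-kind error probability (“miss probability”) $\beta(\mathcal{D},\mathcal{A}_{n},\mathcal{M}_{n})$ are defined, respectively, by the formulas

$$\alpha(\mathcal{D})=\mathbf{P}(\boldsymbol{y}_{n}\notin\mathcal{D}\mid\mathcal{H}_{0})\tag{4}$$

and

$$\beta(\mathcal{D},\mathcal{A}_{n},\mathcal{M}_{n})=\sup_{\boldsymbol{a}_{n}\in\mathcal{A}_{n}}\ \sup_{\boldsymbol{M}_{\!n}\in\mathcal{M}_{n}}\mathbf{P}(\boldsymbol{y}_{n}\in\mathcal{D}\mid\boldsymbol{M}_{\!n},\boldsymbol{a}_{n}).\tag{5}$$

We are interested in the minimal possible 2nd-kind error probability $\beta(\mathcal{D},\mathcal{A}_{n},\mathcal{M}_{n})$ (see (4) and (5)) for a given 1st-kind error probability $\alpha$, $0<\alpha<1$:

$$\beta(\alpha,\mathcal{A}_{n},\mathcal{M}_{n})=\inf_{\mathcal{D}:\,\alpha(\mathcal{D})\leq\alpha}\beta(\mathcal{D},\mathcal{A}_{n},\mathcal{M}_{n}),\tag{6}$$

and in the corresponding optimal decision set $\mathcal{D}(\alpha)$ from (3).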

In the paper, we consider the case when the value $\alpha$ is fixed (or vanishes slowly as $n\to\infty$). This case is sometimes called the Neyman–Pearson problem of minimax testing of hypotheses. Here the 1st-kind and the 2nd-kind errors imply very different losses for the statistician, who is mainly interested in minimizing the 2nd-kind error probability $\beta=\mathbf{P}\{\mathcal{H}_{0}\mid\mathcal{H}_{1}\}$. The case is quite popular in various applications (see, e.g., [4] and the bibliography therein).

For a given mean $\boldsymbol{a}_{n}$, matrix $\boldsymbol{M}_{\!n}$, and value $\alpha$, denote by $\beta(\alpha,\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ the minimal possible 2nd-kind error probability (see (6)). The corresponding optimal decision set $\mathcal{D}(\alpha,\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ is described by the Neyman–Pearson lemma [1, 2]. Clearly,

$$\sup_{\boldsymbol{M}_{\!n}\in\mathcal{M}_{n}}\beta(\alpha,\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})\leq\beta(\alpha,\boldsymbol{a}_{n},\mathcal{M}_{n}).\tag{7}$$

For a fixed $\alpha$ and given sets $\mathcal{A}_{n},\mathcal{M}_{n}$, denote also by $\beta(\alpha,\mathcal{A}_{n},\mathcal{M}_{n})$ the minimal possible 2nd-kind error probability (see (6)). Then, similarly to (7), we have

$$\sup_{\boldsymbol{a}_{n}\in\mathcal{A}_{n}}\ \sup_{\boldsymbol{M}_{\!n}\in\mathcal{M}_{n}}\beta(\alpha,\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})\leq\beta(\alpha,\mathcal{A}_{n},\mathcal{M}_{n}).\tag{8}$$

In many practical cases the value $\beta(\alpha,\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ decreases exponentially as $n\to\infty$. Therefore, it is natural (in any case, simpler and more productive) to investigate the corresponding exponents $n^{-1}\ln\beta(\alpha,\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ and $n^{-1}\ln\beta(\alpha,\mathcal{A}_{n},\mathcal{M}_{n})$ as $n\to\infty$ (some results on the equality in (8) are contained in [5]).

In the paper, we investigate sets $\mathcal{F}_{n}=(\mathcal{A}_{n},\mathcal{M}_{n})$ for which the following asymptotic equality holds in (8):

$$\lim_{n\to\infty}\frac{1}{n}\ln\beta(\alpha,\mathcal{A}_{n},\mathcal{M}_{n})=\lim_{n\to\infty}\frac{1}{n}\ln\beta(\alpha,\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}).\tag{9}$$

The motivation for investigating minimax testing of hypotheses (detection of signals) is described in detail in [1, 2, 3, 4]. If the relation (9) holds for given sets of means $\mathcal{A}_{n}$ and matrices $\mathcal{M}_{n}$, then we may replace (without asymptotic losses) the entire set $\mathcal{F}_{n}$ by the particular pair $(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$. Recall that the optimal test for a particular pair $(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ is described by the Neyman–Pearson lemma and reduces to the simple likelihood ratio test (LR-test). Otherwise (without relation (9)), the optimal minimax test is a much more complicated Bayes test with respect to the least favorable prior distribution on the set $\mathcal{F}_{n}$. Therefore, it is natural to investigate when the given set $\mathcal{F}_{n}$ can be replaced by a particular pair $\boldsymbol{F}_{\!n}=(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$. From the technical viewpoint, however, it is more convenient to consider the equivalent problem: for a given pair $\boldsymbol{F}_{\!n}$, find the maximal set of pairs $\mathcal{F}_{n}(\boldsymbol{F}_{\!n})$ that can be replaced by the pair $\boldsymbol{F}_{\!n}$. This is the main problem considered in the paper.

Remark 1. Models (1) and (2) can be reduced to equivalent models with a diagonal matrix $\boldsymbol{M}_{\!n}$. Indeed, since $\boldsymbol{M}_{\!n}$ is a covariance matrix (i.e., symmetric and positive definite), there exist an orthogonal matrix $\boldsymbol{T}_{\!n}$ and a diagonal matrix $\boldsymbol{\Lambda}_{n}$ such that $\boldsymbol{M}_{\!n}=\boldsymbol{T}_{\!n}\boldsymbol{\Lambda}_{n}\boldsymbol{T}_{\!n}^{\prime}$ (see [6, §§ 4.7–4.9], [7, Theorem 4.1.5]). The diagonal matrix $\boldsymbol{\Lambda}_{n}=\boldsymbol{T}_{\!n}^{\prime}\boldsymbol{M}_{\!n}\boldsymbol{T}_{\!n}$ consists of the eigenvalues $\{\lambda_{i}\}$ of the matrix $\boldsymbol{M}_{\!n}$. Note also that for any orthogonal matrix $\boldsymbol{T}_{\!n}$ the vector $\boldsymbol{T}_{\!n}^{\prime}\boldsymbol{\xi}_{n}$ has the same distribution as $\boldsymbol{\xi}_{n}$ (for the simple hypothesis $\mathcal{H}_{0}$ of (2)). Therefore, multiplying both sides of (2) by $\boldsymbol{T}_{\!n}^{\prime}$, we reduce the model (2) to the equivalent case with a diagonal matrix $\boldsymbol{M}_{\!n}$ and the rotated mean $\boldsymbol{T}_{\!n}^{\prime}\boldsymbol{a}_{n}$.
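The reduction in Remark 1 can be illustrated numerically. The following is a minimal sketch, assuming NumPy is available; the function name `diagonalize_model` and the example pair `(a, M)` are hypothetical and chosen only for illustration.

```python
import numpy as np

def diagonalize_model(a, M):
    """Rotate the pair (a, M) so that the covariance matrix becomes diagonal.

    Returns the eigenvalues lam, the orthogonal matrix T with
    M = T @ diag(lam) @ T.T, and the rotated mean T.T @ a.
    """
    lam, T = np.linalg.eigh(M)  # eigh is intended for symmetric matrices
    a_rot = T.T @ a             # mean of T' y under H1
    return lam, T, a_rot

rng = np.random.default_rng(0)
G = rng.standard_normal((4, 4))
M = G @ G.T + 4.0 * np.eye(4)   # hypothetical positive definite covariance
a = rng.standard_normal(4)      # hypothetical mean

lam, T, a_rot = diagonalize_model(a, M)
Lambda = T.T @ M @ T            # should equal diag(lam) up to rounding

# The quadratic form (a, M^{-1} a) is unchanged by the rotation.
quad_original = a @ np.linalg.solve(M, a)
quad_rotated = np.sum(a_rot**2 / lam)
```

Since the rotation leaves the distribution of the noise unchanged, quantities such as $(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}^{-1}\boldsymbol{a}_{n})$ and $\ln|\boldsymbol{M}_{\!n}|$ can be computed from the eigenvalues and the rotated mean alone.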

Definition 1. For a fixed $\alpha$ and a given sequence of pairs $\boldsymbol{F}_{\!n}=(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$, denote by $\mathcal{F}_{0}(\boldsymbol{F}_{\!n})$ the sequence of the largest sets of pairs such that the equality (9) takes the form

$$\lim_{n\to\infty}\frac{1}{n}\ln\beta(\mathcal{F}_{0}(\boldsymbol{F}_{\!n}))=\lim_{n\to\infty}\frac{1}{n}\ln\beta(\boldsymbol{F}_{\!n}).\tag{10}$$

Clearly, $\boldsymbol{F}_{\!n}\in\mathcal{F}_{0}(\boldsymbol{F}_{\!n})$.

In other words, for a given 1st-kind error probability $\alpha$ the sequence $\mathcal{F}_{0}(\boldsymbol{F}_{\!n})$ is the largest set of pairs that can be replaced (without asymptotic losses for $\beta(\mathcal{F}_{0}(\boldsymbol{F}_{\!n}))$) by the single pair $\boldsymbol{F}_{\!n}$. Below we describe (Theorem 1) the largest set $\mathcal{F}_{0}(\boldsymbol{F}_{\!n})$ satisfying (10). It generalizes a similar result from [8], where the case $\boldsymbol{a}_{n}=\mathbf{0}_{n}$ was considered. It also strengthens a similar result from [4], where some lower bounds for the set $\mathcal{F}_{0}(\mathbf{0}_{n},\boldsymbol{M}_{\!n})$ were obtained.

It is convenient to investigate first the analogous maximal sets $\mathcal{F}_{0}^{\rm LR}(\boldsymbol{F}_{\!n})$, which arise if the LR-detector (see Definition 2) is used. It will be shown that $\mathcal{F}_{0}(\boldsymbol{F}_{\!n})=\mathcal{F}_{0}^{\rm LR}(\boldsymbol{F}_{\!n})$, i.e., the LR-detector is asymptotically optimal.

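In models (1) and (2), denote by $\mathbf{P}_{\boldsymbol{I}_{n}}$ the distribution of $\boldsymbol{y}_{n}=\boldsymbol{\xi}_{n}$, where $\boldsymbol{\xi}_{n}\sim\mathcal{N}(\boldsymbol{0},\boldsymbol{I}_{n})$. Similarly, denote by $\mathbf{Q}_{\boldsymbol{F}_{\!n}}$, $\boldsymbol{F}_{\!n}=(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$, the distribution of $\boldsymbol{y}_{n}=\boldsymbol{\eta}_{n}$, where $\boldsymbol{\eta}_{n}\sim\mathcal{N}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$. Denote also by $p_{\boldsymbol{I}_{n}}(\boldsymbol{y}_{n})$ and $p_{\boldsymbol{F}_{\!n}}(\boldsymbol{y}_{n})$, $\boldsymbol{y}_{n}\in\mathbb{R}^{n}$, the corresponding probability densities. For an $n\times n$ matrix $\boldsymbol{M}_{\!n}$ denote $|\boldsymbol{M}_{\!n}|=\det\boldsymbol{M}_{\!n}$. Note that, if $|\boldsymbol{M}_{\!n}|\neq 0$, then

$$\ln p_{\boldsymbol{I}_{n}}(\boldsymbol{y}_{n})=-\frac{1}{2}\bigl[n\ln(2\pi)+(\boldsymbol{y}_{n},\boldsymbol{y}_{n})\bigr],\qquad \ln p_{\boldsymbol{F}_{\!n}}(\boldsymbol{y}_{n})=-\frac{1}{2}\bigl[n\ln(2\pi)+\ln|\boldsymbol{M}_{\!n}|+(\boldsymbol{y}_{n}-\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}^{-1}(\boldsymbol{y}_{n}-\boldsymbol{a}_{n}))\bigr].\tag{11}$$

For $|\boldsymbol{M}_{\!n}|\neq 0$ introduce also the logarithm of the likelihood ratio (see (11))

$$r_{\boldsymbol{F}_{\!n}}(\boldsymbol{y}_{n})=\ln\frac{p_{\boldsymbol{I}_{n}}}{p_{\boldsymbol{F}_{\!n}}}(\boldsymbol{y}_{n})=\frac{1}{2}\Bigl[\ln|\boldsymbol{M}_{\!n}|+(\boldsymbol{y}_{n},(\boldsymbol{M}_{\!n}^{-1}-\boldsymbol{I}_{n})\boldsymbol{y}_{n})-2(\boldsymbol{y}_{n},\boldsymbol{M}_{\!n}^{-1}\boldsymbol{a}_{n})+(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}^{-1}\boldsymbol{a}_{n})\Bigr].\tag{12}$$

Consider first LR-detectors. Introduce the corresponding decision sets $\mathcal{D}_{\rm LR}(\boldsymbol{F}_{\!n},\alpha)$ in favor of the hypothesis $\mathcal{H}_{0}$ (i.e., in favor of the matrix $\boldsymbol{I}_{n}$), when the simple hypotheses $\boldsymbol{I}_{n}$ and $\boldsymbol{F}_{\!n}$ are tested:

$$\mathcal{D}_{\rm LR}(\boldsymbol{F}_{\!n},\alpha)=\{\boldsymbol{y}_{n}\in\mathbb{R}^{n}:\ r_{\boldsymbol{F}_{\!n}}(\boldsymbol{y}_{n})\geq\gamma\},\tag{13}$$

where $\gamma$ is such that (see (12))

$$\alpha=\mathbf{P}_{\boldsymbol{I}_{n}}\Bigl\{\bigl[\ln|\boldsymbol{M}_{\!n}|+(\boldsymbol{\xi}_{n},(\boldsymbol{M}_{\!n}^{-1}-\boldsymbol{I}_{n})\boldsymbol{\xi}_{n})-2(\boldsymbol{\xi}_{n},\boldsymbol{M}_{\!n}^{-1}\boldsymbol{a}_{n})+(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}^{-1}\boldsymbol{a}_{n})\bigr]\leq 2\gamma\Bigr\}.\tag{14}$$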
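The LR-detector defined by (12)–(14) can be sketched numerically for a diagonal $\boldsymbol{M}_{\!n}$ (which is no restriction by Remark 1). This is a Monte Carlo illustration only, assuming NumPy; the helper names `log_lr` and `calibrate_gamma`, the sample sizes, and the example values of $\boldsymbol{a}_{n}$, $\{\lambda_{i}\}$, and $\alpha$ are hypothetical. The threshold $\gamma$ is estimated as an empirical quantile, which is a simple stand-in for solving (14) exactly.

```python
import numpy as np

def log_lr(y, a, lam):
    """Log-likelihood ratio r_F(y) = ln(p_I(y) / p_F(y)) for M = diag(lam).

    y may be a single vector of length n or an array of shape (N, n).
    """
    return 0.5 * (np.sum(np.log(lam))
                  + np.sum(y * y * (1.0 / lam - 1.0), axis=-1)
                  - 2.0 * np.sum(y * a / lam, axis=-1)
                  + np.sum(a * a / lam))

def calibrate_gamma(a, lam, alpha, n_samples=100_000, seed=0):
    """Estimate gamma such that P(r < gamma | H0) is approximately alpha."""
    rng = np.random.default_rng(seed)
    xi = rng.standard_normal((n_samples, lam.size))  # samples under H0
    # H0 is accepted when r >= gamma, so gamma is the alpha-quantile of r.
    return np.quantile(log_lr(xi, a, lam), alpha)

# Hypothetical example: n = 8, fixed false alarm probability alpha = 0.1.
n, alpha = 8, 0.1
lam = np.linspace(1.5, 3.0, n)
a = np.full(n, 0.5)
gamma = calibrate_gamma(a, lam, alpha)

# Fresh samples to estimate both error probabilities.
rng = np.random.default_rng(1)
xi = rng.standard_normal((100_000, n))                        # H0
eta = a + np.sqrt(lam) * rng.standard_normal((100_000, n))    # H1
alpha_hat = float(np.mean(log_lr(xi, a, lam) < gamma))    # false alarm
beta_hat = float(np.mean(log_lr(eta, a, lam) >= gamma))   # miss
```

For small $n$ the miss probability is far from its asymptotic regime, so the estimate `beta_hat` only illustrates the definitions and not the exponent.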

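Definition 2. For a fixed $\alpha$ and a given sequence of pairs $\boldsymbol{F}_{\!n}=(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$, denote by $\mathcal{F}_{0}^{\rm LR}(\boldsymbol{F}_{\!n})$ the sequence of the largest sets of pairs $(\boldsymbol{b}_{n},\boldsymbol{V}_{\!n})$ such that

$$\lim_{n\to\infty}\frac{1}{n}\ln\sup_{(\boldsymbol{b}_{n},\boldsymbol{V}_{\!n})\in\mathcal{F}_{n}^{\rm LR}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})}\beta(\boldsymbol{b}_{n},\boldsymbol{V}_{\!n})=\lim_{n\to\infty}\frac{1}{n}\ln\beta(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}),\tag{15}$$

provided the decision sets $\mathcal{D}_{\rm LR}(\alpha,\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ are used.

Below, in Theorem 1, the set $\mathcal{F}_{0}^{\rm LR}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ for the model (2) is described.

We shall also need the following definition [9].

Definition 3. For probability measures $\mathbf{P}$ and $\mathbf{Q}$ on a measurable space $(\mathcal{X},\mathcal{B})$ introduce the function (the Kullback–Leibler distance, or divergence, for the measures $\mathbf{P}$ and $\mathbf{Q}$)

$$D(\mathbf{P}\,\|\,\mathbf{Q})=\mathbf{E}_{\mathbf{P}}\ln\frac{d\mathbf{P}}{d\mathbf{Q}}(\boldsymbol{x})\geq 0,\tag{16}$$

where the expectation is taken over the measure $\mathbf{P}$.

Using formulas (11) and (16) we have

$$\begin{aligned}D(\mathbf{P}_{\boldsymbol{I}_{n}}\,\|\,\mathbf{Q}_{\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}})&=\frac{1}{2}\Bigl[\ln|\boldsymbol{M}_{\!n}|+(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}^{-1}\boldsymbol{a}_{n})+\mathbf{E}_{\boldsymbol{\xi}_{n}}(\boldsymbol{\xi}_{n},(\boldsymbol{M}_{\!n}^{-1}-\boldsymbol{I}_{n})\boldsymbol{\xi}_{n})\Bigr]\\ &=\frac{1}{2}\Biggl[\,\sum_{i=1}^{n}\Bigl(\ln\lambda_{i}+\frac{1}{\lambda_{i}}-1\Bigr)+(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}^{-1}\boldsymbol{a}_{n})\Biggr],\end{aligned}\tag{17}$$

where $\{\lambda_{1},\ldots,\lambda_{n}\}$ are the eigenvalues (all positive) of the covariance matrix $\boldsymbol{M}_{\!n}$, and $\boldsymbol{a}_{n}=(a_{1},\ldots,a_{n})$.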
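The two expressions for the divergence in (17) can be compared numerically. The sketch below assumes NumPy and uses the fact that $\mathbf{E}(\boldsymbol{\xi}_{n},\boldsymbol{A}\boldsymbol{\xi}_{n})=\operatorname{tr}\boldsymbol{A}$ for $\boldsymbol{\xi}_{n}\sim\mathcal{N}(\boldsymbol{0},\boldsymbol{I}_{n})$; the function names and the example pair are hypothetical.

```python
import numpy as np

def kl_trace_form(a, M):
    """First line of (17): the expectation is written as a trace."""
    n = a.size
    Minv = np.linalg.inv(M)
    logdet = np.linalg.slogdet(M)[1]
    # E (xi, (M^{-1} - I) xi) = trace(M^{-1} - I) for xi ~ N(0, I)
    return 0.5 * (logdet + a @ Minv @ a + np.trace(Minv) - n)

def kl_eigen_form(a, M):
    """Second line of (17): sum over the eigenvalues of M."""
    lam = np.linalg.eigvalsh(M)
    quad = a @ np.linalg.solve(M, a)
    return 0.5 * (np.sum(np.log(lam) + 1.0 / lam - 1.0) + quad)

rng = np.random.default_rng(0)
G = rng.standard_normal((3, 3))
M = G @ G.T + np.eye(3)        # hypothetical covariance matrix
a = rng.standard_normal(3)     # hypothetical mean

D_trace = kl_trace_form(a, M)
D_eigen = kl_eigen_form(a, M)
D_null = kl_trace_form(np.zeros(3), np.eye(3))  # identical distributions
```

Both forms agree up to rounding, and the divergence vanishes when $(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})=(\mathbf{0}_{n},\boldsymbol{I}_{n})$.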

1.2 Assumptions

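In the model (2), denote by $\lambda_{1}(\boldsymbol{M}_{\!n}),\ldots,\lambda_{n}(\boldsymbol{M}_{\!n})$ the eigenvalues (all positive) of the covariance matrix $\boldsymbol{M}_{\!n}$. We assume that the following assumptions are satisfied:

I. For all covariance matrices $\boldsymbol{M}_{\!n}\in\mathcal{M}_{n}(\boldsymbol{M}_{\!n})$ there exists the limit (see (17))

$$\lim_{n\to\infty}\frac{1}{n}\sum_{i=1}^{n}\Bigl(\ln\lambda_{i}(\boldsymbol{M}_{\!n})+\frac{1}{\lambda_{i}(\boldsymbol{M}_{\!n})}-1\Bigr)\tag{18}$$

(note that $\ln z+1/z-1\geq 0$ for $z>0$).

II. For some $\delta>0$ we have

$$\lim_{n\to\infty}\frac{1}{n}\sup_{\boldsymbol{M}_{\!n}\in\mathcal{M}_{n}}\sum_{i=1}^{n}\Bigl|\frac{1}{\lambda_{i}(\boldsymbol{M}_{\!n})}-1\Bigr|^{1+\delta}<\infty.\tag{19}$$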

1.3 Main results

We first make an important explanation.

Remark 2. There is the following technical difficulty in describing the maximal sets $\mathcal{F}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$. The relation (9) has an asymptotic (as $n\to\infty$) character. Therefore, the maximal sets $\mathcal{F}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ can also be described only asymptotically (as $n\to\infty$). For that purpose, it is most convenient to describe the simplest sequence of sets that gives the maximal sets $\mathcal{F}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ in the limit.

In this paper, for an $n\times n$ matrix $\boldsymbol{A}_{n}$ we denote $|\boldsymbol{A}_{n}|=\det\boldsymbol{A}_{n}$. By $(\boldsymbol{x},\boldsymbol{y})$ we denote the inner product of vectors $\boldsymbol{x},\boldsymbol{y}$. We write $\boldsymbol{A}_{n}>\mathbf{0}$ if $\boldsymbol{A}_{n}$ is positive definite.

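Let $\mathcal{C}_{n}$ be the set of all $n\times n$ covariance (i.e., symmetric and positive definite) matrices in $\mathbb{R}^{n}$. For any $\boldsymbol{M}_{\!n},\boldsymbol{V}_{\!n}\in\mathcal{C}_{n}$ and any $\boldsymbol{a}_{n},\boldsymbol{b}_{n}\in\mathbb{R}^{n}$ define the function

$$f_{\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}}(\boldsymbol{b}_{n},\boldsymbol{V}_{\!n})=\frac{|\boldsymbol{M}_{\!n}|\,e^{-K}}{|\boldsymbol{V}_{\!n}|\,|\boldsymbol{B}_{n}|},\tag{20}$$

where

$$\boldsymbol{B}_{n}=\boldsymbol{I}_{n}+\boldsymbol{V}_{\!n}^{-1}-\boldsymbol{M}_{\!n}^{-1},\qquad \boldsymbol{d}=\boldsymbol{B}_{n}^{-1}(\boldsymbol{V}_{\!n}^{-1}\boldsymbol{b}_{n}-\boldsymbol{M}_{\!n}^{-1}\boldsymbol{a}_{n}),\qquad K=(\boldsymbol{b}_{n},\boldsymbol{V}_{\!n}^{-1}\boldsymbol{b}_{n})-(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}^{-1}\boldsymbol{a}_{n})-(\boldsymbol{d},\boldsymbol{B}_{n}\boldsymbol{d}).\tag{21}$$

For a sequence of pairs $(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ introduce the following sequence of sets of pairs $(\boldsymbol{b}_{n},\boldsymbol{V}_{\!n})$:

$$\mathcal{F}_{0}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})=\Bigl\{(\boldsymbol{b}_{n},\boldsymbol{V}_{\!n}):\ \boldsymbol{I}_{n}+\boldsymbol{V}_{\!n}^{-1}-\boldsymbol{M}_{\!n}^{-1}>\mathbf{0},\ f_{\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}}(\boldsymbol{b}_{n},\boldsymbol{V}_{\!n})\leq e^{o(n)}\Bigr\},\tag{22}$$

where the function $f_{\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}}(\boldsymbol{b}_{n},\boldsymbol{V}_{\!n})$ is defined in (20).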
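The quantities in (20)–(22) are straightforward to evaluate for a fixed $n$. The following is a minimal sketch, assuming NumPy; the names `log_f` and `in_core` are hypothetical, and the parameter `slack` is only a stand-in for the $o(n)$ term in (22), which is an asymptotic statement and cannot be checked for a single $n$.

```python
import numpy as np

def logdet(A):
    """Natural logarithm of |det A| via a numerically stable routine."""
    return np.linalg.slogdet(A)[1]

def log_f(a, M, b, V):
    """Return ln f_{a,M}(b,V) from (20)-(21), or None if B is not positive definite."""
    n = a.size
    Minv, Vinv = np.linalg.inv(M), np.linalg.inv(V)
    B = np.eye(n) + Vinv - Minv
    if np.min(np.linalg.eigvalsh(B)) <= 0:
        return None  # the pair (b, V) violates the first condition in (22)
    d = np.linalg.solve(B, Vinv @ b - Minv @ a)
    K = b @ Vinv @ b - a @ Minv @ a - d @ B @ d
    return logdet(M) - K - logdet(V) - logdet(B)

def in_core(a, M, b, V, slack):
    """Finite-n membership test: B > 0 and ln f <= slack (slack plays the role of o(n))."""
    value = log_f(a, M, b, V)
    return value is not None and value <= slack

rng = np.random.default_rng(0)
G = rng.standard_normal((3, 3))
M = G @ G.T + np.eye(3)      # hypothetical reference covariance
a = rng.standard_normal(3)   # hypothetical reference mean

# At (b, V) = (a, M) we get B = I, d = 0, K = 0 and hence f = 1.
log_f_self = log_f(a, M, a, M)

# A pair for which B = I + V^{-1} - M^{-1} is negative definite.
log_f_bad = log_f(np.zeros(2), 0.4 * np.eye(2), np.zeros(2), 10.0 * np.eye(2))
```

Working with $\ln f$ instead of $f$ avoids overflow of determinants for large $n$ and matches the logarithmic scale of the condition $f\leq e^{o(n)}$.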

The following theorem is the main result of the paper. It describes the sets $\mathcal{F}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ and $\mathcal{F}^{\rm LR}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ from (10) and (15), respectively.

Theorem 1. If assumptions (18), (19) hold, then as $n\to\infty$

$$\mathcal{F}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})=\mathcal{F}^{\rm LR}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})=\mathcal{F}_{0}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}),\tag{23}$$

where the equalities are understood in the sense of Remark 2.

Remark 3. Clearly, $(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})\in\mathcal{F}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$. Moreover, the sets $\mathcal{F}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ and $\mathcal{F}^{\rm LR}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ are convex in $\boldsymbol{b}_{n},\boldsymbol{V}_{\!n}$. Indeed, it is known [6, § 8.5, Theorem 4], [7, Theorem 7.6.7] that the function $f(\boldsymbol{A}_{n})=\ln|\boldsymbol{A}_{n}|$ is strictly concave on the convex set of positive definite symmetric matrices in $\mathbb{R}^{n}$. Therefore, the set $\mathcal{F}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ is convex, i.e., any matrices $\boldsymbol{V}_{\!n}^{(1)}\in\mathcal{F}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ and $\boldsymbol{V}_{\!n}^{(2)}\in\mathcal{F}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ satisfy the condition

$$a\boldsymbol{V}_{\!n}^{(1)}+(1-a)\boldsymbol{V}_{\!n}^{(2)}\in\mathcal{F}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}),\quad\text{for any } 0\leq a\leq 1.$$

In a sense, $\mathcal{F}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ is the set $\mathcal{F}_{0}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ enlarged by a “thin slice” whose width has the order of $o(n)$. In other words, $\mathcal{F}_{0}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ can be considered as a “core” of the set $\mathcal{F}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$.
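The concavity of $\ln|\boldsymbol{A}_{n}|$ cited in Remark 3 can be observed on a random example. This sketch, assuming NumPy, checks only that cited fact along one segment between two hypothetical covariance matrices; it is not a verification of the convexity of $\mathcal{F}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ itself.

```python
import numpy as np

def logdet(A):
    """Natural logarithm of |det A|."""
    return np.linalg.slogdet(A)[1]

rng = np.random.default_rng(0)

def random_cov(n):
    """Hypothetical helper: a random symmetric positive definite matrix."""
    G = rng.standard_normal((n, n))
    return G @ G.T + np.eye(n)

V1, V2 = random_cov(5), random_cov(5)

# Concavity gap: ln|t V1 + (1-t) V2| - (t ln|V1| + (1-t) ln|V2|) >= 0.
gaps = []
for t in np.linspace(0.0, 1.0, 11):
    mix = t * V1 + (1.0 - t) * V2
    gaps.append(logdet(mix) - (t * logdet(V1) + (1.0 - t) * logdet(V2)))
```

The gap is zero at the endpoints and strictly positive in between when the two matrices differ, as strict concavity requires.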

We also present the following simplifying consequence of Theorem 1. Without loss of generality, we may assume that the matrix $\boldsymbol{M}_{\!n}$ is diagonal (see Remark 1) with the eigenvalues $\{\lambda_{i}\}$ (all positive). We also restrict ourselves in (23) to diagonal matrices $\boldsymbol{V}_{\!n}$ with positive eigenvalues $\{\nu_{i}\}$. The matrix $\boldsymbol{B}_{n}=\boldsymbol{I}_{n}+\boldsymbol{V}_{\!n}^{-1}-\boldsymbol{M}_{\!n}^{-1}$ is then diagonal with the eigenvalues $\{\mu_{i}\}$:

$$\mu_{i}=1+\frac{1}{\nu_{i}}-\frac{1}{\lambda_{i}},\quad i=1,\ldots,n.\tag{24}$$

Then for $\boldsymbol{a}_{n}=(a_{1,n},\ldots,a_{n,n})$, $\boldsymbol{b}_{n}=(b_{1,n},\ldots,b_{n,n})$ we have from (21)

$$K=\sum_{i=1}^{n}\biggl[\frac{b_{i,n}^{2}}{\nu_{i}}-\frac{a_{i,n}^{2}}{\lambda_{i}}-\frac{1}{\mu_{i}}\Bigl(\frac{b_{i,n}}{\nu_{i}}-\frac{a_{i,n}}{\lambda_{i}}\Bigr)^{2}\biggr].\tag{25}$$
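The diagonal expressions (24), (25) can be checked against the general matrix definition (21). The sketch below assumes NumPy; the function names and the example values are hypothetical. It also evaluates $\ln f$ in the diagonal case, using the fact that the determinants in (20) become products of the diagonal entries.

```python
import numpy as np

def K_diag(a, lam, b, nu):
    """mu_i from (24) and K from (25) for M = diag(lam), V = diag(nu)."""
    mu = 1.0 + 1.0 / nu - 1.0 / lam
    K = np.sum(b**2 / nu - a**2 / lam - (b / nu - a / lam) ** 2 / mu)
    return mu, K

def K_matrix(a, M, b, V):
    """K from the general definition (21)."""
    Minv, Vinv = np.linalg.inv(M), np.linalg.inv(V)
    B = np.eye(a.size) + Vinv - Minv
    d = np.linalg.solve(B, Vinv @ b - Minv @ a)
    return b @ Vinv @ b - a @ Minv @ a - d @ B @ d

def log_f_diag(a, lam, b, nu):
    """ln f from (20) when all matrices are diagonal (requires all mu_i > 0)."""
    mu, K = K_diag(a, lam, b, nu)
    return np.sum(np.log(lam) - np.log(nu) - np.log(mu)) - K

# Hypothetical example values.
lam = np.array([2.0, 3.0, 1.5])
nu = np.array([1.8, 2.5, 1.2])
a = np.array([0.5, -0.2, 0.1])
b = np.array([0.4, 0.0, 0.3])

mu, K = K_diag(a, lam, b, nu)
K_full = K_matrix(a, np.diag(lam), b, np.diag(nu))
log_f_same = log_f_diag(a, lam, a, lam)  # equals 0 when (b, V) = (a, M)
```

In the diagonal case every quantity is a sum over coordinates, which makes the condition $f\leq e^{o(n)}$ easy to examine for structured sequences $\{\lambda_{i}\}$, $\{\nu_{i}\}$.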

Introduce the convex set $\mathcal{C}_{{\rm diag},n}$ of diagonal, positive definite matrices $\boldsymbol{V}_{\!n}$ :

[TABLE]

If $\boldsymbol{M}_{\!n},\boldsymbol{V}_{\!n}\in\mathcal{C}_{{\rm diag},n}$ , then the function $f_{\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}}(\boldsymbol{b}_{n},\boldsymbol{V}_{\!n})$ from (20) takes the form

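$$f_{\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}}^{(0)}(\boldsymbol{b}_{n},\boldsymbol{V}_{\!n})=e^{-K}\prod_{i=1}^{n}\frac{\lambda_{i}}{\nu_{i}\mu_{i}},\tag{26}$$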

where $\{\mu_{i}\}$ are defined in (24), and $K$ is defined in (25). It is supposed also, that $\mu_{i}>0$ , $i=1,\ldots,n$ .

For a sequence of pairs $(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ , $\boldsymbol{M}_{\!n}\in\mathcal{C}_{{\rm diag},n}$ , introduce the following set of pairs $(\boldsymbol{b}_{n},\boldsymbol{V}_{\!n})$ , $\boldsymbol{V}_{\!n}\in\mathcal{C}_{{\rm diag},n}$ :

[TABLE]

where the function $f_{\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}}^{(0)}(\boldsymbol{b}_{n},\boldsymbol{V}_{\!n})$ is defined in (26).

Then the following “inner bound” for $\mathcal{F}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ holds.

Theorem 2. If assumptions (18), (19) hold, then the set $\mathcal{F}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ contains the set $\mathcal{V}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$:

$$\mathcal{V}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})\subseteq\mathcal{F}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}),\tag{28}$$

where the set $\mathcal{V}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ is defined in (27).

The set $\mathcal{V}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ is convex in $\boldsymbol{V}_{\!n}$ (see Remark 3).

Further, in $\S\,2$ an auxiliary Theorem 3 is given. In $\S\,3$ Theorem 1 is proved, and in $\S\,4$ as examples some particular cases of the problem are considered.

2 Auxiliary Theorem

In models (1), (2) we first consider the testing of simple hypotheses: the pair $(\mathbf{0}_{n},{\boldsymbol{I}}_{n})$ versus a pair $(\boldsymbol{a}_{n},\boldsymbol{M}_{n})$ . Denote

[TABLE]

The next theorem is the main auxiliary result of this paper. Its proof follows the proof of Theorem 3 in [8]. A more general result is contained in [10].

Theorem 3. For the minimal possible $\beta(\alpha)$, $0<\alpha<1$, the following bounds are valid

[TABLE]

and

[TABLE]

where $\mu_{0}(\alpha,\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ is defined by the relation

[TABLE]

Note that both bounds (29) and (30) are purely analytical relations without any limiting operations. The lower bound (29) and the upper bound (30) are close to each other if the value $\mu_{0}(\alpha,\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ is much smaller than $D(\boldsymbol{I}_{n}\,\|\,\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ (which usually has the order of $n$).

The next result gives an upper bound of the order $n^{1/p}$, $p>1$, for the value $\mu_{0}(\alpha,\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ from (31). Its proof (see Appendix) follows the proof of Lemma 1 in [8].

Lemma 1. For $\mu_{0}(\alpha,\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ from (30) the upper bound holds (see (19))

[TABLE]

3 Proof of Theorem 1

Since $\mathcal{F}_{n}^{\rm LR}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})\subseteq\mathcal{F}_{n}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ , in order to prove Theorem 1 it is sufficient to get the “inner bound” for $\mathcal{F}_{n}^{\rm LR}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ , and then to get a similar “outer bound” for $\mathcal{F}_{n}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ .

3.1 “Inner bound” for $\mathcal{F}_{n}^{\rm LR}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$

We first estimate from above the value $\beta(\alpha,\boldsymbol{b}_{n},\boldsymbol{V}_{\!n})$ . For that purpose in the model (2) we consider the testing of the simple hypothesis $(\mathbf{0}_{n},{\boldsymbol{I}}_{n})$ against the simple alternative $(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ , when $\boldsymbol{a}_{n}$ is known. We use the optimal LR-test with the decision region $\mathcal{D}_{\rm LR}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n},\alpha)=\mathcal{A}_{\mu_{0}}$ in favor of $(\mathbf{0}_{n},{\boldsymbol{I}}_{n})$ (see (13), (14)), where $\mu_{0}=\mu_{0}(\alpha,\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})>0$ is defined in (31). Let us consider another pair $(\boldsymbol{b}_{n},\boldsymbol{V}_{\!n})$ , and evaluate the 2nd-kind error probability $\beta(\alpha,\boldsymbol{b}_{n},\boldsymbol{V}_{\!n})$ , provided the decision region $\mathcal{A}_{\mu_{0}}$ is used. Then

[TABLE]

where $0\leq\mu_{1}\leq\mu_{0}(\alpha,\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ . Due to the assumption (19) and the estimate (32), we have

[TABLE]

Therefore, if

[TABLE]

then by (33)–(35) as $n\to\infty$

[TABLE]

3.2 “Outer bound” for $\mathcal{F}_{n}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$

Now, we get a similar lower bound for $\beta(\alpha,\boldsymbol{b}_{n},\boldsymbol{V}_{\!n})$ . Consider first the testing of the simple hypothesis $(\mathbf{0}_{n},{\boldsymbol{I}}_{n})$ against the simple alternative $(\boldsymbol{a}_{n},\boldsymbol{M}_{n})$ . We use the optimal LR-test with the decision region $\mathcal{D}_{\rm LR}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n},\alpha)=\mathcal{A}_{\mu_{0}}$ in favor of $(\mathbf{0}_{n},{\boldsymbol{I}}_{n})$ (see (13), (14)). Then, denoting $p=p_{\boldsymbol{I}_{n}}$ and $q=p_{\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}}$ , we have for error probabilities

[TABLE]

Consider another pair $(\boldsymbol{b}_{n},\boldsymbol{V}_{\!n})$. Let $\mathcal{D}\subseteq\mathbb{R}^{n}$ be a decision region in favor of $(\mathbf{0}_{n},\boldsymbol{I}_{n})$, and let $\beta_{\boldsymbol{b}_{n},\boldsymbol{V}_{\!n}}(\mathcal{D})$ and $\alpha=\alpha(\mathcal{D})$ be the corresponding error probabilities. Then, denoting $q_{1}=p_{\boldsymbol{b}_{n},\boldsymbol{V}_{\!n}}$, we need to have for the 2nd-kind error probability $\beta_{\boldsymbol{b}_{n},\boldsymbol{V}_{\!n}}(\mathcal{D})$ (see (37))

[TABLE]

For some $\delta$ , $0\leq\delta\leq 1$ , consider also the probability density

[TABLE]

and the corresponding value $\beta_{\delta}$ for it:

[TABLE]

We have by (38) and (40)

[TABLE]

Note that the probability density $q_{\delta}(\boldsymbol{x})$ corresponds to the Bayes problem statement in which the alternative hypothesis $\mathcal{H}_{1}$ coincides with $(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ with probability $1-\delta$ and with $(\boldsymbol{b}_{n},\boldsymbol{V}_{\!n})$ with probability $\delta$. The value $\beta_{\delta}$ is the corresponding 2nd-kind error probability.

We bound the value $\beta_{\delta}$ from below. First we have

[TABLE]

For the last term in the right-hand side of (42) we have

[TABLE]

Therefore we get

[TABLE]

Consider the value $D(p(\boldsymbol{x})\,\|\,q_{\delta}(\boldsymbol{x}))$ in the right-hand side of (43). Denoting

[TABLE]

we have by (39) and (44)

[TABLE]

Therefore

[TABLE]

where

[TABLE]

Therefore, by (41), (45) and (46) we need to have

[TABLE]

Note, that since $\ln\operatorname{\mathbf{E}}\nolimits\xi\geq\operatorname{\mathbf{E}}\nolimits\ln\xi$ , then we have from (46)

[TABLE]

Therefore, in order to have (47) fulfilled, we need to have

[TABLE]

Since $\int p(\boldsymbol{x})\,d\boldsymbol{x}=1$, the relation (48) is equivalent to the condition

[TABLE]

Note, that

[TABLE]

Then, in order to have (49) fulfilled, we need, at least,

[TABLE]

Setting $\delta\downarrow 0$ , we get from (50) the necessary condition

[TABLE]

which gives the “outer bound” for $\mathcal{F}_{n}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ (see (23)).

Note that the “inner bound” (35), (36) for $\mathcal{F}_{n}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ coincides with (51). Therefore, in order to finish the proof of Theorem 1 it remains to express the condition (51) analytically via the matrices $\boldsymbol{M}_{\!n},\boldsymbol{V}_{\!n}$ and the means $\boldsymbol{a}_{n},\boldsymbol{b}_{n}$. For that purpose we use the following result.

Lemma 2. If $\boldsymbol{I}_{n}+\boldsymbol{V}_{\!n}^{-1}-\boldsymbol{M}_{\!n}^{-1}>\mathbf{0}$, then the following formula holds (see (20)–(22))

[TABLE]

where the function $f_{\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}}(\boldsymbol{b}_{n},\boldsymbol{V}_{\!n})$ is defined in (20).

If the matrix $\boldsymbol{I}_{n}+\boldsymbol{V}_{\!n}^{-1}-\boldsymbol{M}_{\!n}^{-1}$ is not positive definite, then

[TABLE]
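The dichotomy in Lemma 2 depends only on whether the matrix $\boldsymbol{I}_{n}+\boldsymbol{V}_{\!n}^{-1}-\boldsymbol{M}_{\!n}^{-1}$ is positive definite. A minimal way to test this numerically for small matrices is sketched below, using a Gauss–Jordan inverse and an attempted Cholesky factorization. All function names (`inverse`, `is_positive_definite`, `lemma2_condition`) and the example matrices are hypothetical; the code is an illustration, not part of the paper's argument.

```python
import math


def inverse(a):
    """Gauss-Jordan inverse of a small square matrix given as a list of rows."""
    n = len(a)
    aug = [[float(x) for x in row] + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [x / p for x in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [x - f * y for x, y in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def is_positive_definite(s):
    """Attempt a Cholesky factorization of the symmetric matrix s."""
    n = len(s)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            acc = s[i][j] - sum(low[i][k] * low[j][k] for k in range(j))
            if i == j:
                if acc <= 0.0:
                    return False  # a non-positive pivot means s is not > 0
                low[i][i] = math.sqrt(acc)
            else:
                low[i][j] = acc / low[j][j]
    return True


def lemma2_condition(m, v):
    """Check whether I + V^{-1} - M^{-1} is positive definite."""
    n = len(m)
    m_inv, v_inv = inverse(m), inverse(v)
    b = [[(1.0 if i == j else 0.0) + v_inv[i][j] - m_inv[i][j] for j in range(n)]
         for i in range(n)]
    return is_positive_definite(b)
```

For instance, with $\boldsymbol{M}=0.4\,\boldsymbol{I}_{2}$ and $\boldsymbol{V}=2\,\boldsymbol{I}_{2}$ the matrix equals $(1+0.5-2.5)\boldsymbol{I}_{2}=-\boldsymbol{I}_{2}$, so the condition fails.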

Proof. Denoting

[TABLE]

we get by (11)

[TABLE]

Note that (see (54))

[TABLE]

where (see also (21))

[TABLE]

Therefore, we can continue (54) as follows:

[TABLE]

Consider the integral in the right-hand side of (55). If $\boldsymbol{B}_{n}>\mathbf{0}$ , then [6, § 6.9, Theorem 3]

[TABLE]

Otherwise

[TABLE]
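The two cases above reflect the classical Gaussian integral: for a symmetric $\boldsymbol{B}>\mathbf{0}$ one has $\int\exp\{-(\boldsymbol{x},\boldsymbol{B}\boldsymbol{x})/2+(\boldsymbol{c},\boldsymbol{x})\}\,d\boldsymbol{x}=(2\pi)^{n/2}|\boldsymbol{B}|^{-1/2}\exp\{(\boldsymbol{c},\boldsymbol{B}^{-1}\boldsymbol{c})/2\}$, while the integral diverges if $\boldsymbol{B}$ is not positive definite. The exact normalization used in (56) may differ from this form. The sketch below checks the scalar case $n=1$ against a midpoint-rule quadrature; the function names and parameter values are assumptions introduced for illustration.

```python
import math


def gaussian_integral_closed_form(b, c):
    """Integral of exp(-b x^2 / 2 + c x) over the real line (infinite if b <= 0)."""
    if b <= 0.0:
        return math.inf  # the integrand does not decay, so the integral diverges
    return math.sqrt(2.0 * math.pi / b) * math.exp(c * c / (2.0 * b))


def gaussian_integral_numeric(b, c, half_width=40.0, steps=100000):
    """Midpoint-rule approximation over [-half_width, half_width]."""
    h = 2.0 * half_width / steps
    total = 0.0
    for k in range(steps):
        x = -half_width + (k + 0.5) * h
        total += math.exp(-0.5 * b * x * x + c * x)
    return total * h


if __name__ == "__main__":
    print(gaussian_integral_closed_form(1.5, 0.7), gaussian_integral_numeric(1.5, 0.7))
    # For b < 0 the truncated integral keeps growing as the window widens.
    print(gaussian_integral_numeric(-0.1, 0.0, half_width=10.0),
          gaussian_integral_numeric(-0.1, 0.0, half_width=20.0))
```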

Assume first $\boldsymbol{B}_{n}=\boldsymbol{I}_{n}+\boldsymbol{V}_{\!n}^{-1}-\boldsymbol{M}_{\!n}^{-1}>\mathbf{0}$ , i.e., the matrix $\boldsymbol{B}_{n}$ is positive definite. Then, by (55), (56) we get

[TABLE]

If the matrix $\boldsymbol{B}_{n}=\boldsymbol{I}_{n}+\boldsymbol{V}_{\!n}^{-1}-\boldsymbol{M}_{\!n}^{-1}$ is not positive definite, then by (57)

[TABLE]

and therefore the condition (51) cannot be satisfied. Lemma 2 follows from (58) and (59). ∎

We continue the proof of Theorem 1. Define $\mathcal{F}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ as the maximal set satisfying the condition

[TABLE]

This set coincides with the one defined in (22). Therefore, Theorem 1 follows from (35), (51), (52) and (60).

4 Examples. Particular cases

4.1 Known mean $\boldsymbol{a}_{n}$ and known covariance matrix $\boldsymbol{M}_{\!n}$

We first consider the simplest case of a known mean $\boldsymbol{a}_{n}=(a_{1},\ldots,a_{n})$ and a known matrix $\boldsymbol{M}_{\!n}$, and apply Theorem 3. This will allow us to estimate the rate of convergence in Theorem 1. Without loss of generality, we may assume in model (2) that the covariance matrix $\boldsymbol{M}_{\!n}$ is diagonal with positive eigenvalues $\lambda_{1},\ldots,\lambda_{n}$ (see Remark 1). Then (see (17))

[TABLE]

By (29), (30) we get for $D=D(\mathbf{P}_{\boldsymbol{I}_{n}}\,\|\,\mathbf{Q}_{\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}})$

[TABLE]

where $\mu_{0}(\alpha,\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ is estimated in (32).
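For orientation, the Kullback–Leibler divergence between $\mathcal{N}(\mathbf{0},\boldsymbol{I}_{n})$ and $\mathcal{N}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ has the standard closed form $\frac{1}{2}\bigl[\ln|\boldsymbol{M}_{\!n}|+\operatorname{tr}\boldsymbol{M}_{\!n}^{-1}+(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}^{-1}\boldsymbol{a}_{n})-n\bigr]$, which for diagonal $\boldsymbol{M}_{\!n}$ becomes $\frac{1}{2}\sum_{i}\bigl[\ln\lambda_{i}+(1+a_{i}^{2})/\lambda_{i}-1\bigr]$. This is presumably the expression referred to via (17), though the paper's notation may differ. The sketch below evaluates the diagonal formula and cross-checks the scalar case by quadrature; the function names are hypothetical.

```python
import math


def kl_standard_vs_gaussian(a, lam):
    """D(N(0, I) || N(a, diag(lam))) for diagonal covariance entries lam[i] > 0."""
    return 0.5 * sum(math.log(l) + (1.0 + ai * ai) / l - 1.0 for ai, l in zip(a, lam))


def kl_numeric_1d(a, lam, half_width=12.0, steps=100000):
    """Quadrature check of the scalar case: integral of p(x) * ln(p(x) / q(x))."""
    h = 2.0 * half_width / steps
    total = 0.0
    for k in range(steps):
        x = -half_width + (k + 0.5) * h
        p = math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)
        # ln p(x) - ln q(x) for p = N(0, 1) and q = N(a, lam)
        log_ratio = -0.5 * x * x + (x - a) ** 2 / (2.0 * lam) + 0.5 * math.log(lam)
        total += p * log_ratio
    return total * h


if __name__ == "__main__":
    print(kl_standard_vs_gaussian([0.8], [1.7]), kl_numeric_1d(0.8, 1.7))
```

The divergence vanishes exactly when $\boldsymbol{a}_{n}=\mathbf{0}$ and all $\lambda_{i}=1$, i.e., when the two hypotheses coincide.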

To obtain an estimate of $\mu_{0}(\alpha,\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})$ simpler than (32), we additionally assume that the following condition is satisfied:

III. There exists $C>0$ such that

[TABLE]

Then, by the Chebyshev inequality, we have

[TABLE]

For the right-hand side of (64) not to exceed $\alpha$, it is sufficient to set

[TABLE]

and then (62) takes the form

[TABLE]

which estimates the rate of convergence in (62).
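The mechanism behind this choice is the generic Chebyshev step: since $\mathbf{P}(|X-\mathbf{E}X|\geq t)\leq\operatorname{Var}X/t^{2}$, taking $t=\sqrt{\operatorname{Var}X/\alpha}$ makes the bound equal to $\alpha$. The sketch below illustrates this for a centred quadratic form $\sum_{i}c_{i}(\xi_{i}^{2}-1)$ with $\xi_{i}\sim\mathcal{N}(0,1)$, whose variance is $2\sum_{i}c_{i}^{2}$. This statistic, the coefficients, and the function names are assumptions chosen for illustration and are not the paper's exact quantities from (63)–(65).

```python
import math
import random


def chebyshev_threshold(variance, alpha):
    """Smallest t for which the Chebyshev bound variance / t^2 is at most alpha."""
    return math.sqrt(variance / alpha)


def empirical_tail(coeffs, t, trials=20000, seed=0):
    """Monte Carlo estimate of P(|sum_i c_i (xi_i^2 - 1)| >= t), xi_i ~ N(0, 1)."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        s = sum(c * (rng.gauss(0.0, 1.0) ** 2 - 1.0) for c in coeffs)
        if abs(s) >= t:
            hits += 1
    return hits / trials


if __name__ == "__main__":
    coeffs = [0.5, -0.3, 0.2, 0.1]                 # hypothetical weights
    variance = 2.0 * sum(c * c for c in coeffs)    # Var of the centred quadratic form
    t = chebyshev_threshold(variance, alpha=0.05)
    print(t, empirical_tail(coeffs, t))
```

The Chebyshev bound is typically loose, so the empirical tail frequency should fall well below the target level $\alpha$.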

Note also that similarly to (74), (75) we can get

[TABLE]

Therefore, condition III is equivalent to the inequality (see (61) and (65))

[TABLE]

Remark 4. The assumption (63) is fulfilled, for example, in the natural “regular” case, when the elements $\boldsymbol{a}_{n+1}$, $\boldsymbol{M}_{n+1}$ are “continuations” of the elements $\boldsymbol{a}_{n}$, $\boldsymbol{M}_{n}$.

4.2 Unknown mean $\boldsymbol{a}_{n}$ and known covariance matrix $\boldsymbol{M}_{\!n}$

Consider the case of model (2) in which the covariance matrix $\boldsymbol{M}_{\!n}$ is known but the mean $\boldsymbol{a}_{n}$ is not. Without loss of generality, we may assume that the covariance matrix $\boldsymbol{M}_{\!n}$ is diagonal with positive eigenvalues $\lambda_{1},\ldots,\lambda_{n}$ (see Remark 1). Then the function $f_{\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}}(\boldsymbol{b}_{n},\boldsymbol{M}_{n})$ from (20) takes the form

[TABLE]

where for $\boldsymbol{a}_{n}=(a_{1,n},\ldots,a_{n,n})$ and $\boldsymbol{b}_{n}=(b_{1,n},\ldots,b_{n,n})$ we have

[TABLE]

The corresponding maximal set $\mathcal{F}_{1}(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n})=\{\boldsymbol{b}_{n}\}$ in that case takes the form (see (22))

[TABLE]

where the function $K=K(\boldsymbol{a}_{n},\boldsymbol{M}_{\!n},\boldsymbol{b}_{n})$ is defined in (66).

Note that if $\boldsymbol{M}_{\!n}=\boldsymbol{I}_{n}$ (i.e., when the hypotheses differ only in their means), formulas (66) and (67) take an especially simple form:

[TABLE]

These results also follow from [11, 12] (where the problem was considered in Hilbert and Banach spaces).

4.3 Known mean $\boldsymbol{a}_{n}$ and unknown covariance matrix $\boldsymbol{M}_{\!n}$

We limit ourselves to the case $\boldsymbol{a}_{n}=\mathbf{0}_{n}$ . Then the function $f_{\boldsymbol{a}_{n},\boldsymbol{M}_{\!n}}(\boldsymbol{b}_{n},\boldsymbol{V}_{\!n})$ from (20) for $\boldsymbol{a}_{n}=\boldsymbol{b}_{n}=\mathbf{0}_{n}$ takes the form

[TABLE]

The corresponding maximal set $\mathcal{F}_{1}(\mathbf{0}_{n},\boldsymbol{M}_{\!n})=\{\boldsymbol{V}_{\!n}\}$ in that case takes the form (see (22))

[TABLE]

Formulas (69), (70) coincide with the corresponding results in [8, Theorem 1].

Proof of Lemma 1

Let $\boldsymbol{\xi}_{n}$ be a Gaussian random vector with distribution $\boldsymbol{\xi}_{n}\sim{\mathcal{N}}(\boldsymbol{0},\boldsymbol{I}_{n})$, and let $\boldsymbol{A}_{n}$ be a symmetric $(n\times n)$-matrix with eigenvalues $\{a_{i}\}$. Consider the quadratic form $(\boldsymbol{\xi}_{n},\boldsymbol{A}_{n}\boldsymbol{\xi}_{n})$. There exists an orthogonal matrix $\boldsymbol{T}_{\!n}$ such that $\boldsymbol{T}_{\!n}^{\prime}\boldsymbol{A}_{n}\boldsymbol{T}_{\!n}=\boldsymbol{B}_{n}$, where $\boldsymbol{B}_{n}$ is the diagonal matrix with diagonal elements $\{a_{i}\}$ [6, § 4.7]. Since $\boldsymbol{T}_{\!n}\boldsymbol{\xi}_{n}\sim{\mathcal{N}}(\boldsymbol{0},\boldsymbol{I}_{n})$, the quadratic forms $(\boldsymbol{\xi}_{n},\boldsymbol{A}_{n}\boldsymbol{\xi}_{n})$ and $(\boldsymbol{\xi}_{n},\boldsymbol{B}_{n}\boldsymbol{\xi}_{n})$ have the same distribution. Therefore, by formula (12) we have

[TABLE]

where

[TABLE]
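The diagonalization step at the start of this proof can be made explicit for $n=2$, where the orthogonal matrix $\boldsymbol{T}$ is a plane rotation with angle $\theta=\frac{1}{2}\operatorname{atan2}(2b,\,a-d)$ for $\boldsymbol{A}=\begin{pmatrix}a&b\\ b&d\end{pmatrix}$. The sketch below is a hypothetical helper (`diagonalize_2x2` is not a name from the paper) that returns $\boldsymbol{T}$ and $\boldsymbol{T}^{\prime}\boldsymbol{A}\boldsymbol{T}$, whose diagonal carries the eigenvalues $\{a_{i}\}$.

```python
import math


def diagonalize_2x2(a, b, d):
    """Rotation T with T' A T diagonal for the symmetric matrix A = [[a, b], [b, d]]."""
    theta = 0.5 * math.atan2(2.0 * b, a - d)
    c, s = math.cos(theta), math.sin(theta)
    t = [[c, -s], [s, c]]
    a_mat = [[a, b], [b, d]]
    # Compute A T first, then T' (A T), entry by entry.
    at = [[sum(a_mat[i][k] * t[k][j] for k in range(2)) for j in range(2)]
          for i in range(2)]
    tat = [[sum(t[k][i] * at[k][j] for k in range(2)) for j in range(2)]
           for i in range(2)]
    return t, tat


if __name__ == "__main__":
    t, tat = diagonalize_2x2(2.0, 1.0, 3.0)
    print(tat)  # off-diagonal entries vanish up to rounding
```

Because $\boldsymbol{T}$ is orthogonal, the trace is preserved, so the diagonal entries of $\boldsymbol{T}^{\prime}\boldsymbol{A}\boldsymbol{T}$ sum to $a+d$.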

Introduce the value (see (31))

[TABLE]

Then by (71), (72) and (17) we have for $\alpha_{\mu}$ from (73)

[TABLE]

where

[TABLE]

To estimate the value $P_{1}$ in (75), we use the following result [13, Ch. III.5.15]: let $\zeta_{1},\ldots,\zeta_{n}$ be independent random variables with $\mathbf{E}\,\zeta_{i}=0$, $i=1,\ldots,n$. Then for any $1\leq p\leq 2$

[TABLE]

Therefore, applying the Chebyshev inequality to $P_{1}$ and using (76), we get

[TABLE]

To estimate the value $P_{2}$ in (74), (75), note that

[TABLE]

and then

[TABLE]

Therefore, using the standard bound

[TABLE]

we get (where $\xi_{i}\sim\mathcal{N}(0,1)$)

[TABLE]

To satisfy the condition $\alpha_{\mu}\leq\alpha$, we choose $\mu$ such that $\max\{P_{1},P_{2}\}\leq\alpha/2$. Then, by (77) and (78), it is sufficient to choose $\mu$ satisfying (32).

FUNDING

Supported in part by the Russian Foundation for Basic Research, project no. 19-01-00364.

Bibliography

[1] Wald, A., Statistical Decision Functions, New York: Wiley, 1950. Translated under the title Statisticheskie reshayushchie funktsii, in Pozitsionnye igry (Positional Games), Moscow: Nauka, 1967, pp. 300–522.
[2] Lehmann, E.L., Testing Statistical Hypotheses, New York: Wiley, 1959. Translated under the title Proverka statisticheskikh gipotez, Moscow: Nauka, 1979.
[3] Poor, H.V., An Introduction to Signal Detection and Estimation, New York: Springer-Verlag, 1994, 2nd ed.
[4] Zhang, W. and Poor, H.V., On Minimax Robust Detection of Stationary Gaussian Signals in White Gaussian Noise, IEEE Trans. Inform. Theory, 2011, vol. 57, no. 6, pp. 3915–3924.
[5] Burnashev, M.V., On Detection of Gaussian Stochastic Sequences, Probl. Peredachi Inf., 2017, vol. 53, no. 4, pp. 49–68 [Probl. Inf. Transm. (Engl. Transl.), 2017, vol. 53, no. 4, pp. 349–367].
[6] Bellman, R., Introduction to Matrix Analysis, New York: McGraw-Hill, 1960. Translated under the title Vvedenie v teoriyu matrits, Moscow: Nauka, 1976.
[7] Horn, R.A. and Johnson, C.R., Matrix Analysis, Cambridge: Cambridge Univ. Press, 1985. Translated under the title Matrichnyi analiz, Moscow: Mir, 1989.
[8] Burnashev, M.V., On Minimax Detection of Gaussian Stochastic Sequences and Gaussian Stationary Signals, Probl. Peredachi Inf., 2021, vol. 57, no. 3, pp. 55–72 [Probl. Inf. Transm. (Engl. Transl.), 2021, vol. 57, no. 3, pp. 248–264].
