First- and Second-Order Hypothesis Testing for Mixed Memoryless Sources with General Mixture
Te Sun Han, Ryo Nomura

TL;DR
This paper derives the optimal exponents for first- and second-order hypothesis testing between mixed memoryless sources, extending to more general sources and relating to compound hypothesis testing.
Contribution
It provides the first- and second-order epsilon-exponents for hypothesis testing involving mixed memoryless sources, generalizing previous results and connecting to broader hypothesis testing frameworks.
Findings
Derived second-order epsilon-exponent for mixed null and stationary alternative hypotheses.
Extended results to cases where both hypotheses are mixed memoryless sources.
Discussed implications for general source and compound hypothesis testing.
Abstract
The first- and second-order optimum achievable exponents in the simple hypothesis testing problem are investigated. The optimum achievable exponent for type II error probability, under the constraint that the type I error probability is allowed asymptotically up to epsilon, is called the epsilon-optimum exponent. In this paper, we first give the second-order epsilon-exponent in the case where the null hypothesis and the alternative hypothesis are a mixed memoryless source and a stationary memoryless source, respectively. We next generalize this setting to the case where the alternative hypothesis is also a mixed memoryless source. We address the first-order epsilon-optimum exponent in this setting. In addition, an extension of our results to more general setting such as the hypothesis testing with mixed general source and the relationship with the general compound hypothesis testing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
First- and Second-Order Hypothesis Testing
for Mixed Memoryless Sources with
General Mixture
Te Sun Han, and Ryo Nomura, T. S. Han is with the National Institute of Information and Communications Technology (NICT), Tokyo, Japan, e-mail: [email protected] R. Nomura is with the School of Network and Information, Senshu University, Kanagawa, Japan, e-mail: [email protected]
Abstract
The first- and second-order optimum achievable exponents in the simple hypothesis testing problem are investigated. The optimum achievable exponent for type II error probability, under the constraint that the type I error probability is allowed asymptotically up to , is called the -optimum exponent. In this paper, we first give the second-order -exponent in the case where the null hypothesis and the alternative hypothesis are a mixed memoryless source and a stationary memoryless source, respectively. We next generalize this setting to the case where the alternative hypothesis is also a mixed memoryless source. We address the first-order -optimum exponent in this setting. In addition, an extension of our results to more general setting such as the hypothesis testing with mixed general source and the relationship with the general compound hypothesis testing problem are also discussed.
Index Terms:
General Source, Hypothesis Testing, Information Spectrum, Mixed Source, Optimum Exponent
I Introduction
Let and be two general sources, where we use the term of general source to denote a sequence of random variables (resp. ) indexed by block length , where each component of (resp. ) may vary depending on .
We consider the hypothesis testing problem with null hypothesis , alternative hypothesis and acceptance region . The probabilities of type I error and type II error are defined, respectively, as
[TABLE]
In this paper, we focus mainly on how to determine the -optimum exponent, defined as the supremum of achievable exponents for the type II error probability under the constraint that the type I error probability is allowed asymptotically up to a constant . The fundamental result in this setting is so-called Stein’s lemma [1], which gives the -optimum exponent in the case where the null and alternative hypotheses are stationary memoryless sources. The lemma shows that the -optimum exponent is given by , the divergence between stationary memoryless sources and . Chen [2] has generalized this lemma to the case where both of and are general sources, and established the general formula of -optimum exponent in terms of divergence spectra. The -optimum exponent derived by him is called in this paper the first-order -optimum exponent.
On the other hand, achievable rates called those of the second-order have been investigated in several contexts in information theory [3, 4, 5, 6, 7, 8] in order to investigate finer asymptotic behaviors of the information-theoretic quantities. Strassen [3] has first introduced the notion of the -optimum achievable exponent of the second-order in the hypothesis testing problem, called the second-order -optimum exponent, and derived the second-order -optimum exponent in the case where and are stationary memoryless sources. Han [9] has demonstrated the general formula (though not single-letterized) of the second-order -optimum exponent. The results in [3] and [9] also have revealed that the asymptotic normality of the divergence density rate (or the likelihood ratio rate) plays an important role also in computing the second-order -optimum exponent.
In this paper, on the other hand, we consider the hypothesis testing for mixed memoryless sources in which the asymptotic normality of divergence density rate does not hold. The class of mixed sources is quite important, because all of stationary sources can be regarded as forming mixed sources consisting of stationary ergodic sources. Therefore, the analysis for mixed sources is insightful and so we first focus on the case with mixed memoryless source . In this direction, Han [10] has first derived the formula for the first-order -optimum exponent in the case with mixed memoryless source and stationary memoryless source . One of our first main results is to establish the second-order -optimum exponent in the same setting by invoking the relevant asymptotic normality. The result is a substantial generalization of that of Strassen [3]. Second, we generalize this setting to the case where both of null and alternative hypotheses are mixed memoryless , to establish the first-order -optimum exponent.
It should be emphasized that our results described here are valid for mixed memoryless sources with general mixture in the sense that the mixing weights may be arbitrary probability measures. For the case of mixed memoryless sources (or also mixed geneal sources) with discrete mixture, we can reveal the deep relationship with the compound hypothesis testing problem. We notice that the compound hypothesis testing problem is important from both theoretical and practical points of view. We show that first-order [math]-optimum (resp. exponentially -optimum) exponents for the mixed general hypothesis testing coincide with those for the [math]-optimum (resp. exponentially -optimum) exponents in the compound general hypothesis testing.
The present paper is organized as follows. In Section II, we define the problem setting and review the general formulas for the first- and second-order -optimum exponents. In Section III, we establish the second-order -optimum exponents in the case with mixed memoryless source and stationary memoryless source . In Section IV, we consider the case where both of null and alternative hypotheses are mixed memoryless sources, and derive the first-order -optimum exponent. Section V is devoted to an extension of mixed memoryless sources to mixed general sources. Finally, in Section VI we define the optimum exponent for the compound general hypothesis testing problem and discuss the relevant relationship with the hypothesis testing with mixed general sources. We conclude the paper in Section VII.
II General formulas for -hypothesis testing
We first review the first-order general formula and derive the second-order general formula. Throughout in this paper, the following lemmas play the important role and we use the notation that indicates the probability distribution of random variable .
Lemma II.1** ([10, Lemma 4.1.1])**
For any , define the acceptance region as
[TABLE]
then, it holds that
[TABLE]
Lemma II.2** ([10, Lemma 4.1.2])**
For any and any , it holds that
[TABLE]
Although the proof of these lemmas is simple and found in [10], we record it in Appendix A for self-containedness.
We define the first and second-order -optimum exponents as follows.
Definition II.1
Rate is said to be -achievable, if there exists an acceptance region such that
[TABLE]
Definition II.2** (First-order -optimum exponent)**
[TABLE]
The right-hand side of (2.1) specifies the asymptotic behavior of the form . Chen [2] has derived the general formula for :
Theorem II.1** (Chen [2])**
[TABLE]
where
[TABLE]
Proof:
The proof is found in [10] and similar to that of Theorem II.2 below. ∎
Definition II.3
Rate is said to be -achievable, if there exists an acceptance region such that
[TABLE]
Definition II.4** (Second-order -optimum exponent)**
[TABLE]
The right-hand side of (2.4) specifies the asymptotic behavior of the form . Han [9] has derived the general formula for :
Theorem II.2** (Han [9])**
[TABLE]
where
[TABLE]
Proof:
The proof consists of two parts.
- Direct Part:
Set . Then, we show that is -achievable for .
Define the acceptance region as
[TABLE]
Then, from Lemma II.1 with we have the upper bound for the type II error probability :
[TABLE]
from which it follows that
[TABLE]
We next evaluate the type I error probability . Noting that
[TABLE]
we have
[TABLE]
because by the definition. Hence, from (2.7) and (2.8), is -achievable. Since is arbitrary, the direct part has been proved.
- Converse Part:
Suppose that is -achievable. Then, there exists an acceptance region such that
[TABLE]
We fix this acceptance region . The second inequality means that for any
[TABLE]
holds for sufficiently large . On the other hand, from Lemma II.2 with it holds that
[TABLE]
Substituting (2.10) into this inequality, we have
[TABLE]
for sufficiently large . Thus, we have
[TABLE]
Here, from (2.9) we have
[TABLE]
which means that
[TABLE]
Since is arbitrarily, the proof of the converse part has been completed. ∎
III Mixed memoryless sources
III-A First-order -optimum exponent
In the previous section, we have reviewed the formula for general hypothesis testings. In this and subsequent sections, we consider special but insightful cases and compute the optimum exponents in the single-letterized form. Let be an arbitrary probability space with general probability measure . Then, the hypothesis testing problem to be considered in this section is stated as follows.
- •
The null hypothesis is a mixed stationary memoryless source , that is, for
[TABLE]
where is a stationary memoryless source for each and
[TABLE]
with generic random variable taking values in .
- •
The alternative hypothesis is a stationary memoryless source with generic random variable taking values in , that is,
[TABLE]
We assume to be a finite alphabet hereafter. In order to treat this special case, first we introduce an expurgated parameter set on the basis of types, where the type of sequence is the empirical distribution of , that is, with the number of such that .
Let denote all possible types of sequences of length . Then, it is well-known that
[TABLE]
Now for each , we define the set
[TABLE]
Since is an i.i.d. source for each , the set depends only on the type of sequence , and therefore, we may write instead of . Moreover, we define the set
[TABLE]
Then, we have the following lemma:
Lemma III.1
Let denote a mixed memoryless source defined in (3.1), then we have
[TABLE]
Proof:
Since is the expectation of with respect to , Markov’s inequality guarantees that
[TABLE]
from which, together with (3.2), it follows that
[TABLE]
∎
Next, we introduce two lemmas.
Lemma III.2** (Upper Decomposition Lemma)**
Let be a mixed memoryless source and be an arbitrary general source. Then, for any and any real it holds that
[TABLE]
Proof:
Since holds for , we have
[TABLE]
for any . By using this inequality with instead of , we have
[TABLE]
which completes the proof. ∎
Lemma III.3** (Lower Decomposition Lemma)**
Let be a mixed memoryless source and be an arbitrary general source. Then, for any , and it holds that
[TABLE]
Proof:
Setting , we define a set
[TABLE]
for . Then, it holds that
[TABLE]
Thus, for any real number it holds that
[TABLE]
Hence, we obtain the inequality
[TABLE]
from which with instead of it follows that
[TABLE]
for all . This completes the proof. ∎
The first-order -optimum exponent has been derived by using these three lemmas.
Theorem III.1** (First-order -optimum exponent: Han [10])**
For ,
[TABLE]
where denotes the divergence between and .
Proof:
As for the proof, see [10]. ∎
Remark III.1
If is a singleton, the above formula reduces to
[TABLE]
which is nothing but Stein’s lemma [1]. ∎
Remark III.2
* can be expressed also as*
[TABLE]
This can be verified as follows. Set
[TABLE]
Then, clearly . Here, we assume that to show a contradiction. From the assumption, there exists a constant satisfying . On the other hand, from the definition of , for any
[TABLE]
holds. Thus, setting leads to
[TABLE]
which is a contradiction, where the last inequality is due to the definition of . ∎
III-B Second-order -optimum exponent
Next, we derive the second-order -optimum exponent for mixed sources.
Theorem III.2** (Second-order -optimum exponent: Han [9])**
For ,
[TABLE]
where
[TABLE]
[TABLE]
[TABLE]
Remark III.3
If is a singleton , Theorem III.2 reduces to for , which is originally due to Strassen [3].
Proof:
Setting
[TABLE]
it suffices, in view of Theorem II.2, to show two inequalities:
[TABLE]
[TABLE]
Proof of (3.9):
By the definitions of and , it holds that
[TABLE]
where the second equality and the second inequality are due to Lemma III.1 and Lemma III.2, respectively, and the last inequality is from Fatou’s lemma.
Here, we define three sets:
[TABLE]
Noting that, setting ,
[TABLE]
gives the arithmetic average of i.i.d. variables with expectation
[TABLE]
Then, the weak law of large numbers yields that for
[TABLE]
Moreover, for the central limit theorem leads to
[TABLE]
Summarizing these equalities, we obtain
[TABLE]
Plugging (3.20) into (3.11) yields (3.9).
Proof of (3.10):
By definitions of and , and Lemma III.3 with , it holds that
[TABLE]
for any . We also partition the parameter space into three sets as in (3.15) in the proof of (3.9).
Then, similarly to the derivation of (3.18) and (3.19), we obtain
[TABLE]
Thus, the right-hand side of (3.21) is rewritten as
[TABLE]
Substituting (3.23) into (3.21) and noting that is arbitrary, we obtain (3.10). ∎
Remark III.4
From Theorem III.1 with , it is not difficult to verify that
[TABLE]
and
[TABLE]
Here, let us consider the following canonical equation for
[TABLE]
In view of (3.24) and (3.25), this equation always has a solution . It should be noted that if holds, the solution is not unique and so . By using the solution , it is not difficult to check that Theorem III.2 with can be expressed as
[TABLE]
The canonical equation is a useful expression for the second-order -optimum rate [11, 6, 12, 13]. The equation (3.26) is the hypothesis testing counterpart of these results. ∎
IV Mixed memoryless alternative hypothesis
In this section, we consider the case where not only the null hypothesis but also the alternative hypothesis is a mixed memoryless source.
Let be a family of probability distributions on where is a probability space with probability measure . We assume here that is a compact space and is continuous as a function of .
The hypothesis testing problem considered in this section is stated as follows.
- •
The null hypothesis is a mixed memoryless source , that is, for
[TABLE]
where
[TABLE]
- •
The alternative hypothesis is another mixed memoryless source , that is, for
[TABLE]
where
[TABLE]
For simplicity, we may write (resp. ) instead of (resp. ). We assume also that .
Theorem IV.1** (First order -optimum exponent)**
For ,
[TABLE]
where the function is specified by the equation
[TABLE]
and (the essential infimum of with respect to ; is measured with respect to the probability measure ).
Remark IV.1
Notice here that is continuous in . Since we have assumed that is compact and is continuous in , there indeed exists a continuous function satisfying (4.4). ∎
Remark IV.2
In the case that is a singleton, the above theorem coincides with Theorem III.1. Therefore, this theorem is a direct generalization of Theorem III.1. This means that if both and are singletons the theorem coincides with Stein’s lemma. (See, Remark III.1.) ∎
Remark IV.3
Remark III.2 is also valid in this theorem. That is, is expressed also as
[TABLE]
∎
Proof:
In order to show the theorem, let be the set of -typical sequence with respect to , that is, let be the set of all such that
[TABLE]
where is the number of such that , and is an arbitrary constant. Then, it is well known that
[TABLE]
We first derive the upper and lower bounds for the probability
[TABLE]
for any fixed .
In order to upper bound (4.7), we define as
[TABLE]
where denotes the essential supremum of with respect to , i.e., . Thus, from the property of the essential supremum we immediately have
[TABLE]
for .
Let denote the type of . Then, noting that
[TABLE]
holds, is written as
[TABLE]
Here, it is important to notice that is continuous in owing to the assumption and hence is continuous in (the set of probability distributions on ). Thus, expanding in around leads to
[TABLE]
where as , because for .
Then, in view of (4.8) for each we have the upper bound:
[TABLE]
from which it follows that
[TABLE]
for each .
Next, we show the lower bound of . For any and any small constant , set
[TABLE]
then, by the definition of -ess.inf.,
[TABLE]
holds. Our claim is that for any and sufficiently small and with some positive constant
[TABLE]
To see this, consider a sequence such that . Then, there exists a positive integer such that . Otherwise, the continuity of probability measure implies that
[TABLE]
which is a contradiction. On the other hand, in view of (4.11), is equivalent to
[TABLE]
Since and are continuous in around , if is sufficiently small then from (4.13) it follows that
[TABLE]
where we also have used the expansion in around and as . Therefore, all satisfy (4.14). Now we can take so that to have
[TABLE]
Therefore, . Hence, we have
[TABLE]
This is nothing but (4.12).
Thus, again for we have the lower bound
[TABLE]
where in the last equality and in the last inequality we have used the continuity of in around and (4.16), respectively. From (IV), we obtain
[TABLE]
for each .
We now turn to prove the theorem by using (4.10) and (4.18). In view of Theorem II.1 and Remark IV.3, it suffices to show two inequalities:
[TABLE]
[TABLE]
Proof of (4.19):
Similarly to the derivation of (3.11) with Lemma III.2, we have
[TABLE]
From the definition of the -typical set and (4.10), we also have
[TABLE]
for any . Here, we define two sets:
[TABLE]
Then, from the definition of there exists a small constant satisfying
[TABLE]
for . Thus, it holds that
[TABLE]
where we have used the relation , and for sufficiently large and sufficiently small .
Therefore, noting that, with ,
[TABLE]
gives the arithmetic average of i.i.d. variables with expectation
[TABLE]
Then, the weak law of large numbers yields that for ,
[TABLE]
Thus, from (4.23) and (4.24), the right-hand side of (4.21) is upper bounded by
[TABLE]
which completes the proof of (4.19).
Proof of (4.20):
Similarly to the derivation of (3.21) with Lemma III.3, we have
[TABLE]
From the definition of the -typical set and (4.18), we also have
[TABLE]
for any .
We also partition the parameter space into two sets.
[TABLE]
Then, for , if we set and sufficiently small, then there exists a constant satisfying
[TABLE]
Thus, again by invoking the weak law of large numbers, we have for
[TABLE]
Summarizing up, we obtain
[TABLE]
This completes the proof of (4.20). ∎
To illustrate the siginificance of Theorem IV.1, let us now consider the special case with and countably infinite parameter spaces as
[TABLE]
In this case, we can write the null and alternative hypotheses , as , with positive probability weights , . Then, by virtue of Theorem IV.1, we have the following simplified result:
Corollary IV.1
For ,
[TABLE]
Proof:
The formula (4.3) can be written in this case as
[TABLE]
where is uniquely specified by
[TABLE]
because of the assumed closedness of . Let
[TABLE]
then this means that
[TABLE]
Contrarily, let
[TABLE]
then this means that
[TABLE]
As a consequence, (4.31) follows from (4.32), (4.34) and (4.35). ∎
Remark IV.4
One may wonder if it might be possible to deal with the second-order -optimum problem too using the arguments as developed in the above for the first-order -optimum problem with mixed memoryless sources and . To do so, however, it seems that we need some novel techniques, which remains to be studied. ∎
V hypothesis testing with mixed general sources
We have so far considered the -hypothesis testing for mixed memoryless sources. In this section, we consider more general settings such as hypothesis testings with mixed general sources.
To do so, we consider the case where both of null hypothesis and alternative hypothesis are finite mixtures of general sources as follows:
- •
The null hypothesis is a mixed general source consisting of general (not necessarily memoryless) sources , that is, ,
[TABLE]
where and
- •
The alternative hypothesis is another mixed general source consisting of general (not necessarily memoryless) sources , that is, ,
[TABLE]
where and
In this general setting, it is hard to derive a compact formula for the first-order -optimum exponent (for ). Instead, we can obtain the following theorem in the special case of .
Theorem V.1
[TABLE]
In particular, if and are all stationary memoryless sources specified by and , respectively, then
[TABLE]
which is a special case of Corollary IV.1.
Proof:
The proof proceeds in parallel with the argument in [10, Remark 4.4.3]. To be self-contained, we fully describe it in Appendix B. ∎
We can consider the following exponentially -optimum exponent in the hypothesis testing with the two mixed general sources and as above.
Definition V.1
Let be any fixed constant. Rate is said to be exponentially -achievable if there exists an acceptance region such that
[TABLE]
Definition V.2** (First-order exponentially -optimum exponent)**
[TABLE]
Then, it is not difficult to verify that an analogous result to Theorem V.1 holds (which is a generalization of [10, Remark 4.4.3]):
Theorem V.2
[TABLE]
In particular, if the null and alternative hypotheses consist of stationary memoryless sources and , respectively, then
[TABLE]
by virtue of Hoeffding’s theorem. ∎
VI Hypothesis testing with compound sources
In this section, we consider the compound hypothesis testing problem with finite null hypotheses and finite alternative hypotheses , where and are general sources.
The compound hypothesis testing is the problem in which a pair of general sources occurs as a pair (null hypothesis, alternative hypothesis), and the tester does not know which pair is actually working. This means that the acceptance region cannot depend on and . The type I error of the compound hypothesis testing is given by
[TABLE]
for each general null hypothesis . The type II error is also given by
[TABLE]
for each general alternative hypothesis . Then, the following achievability is of our interest.
Definition VI.1
Rate is said to be [math]-achievable for the compound hypothesis testing, if there exists an acceptance region such that
[TABLE]
for all and .
Definition VI.2** (First-order optimum exponent)**
[TABLE]
The following theorem reveals the relationship between the hypothesis testing with mixed general sources as defined in (5.1) and (5.2), and the compound hypothesis testing with the general sources.
Theorem VI.1
Assuming that and hold for all and , it holds that
[TABLE]
where with sources (5.1) and (5.2) we use here the notation
[TABLE]
to denote to make explicit the dependence on , .
Proof:
It suffices to show two inequalities:
[TABLE]
Proof of (6.6): Suppose that is [math]-achievable for the compound hypothesis testing, that is, there exists an acceptance region such that
[TABLE]
Then, the type I error probability for the hypothesis testing with mixed general sources is evaluated as follows. By the definition of and (5.1), we have
[TABLE]
from which, together with (6.8), we obtain
[TABLE]
Similarly, we have
[TABLE]
On the other hand, (6.9) implies
[TABLE]
holds for any and all . Substituting this inequality into (6.11) yields
[TABLE]
Since is arbitrary, from (6.10) and (6.12) we conclude that (6.6) holds.
Proof of (6.7):
Suppose that is [math]-achievable for the mixed hypothesis testing, that is, there exists an acceptance region such that
[TABLE]
We fix such an and set
[TABLE]
Then, from (5.1) we have
[TABLE]
from which, it follows that
[TABLE]
for all . From this inequality and (6.13), we obtain
[TABLE]
for all . Similarly,
[TABLE]
so that we have for ,
[TABLE]
which means that
[TABLE]
Noting that are constants, from (6.14) we obtain
[TABLE]
for all . From (6.15) and (6.16), we conclude that (6.7) holds. ∎
From Theorems V.1 and VI.1, we immediately obtain the first-order [math]-optimum exponent for the compound hypothesis testing as:
Corollary VI.1
Assuming that and hold for all and , we have
[TABLE]
In particular, if and are all stationary memoryless sources specified by and , respectively, (6.17) reduces to
[TABLE]
∎
Remark VI.1
Similarly to Definition V.1, we can define the exponentially -optimum exponent also for the compound hypothesis testing problem as follows.
Definition VI.3
Let be any fixed constant. Rate is said to be exponentially -achievable for the compound hypothesis testing, if there exists an acceptance region such that
[TABLE]
for all and .
Definition VI.4** (First-order exponentially -optimum exponent)**
[TABLE]
Then, using a similar argument to the proof of Theorem VI.1, the following theorem can be shown:
Theorem VI.2
Let and hold for all and , then it holds that
[TABLE]
where we use the notation
[TABLE]
to denote .
Combining Theorems V.2 and VI.2, we immediately obtain the following corollary.
Corollary VI.2
Let and hold for all and , then it holds that
[TABLE]
In particular, if the null and alternative hypotheses consist of stationary memoryless sources specified by and , respectively, as in Theorem V.2, then
[TABLE]
which corresponds to (5.5).
∎
VII concluding remarks
So far, we have considered the first- and second-order -optimum exponents in the hypothesis testing problem. First, we have studied the second-order -optimum problem with mixed memoryless sources. As we have shown in the analysis of the second-order -optimum exponent, we use, as a key property, the asymptotic normality of divergence density rate for each of the component sources. We also observe that the canonical representation, first introduced in [12], is still efficient to express the second-order -optimum exponent for mixed memoryless sources in the hypothesis testing problem.
The *first-*order -optimum exponent in the case with mixed memoryless null and alternative hypotheses has also been established. One may wonder whether we can apply the same approach in the derivation of the *second-*order -optimum exponent in this setting. One of our key techniques to derive the *first-*order -optimum exponent is an expansion around . More careful evaluation of this expansion would be needed to compute the second-order -optimum exponent. This is a future work.
The relationship between the first-order [math]-optimum (resp. exponentially -optimum) exponent in the hypothesis testing with mixed general sources and the [math]-optimum (resp. exponentially -optimum) exponent in the compound hypothesis testing has also been demonstrated.
Appendix A Proof of Lemmas II.1 and II.2
Proof of Lemma II.1:
Since
[TABLE]
holds, we obtain
[TABLE]
which completes the proof of the lemma.
Proof of Lemma II.2:
Define
[TABLE]
Then, it follows that
[TABLE]
for any , where denotes the complement of . The second term on the right-hand side of (A.1) is upper bounded as
[TABLE]
Substituting (A.2) into (A.1), we have the lemma.
Appendix B Proof of Theorem V.1
First, we prove the inequality:
[TABLE]
To do so, we arbitrarily fix for so that
[TABLE]
Then, by the definition of , there exists an acceptance region satisfying
[TABLE]
where and are defined respectively as
[TABLE]
By using these regions, we define the acceptance region as
[TABLE]
Then, we have
[TABLE]
from which, together with (B.3), we obtain
[TABLE]
Similarly, we have
[TABLE]
from which, together with (B.4), we obtain
[TABLE]
Since are arbitrary as far as (B.2) is satisfied, we have (B.1).
Next, we prove the inequality:
[TABLE]
To do so, let be [math]-achievable, then there exists an acceptance region satisfying
[TABLE]
We fix such an and consider the hypothesis testing with null hypothesis and alternative hypothesis for arbitrarily fixed and . Then, probabilities of type I error and type II error are given by
[TABLE]
Since
[TABLE]
we have
[TABLE]
From this inequality and (B.8) we obtain
[TABLE]
Similarly to the derivation of (B.10), we have
[TABLE]
Hence, from (B.9) we obtain
[TABLE]
From (B.11) and (B.12), it follows that is [math]-achievable for the hypothesis testing with against . Noting that are arbitrary with and , we obtain
[TABLE]
This means that (B.7) holds, completing the proof of Theorem V.1. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Dembo and O. Zeitouni, Large Deviations Techniques and Applications . Jones and Bartlett Publishers, Boston, 1993.
- 2[2] P. N. Chen, “General formulas for the Neyman-Pearson type-II error exponent subject to fixed and exponential type-I error bound,” IEEE Trans. Information Theory , vol. 42, no. 1, pp. 316–323, 1996.
- 3[3] V. Strassen, “Asymptotische Abshätzungen in Shannon’s Informations Theorie,” in Trans. 3rd Prague Conf. Inf. Theory , 1962, pp. 687–723.
- 4[4] M. Hayashi, “Information spectrum approach to second-order coding rate in channel coding,” IEEE Trans. Inf. Theory , vol. 55, no. 11, pp. 4947–4966, 2009.
- 5[5] Y. Polyanskiy, H. Poor, and S. Verdú, “Channel coding rate in the finite blocklength regime,” IEEE Trans. Inf. Theory , vol. 56, no. 5, pp. 2307–2359, 2010.
- 6[6] R. Nomura and T. S. Han, “Second-order resolvability, intrinsic randomness, and fixed-length source coding for mixed sources: Information spectrum approach,” IEEE Trans. Inf. Theory , vol. 59, no. 1, pp. 1–16, 2013.
- 7[7] V. Tan and O. Kosut, “On the dispersion of three network information theory problems,” IEEE Trans. Inf. Theory , vol. 60, no. 2, pp. 881–903, 2014.
- 8[8] S. Watanabe, “Second-order region for Gray-Wyner network,” IEEE Trans. Inf. Theory , vol. 63, no. 2, pp. 1006–1018, 2017.
