Exponent Function for Stationary Memoryless Channels with Input Cost at Rates above the Capacity
Yasutada Oohama

TL;DR
This paper investigates the decay rate of correct decoding probability for stationary memoryless channels with input cost at rates exceeding capacity, introducing a recursive method to determine the optimal exponent function.
Contribution
It introduces a recursive technique based on the information spectrum approach to analyze the exponential decay of decoding correctness above capacity.
Findings
Correct decoding probability tends to zero exponentially above capacity.
Determined the optimal exponent function for finite input/output sets.
Developed a new recursive method for analyzing information spectrum quantities.
Abstract
We consider the stationaly memoryless channels with input cost. We prove that for transmission rates above the capacity the correct probability of decoding tends to zero exponentially as the block length of codes tends to infinity. In the case where both of channel input and output sets are finite, we determine the optimal exponent function on the above exponential decay of the correct probability. To derive this result we use a new technique called the recuresive method, which is based on the information spectrum approach. The recursive method utilize a certain recursive structure on the information spectrum quantities.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Exponent Function for Stationary Memoryless Channels
with Input Cost at Rates above the Capacity
Yasutada Oohama
Y. Oohama is with Dept. of Communication Engineering and Informatics, University of Electro-Communications, 1-5-1 Chofugaoka Chofu-shi, Tokyo 182-8585, Japan.
Abstract
We consider the stationary memoryless channels with input cost. We prove that for transmission rates above the capacity the correct probability of decoding tends to zero exponentially as the block length of codes tends to infinity. In the case where both of channel input and output sets are finite, we determine the optimal exponent function on the above exponential decay of the correct probability. To derive this result we use a new technique called the recursive method, which is based on the information spectrum approach. The recursive method utilizes a certain recursive structure on the information spectrum quantities.
Index Terms:
Stationary memoryless channels, Strong converse theorem, Information spectrum approach
I Introduction
A certain class of noisy channels has a property that the error probability of decoding goes to one as the block length of transmitted codes tends to infinity at rates above the channel capacity. This property is called the strong converse property. In the case of DMCs without cost Arimoto [2] proved that the error probability of decoding goes to one exponentially and derived a lower bound of the exponent function. Subsequently, Dueck and Körner [3] determined the optimal exponent function for the error probability of decoding to go to one. They derived the result by using a combinatorial method base on the type of sequences [1]. The equality of the lower bound of Arimoto [2] to that of the optimal bound of Dueck and Körner [3] was proved by the author [4]. A simple derivation of the exponent function in the problem set up of quantum channel coding was given by Nagaoka [5], Hayashi and Nagaoka [6]. In the derivation they used the information spectrum method introduced by Han [7] and a min-max expression of the channel capacity.
In this paper, we determine the optimal exponent function on the correct probability of decoding at rates above capacity for DMCs with input cost. This result can be obtained by a method quite parallel with the method Dueck and Körner [3] used to obtain the optimal exponent function in the case without input cost. Instead of using their method, we use a new method based on the information spectrum method. A main contribution of this paper is that we establish a new powerful method to derive a tight exponent function at rates above the capacity for DMCs. As we mentioned previously, there have been three different methods by Arimoto [2], Dueck and Körner [3] and Nagaoka [5], Hayashi and Nagaoka [6] to derive the result. Our method can be regarded as the fourth new method, having the following two merits:
Our method and the method of Nagaoka [5], Hayashi and Nagaoka [6] are based on the information spectrum method. Those two methods have a common advantage that they also work for the derivation of the exponent function for general memoryless channels(GMCs), where the channel input and outputs are real lines. On the other hand, the method of type used by Dueck and Körner [3] only works for DMCs where channel input and output sets are finite.
- 2.
The recursive method is a general powerful tool to prove strong converse theorems for several coding problems in information theory. In fact, this method played important roles in deriving exponential strong converse exponent for communication systems treated in [8]-[12].
By the first merit, we derive a lower bound of the optimal exponent function for GMCs. This lower bound is thought to be useful for deriving explicit lower bounds of the optimal exponent functions for several examples of GMCs.
II
Capacity Results for the Discrete Memoryless Channels with Input Cost
We consider a stationary discrete memoryless channel(DMC) with the input set and the output set . We assume that and are finite sets. A case where and are real lines will be treated in Section VI.
The SDMC is specified by the following stochastic matrix:
[TABLE]
Let be a random variable taking values in . We write an element of as x^{n}=x_{1}x_{2}$$\cdots x_{n}. Suppose that has a probability distribution on denoted by . Similar notations are adopted for other random variables. Let be a random variable obtained as the channel output by connecting to the input of channel. We write a conditional distribution of on given as
[TABLE]
Since the channel is memoryless, we have
[TABLE]
Let be uniformly distributed random variables taking values in message sets .
The random variable is a message sent to the receiver. A sender transforms into a transmitted sequence using an encoder function and sends it to the receiver. In this paper we assume that the encoder function is a deterministic encoder. In this case, is a one-to-one mapping from into . The joint probability mass function on is given by
[TABLE]
where , are the -th components of and is a cardinality of the set . The decoding function at the receiver is denoted by . This function is formally defined by Let be a cost function. The average cost on output of must not exceed . This condition is given by , where
[TABLE]
The average error probabilities of decoding at the receiver is defined by
[TABLE]
For , set The families of sets is called the decoding regions. Using the decoding region, can be written as
[TABLE]
Set
[TABLE]
The quantity is called the average correct probability of decoding. This quantity has the following form
[TABLE]
For given , is -achievable under if for any , there exist a positive integer and a sequence of pairs such that for any ,
[TABLE]
The supremum of all -achievable under is denoted by . We set
[TABLE]
which is called the channel capacity. The maximum error probability of decoding is defined by as follows:
[TABLE]
Based on this quantity, we define the maximum capacity as follows. For a given , {\color[rgb]{0,0,0}R\normalcolor} is -achievable under , if for any , there exist a positive integer and a sequence of pairs such that for any ,
[TABLE]
The supremum of all -achievable rates under is denoted by C_{{\color[rgb]{0,0,0}\rm m\normalcolor},{\rm DMC}}(\varepsilon,\Gamma|{\color[rgb]{0,0,0}W\normalcolor}). We set
[TABLE]
which is called the maximum capacity of the DMC. Set
[TABLE]
where is a set of probability distribution on and stands for a mutual information between and when input distribution of is . The following is a well known result.
Theorem 1
For any DMC , we have
[TABLE]
Han [7] established the strong converse theorem for DMCs with input cost. His result is as follows.
Theorem 2** (Han [7])**
If , then for any satisfying
[TABLE]
we have
[TABLE]
The following corollary immediately follows from this theorem.
Corollary 1
For each fixed and any DMC , we have
[TABLE]
To examine an asymptotic behavior of for large at , we define the following quantities:
[TABLE]
On the above exponent functions, we have the following property.
Property 1
**
- a)
By definition we have that for each fixed , is a monotone increasing function of and satisfies .
- b)
The sequence of exponent functions satisfies the following subadditivity property:
[TABLE]
from which we have that exists and is equal to .
- c)
For fixed , the function is a monotone decreasing function of . For fixed , the function a monotone increasing function of and satisfies
[TABLE]
- d)
The function is a convex function of .
Proof of Property 1 is given in Appendix -A.
III Main Result
In this section we state our main result. Define
[TABLE]
where is the set of joint probability distributions on , , and
[TABLE]
Using the standard method developed by Csiszár and Körner [1], we can prove the following theorem.
Theorem 3
For any ,
[TABLE]
Proof of this theorem is given in Appendix -B. Let The case corresponds to the case without cost. In this case Dueck and Körner [3] show that
[TABLE]
They derived the bound by using a combinatorial method based on the type of sequences. Our method to prove Theorem 3 is different from their method since we do not use a particular structure of types.
We next derive a lower bound of . To this end we define several quantities. Define
[TABLE]
Our main result is the following.
Theorem 4
For any DMC , we have
[TABLE]
Proof of this theorem will be given in Section IV. Arimoto [2] derived a lower bound of , which we denote by . To describe this exponent function we define some functions. For , define
[TABLE]
Furthermore, set
[TABLE]
Then we have the following proposition.
Proposition 1
For any DMC and for any , we have the following:
[TABLE]
In particular, we have
[TABLE]
Proof of this proposition is given in Section V. We next state a relation between and . To this end we present a lemma stating that has two parametric expressions. For , we define
[TABLE]
For , we define
[TABLE]
Then we have the following lemma.
Lemma 1
For any , we have
[TABLE]
For any , any , we have
[TABLE]
The two equalities (13) and (14) imply that
[TABLE]
Proof of this lemma will be given in Appendix -C. The following proposition states that the two quantities and match.
Proposition 2
For any , we have the following:
[TABLE]
In particular, we have
[TABLE]
Proof of this proposition is given in Section V. From Theorems 3, 4 and Propositions 1, 2, we immediately obtain the following theorem.
Theorem 5
For any DMC , we have
[TABLE]
IV Proof of the Results
We first prove the following lemma.
Lemma 2
For any and for any satisfying we have
[TABLE]
In (22) we can choose any probability distribution on .
Proof : For , set
[TABLE]
Let stand for . Then we have the following:
[TABLE]
where
[TABLE]
On the quantity , we have
[TABLE]
Step (a) follows from the definition of . Step (b) follows from Step (c) follows from Hence it suffices to show to prove Lemma 2. We have the following chain of inequalities:
[TABLE]
Step (a) follows from that for every , we have . ∎
From Lemma 2, we have the following lemma
Lemma 3
For any and for any satisfying we have
[TABLE]
Proof: In (22) in Lemma 2, we choose having the form
[TABLE]
Then from the bound (22) in Lemma 2, we obtain
[TABLE]
completing the proof. ∎
We use the following lemma, which is well known as the Cramèr’s bound in the large deviation principle.
Lemma 4
For any real valued random variable and any , we have
[TABLE]
Here we define a quantity which serves as an exponential upper bound of . Let be a set of all probability distributions on having the form:
[TABLE]
For simplicity of notation we use the notation for . For and , we define
[TABLE]
By Lemmas 3 and 4, we have the following proposition.
Proposition 3
For any , any , and any satisfying we have
[TABLE]
for some and for any .
Proof: Under the condition we have the following chain of inequalities:
[TABLE]
Step (a) follows from Lemma 3. Step (b) follows from Lemma 4. We choose so that
[TABLE]
Solving (55) with respect to , we have
[TABLE]
For this choice of and (54), we have
[TABLE]
completing the proof. ∎
Set
[TABLE]
By the above definition of and Proposition 3, we have
[TABLE]
Then from (56), we obtain the following corollary.
Corollary 2
For any , we have
[TABLE]
We shall call the communication potential. The above corollary implies that the analysis of \overline{\Omega}^{({\color[rgb]{0,0,0}\mu,\normalcolor}\lambda)}($$W) leads to an establishment of a strong converse theorem for the DMC. In the following argument we drive an explicit upper bound of \overline{\Omega}^{({\color[rgb]{0,0,0}\mu,\normalcolor}\lambda)}(W). For each , define the function of by
[TABLE]
For each , we define the probability distribution
[TABLE]
by
[TABLE]
where
[TABLE]
are constants for normalization. For each , set
[TABLE]
where we define . Then we have the following lemma.
Lemma 5
[TABLE]
Proof: From (57) we have
[TABLE]
Furthermore, by definition we have
[TABLE]
From (59) and (60), (58) is obvious. ∎
The following lemma is useful for the computation of for .
Lemma 6
For each , and for any we have
[TABLE]
Furthermore, we have
[TABLE]
*Proof: * By the definition of {p}_{{}X^{t}Y^{t};Q^{t}}^{(\mu,\lambda)}$$($$x^{t},y^{t}), , we have
[TABLE]
Then we have the following chain of equalities:
[TABLE]
Steps (a) and (b) follow from (63). From (64), we have
[TABLE]
Taking summations of (66) and (66) with respect to , we obtain
[TABLE]
completing the proof. ∎
We set
[TABLE]
Then by (62) in Lemma 6 and the definition of , we have
[TABLE]
The following proposition is a mathematical core to prove our main result.
Proposition 4
For any , we have
[TABLE]
Proof: We first observe that by (58) in Lemma 5 and (67), we have
[TABLE]
In (69), we set Note that is a function of . We define a joint distribution on by
[TABLE]
Then we have
[TABLE]
We define recursively. For each , we choose so that it minimizes Let be one of the minimizes on the above optimization problem. We set . Note that can be determined recursively depending on the previous minizers . Then we have the following:
[TABLE]
Hence we have the following:
[TABLE]
Step (a) follows from (68). Step (b) follows from (70). Since (71) holds for any and for any , we have
[TABLE]
completing the proof. ∎
Proof of Theorem 4: From Corollary 2 and Proposition 4, we have \geq$$G^{(\mu,\lambda)}(R,\Gamma|W) for any . Hence we have the bound G^{*}(R,\Gamma|W)$$\geq$$G(R,\Gamma|W). ∎
V Equivalence of Three Exponent Functions
In this section we prove Propositions 1 and 2 stated in Section III. We first prove Proposition 1. The following is a key lemma to prove this proposition.
Lemma 7
For any
[TABLE]
The distribution attaining (1+\lambda)J^{(\mu,\frac{\lambda}{1+\lambda})}($$q_{X}|W) is given by
[TABLE]
where is a constant for normalization, having the form
[TABLE]
Proof: We observe that
[TABLE]
On the objective function of the minimization problem inside the logarithm function in (73), we have the following chain of inequalities:
[TABLE]
In (a), we have used the reverse Hölder inequality
[TABLE]
which holds for nonegative and for such that either or . In our case we have applied the inequality to
[TABLE]
In the reverse Hölder inequality the equality holds if and only if for some constant . In (74), the equality holds for
[TABLE]
where is a normalized constant. From (74), we have
[TABLE]
completing the proof. ∎
Proof of Proposition 1: The equality (9) in Proposition 1 immediately follows from Lemma 7. Using (9), we prove . We have the following chain of inequalities:
[TABLE]
Step (a) followns from (9) in Proposition 1. ∎
We next prove Proposition 2. We can show that and satisfies the following property.
Property 2
**
- a)
The function is monotone increasing function of and is positive if and only if .
- b)
For , set
[TABLE]
Then, for , necessary and sufficient conditions on the probability distribution that minimizes is
[TABLE]
for any with equality if .
We now proceed to the proof of Proposition 2.
Proof of (16) in Proposition 2: We prove . For a given joint distribution
[TABLE]
we introduce the stochastic matrix (x,y)∈X×Y and the probability distribution y∈Y by
[TABLE]
The above is called a backward channel. Using , we obtain the following chain of equalities:
[TABLE]
where (x,y)∈X×Y is a stochastic matrix whose components are
[TABLE]
and is a probability distribution whose components are
[TABLE]
Hence, by (75) and the non-negativity of divergence, we obtain
[TABLE]
for any . Next, we prove
[TABLE]
To this end it suffices to show that for any ,
[TABLE]
Let be a probability distribution that attains the minimum of . Then, by Property 2, we have
[TABLE]
for any with equality if . For with and , define the matrix (x,y)∈X×Y by
[TABLE]
By (76) and (77), each has the following form:
[TABLE]
Taking summation of both sides of (80) with respect to and taking (78) into account, we obtain
[TABLE]
The above equality implies that is a stochastic matrix. Furthermore, note that from (79),
[TABLE]
Then, choosing in (75), we have, for ,
[TABLE]
completing the proof. ∎
We prove (17) in Proposition 2 by (16).
Proof of (17) in Proposition 2: We prove . Let be an input distribution attaining . Then, by the definition of , we have
[TABLE]
for any . Hence we have
[TABLE]
Then we have the following chain of inequalities:
[TABLE]
Step (a) followns from (16) in Proposition 2. Step (b) follows from for any . ∎
VI Extention to Genaral Memoryless Channels
In this section we consider a stationary general memoryless channel(GMC), where and are real lines. The GMC is specified with a noisy channel . We assume that for each X=x$$\in{\cal X}, has a conditonal density function . Except for Theorem 3, Property 2 part b), and Proposition 2, the results we have presented so far also hold for this general case. Let be a probability measure on having the density . Let be a probability measure on having the density . In the case of GMC, the definitions of and are
[TABLE]
For GMC , we define the exponent functions and in a manner similar to the definitions of those exponent functions in the case of DMC. The following theorem is a generalization of Theorem 4 to the case of GMC.
Theorem 6
For any GMC , we have
[TABLE]
We next describe a lemma which is a generalization of Lemma 7 to the case of GMC. For , define
[TABLE]
Then we have the following lemma.
Lemma 8
For any probability densitity function on , we have
[TABLE]
The probability density function attaining is given by
[TABLE]
where is a constant for normalization, having the form
[TABLE]
For GMC , we define the exponent functions and in a manner similar to the definitions of those exponent functions in the case of DMC. From Lemma 8, we have the following proposition, which is a generalization of Proposition 1 to the case of GMC.
Proposition 5
For any GMC and for any , we have the following:
[TABLE]
In particular, we have
[TABLE]
From Theorem 6 and Proposition 5, we immediately obtain the following result.
Theorem 7
For any GMC , we have
[TABLE]
Theorem 3 is related to the upper bound of . Proof of this theorem depends heavily on a finiteness of . We have no result on the upper bound of and the tightness of the bound . In the case of GMC, and are not computable since those are variational problems. On the other hand, has a min-max expression. In [13], the author succeeded in obtaining an explicit form of for additive white Gaussian noise channels(AWGNs) by utilizing the min-max property of .
-A *
General Properties on *
In this appendix we prove Property 1 describing general properties on .
Proof of Property 1: By definition it is obvious that for fixed , is a monotone increasing function of and that for fixed , is a monotone increasing function of . We prove the part b). By time sharing we have that
[TABLE]
The part b) follows by letting and in (87). We next prove the part c). By definition it is obvious that for fixed , is a monotone decreasing function of and that for fixed , is a monotone increasing function of . It is obvious that the worst pair of is that for , the decoder always outputs a constant message . In this case we have
[TABLE]
Hence we have in the part c). We finally prove the part d). Let be an integer part of . Fix any . Let . We choose so that
[TABLE]
For this choice of and , we have
[TABLE]
Fix small positive arbitrary. Then, for any
[TABLE]
we have the following chain of inequalities:
[TABLE]
Step (a) follows from the part a) and
[TABLE]
Step (b) follows from the part a). Step (c) follows from (87). Step (d) follows from (88). Letting in (89), we have
[TABLE]
where can be taken arbitrary small. We choose , , and , as
[TABLE]
For the above choice of , , and , we have
[TABLE]
Then we have the following chain of inequalities:
[TABLE]
Step (a) follows from (92). Step (b) follows from (90). Step (c) follows from (7). Step (d) follows from (91). For any positive , we have the following chain of inequalities:
[TABLE]
Step (a) follows from (93). Step (b) follows from (90). Since can be taken arbitrary small in (94), we have
[TABLE]
which implies the convexity of on . ∎
-B Proof of Theorem 3
In this appendix we prove Theorem 3. We first describe some definitions necessary for the proof. For , set
[TABLE]
The probability distribution on is called the type of sequences in . Let be a set of all types of sequences in . Let be a set of all conditional distributions on for given . We fix . We consider any pair satisfying . For such pair of , we can construct an -length block code with message set satisfying:
- a)
.
- b)
all codewords have the identical type .
- c)
.
By the condition b), we have Hence the -length block code satisfies the cost constraint. Furthermore, by this condition we can obtain the following result.
Lemma 9
For every , we have
[TABLE]
Proof: For each , we set
[TABLE]
For each , we have the following chain of equalities:
[TABLE]
Step (a) follows from the memoryless property of the noisy channel. Step (b) follows from that . ∎
For , we set
[TABLE]
Furthermore, set
[TABLE]
The quantity has a lower bound given by the following Lemma.
Lemma 10
For any , we have
[TABLE]
Here we set and stands for a binary entropy function.
Proof: We have the following chain of inequalities:
[TABLE]
Step (a) follows from Lemma 9. Steps (b) and (c) follow from the log-sum inequality. Step (d) follows from that
[TABLE]
and From (97), we obtain
[TABLE]
completing the proof. ∎
Proof of Theorem 3: We first consider the case where . In this case we choose . Then we have
[TABLE]
Step (a) follows from the condition . Step (b) follows from that
[TABLE]
We next consider the case where . Consider the new message set satisfying . For new message set , we define such that if . For , we define arbitrary sequence of having the type . We use the same decoder as that of the message set . Then we have the following:
[TABLE]
Step (a) follows from (96) in Lemma 10. Step (b) follows from Step (c) follows from . Combining (98) and (99), we have
[TABLE]
for any with and . Hence from (100), we have
[TABLE]
The quantity appearing in the last inequality is an error bound coming from an approximation of the marginal distribution of achieving by some suitable type Since can be made arbitrary close to by letting sufficiently large, we can choose so that as . We further note that as . Hence by letting in (101), we obtain
[TABLE]
Since can be made arbitrary small, we conclude that ∎
-C Proof of Lemma 1
In this appendix we prove Lemma 1. We can show that satisfies the following property.
Property 3
**
- a)
For every fixed , the function is monotone increasing for and takes positive value if and only if . For every fixed , the function is monotone decreasing for
- b)
* is a convex function of .*
- c)
For
[TABLE]
Property 3 part a) is obvious. Proof of the part b) is found in Appendix -D. Proof of part c) is quite similar to that of the case without input cost given by Dueck and Körner [3]. We omit the detail.
We can show that satisfies the following property.
Property 4
**
- a)
For every fixed , the function is monotone increasing for . For every fixed , the function is monotone decreasing for
- b)
For every fixed , the function is a convex function of .
- c)
For
[TABLE]
Property 4 part a) is obvious. Proof of the part b) is found in Appendix -E. Proof of part c) is quite similar to that of the case without input cost given by Dueck and Körner [3]. We omit the detail.
*Proof of (13) in Lemma 1: * From its formula, it is obvious that for any
[TABLE]
Hence it suffices to prove that for any , there exists such that
[TABLE]
By Property 3 part b), is a monotone decreasing and convex function of . Then, there exists such that for any , we have
[TABLE]
Fix the above . Let be a joint distribution that attains . Set . By the definition of , we have
[TABLE]
Then, we the following chain of inequalities:
[TABLE]
Step (a) follows from (103). Step (b) follows from (104). Step (c) follows from the choice of . It follows from (105) that for , (102) holds for some . This completes the proof. ∎
*Proof of (14) in Lemma 1: * Since for any and any , it is obvious that
[TABLE]
Hence it suffices to prove that for , there exists such that By Property 4 part b) is a monotone increasing and convex function of . Then, by Property 3 part c), there exists such that for any , we have
[TABLE]
Let be a joint distribution that attains . Set . Then we have the following chain of inequalities:
[TABLE]
Step (a) follows from (106). Step (b) follows from the choice of . ∎
-D Proof of Property 3 part b)
Proof of Property 3 part b): We first observe that
[TABLE]
where we set
[TABLE]
For each , let be a probability distribution that attains . By definition we have
[TABLE]
For , we set where The quantities and are probability and conditional probability distributions induced by . Set By the linearity of with respect to , we have that
[TABLE]
By the convex property of and with respect to , we have that
[TABLE]
Set . We have the following two chains of inequalities:
[TABLE]
Steps (a) and (c) follow from (111). Steps (b) and (d) follow from the definition of From (-D) and (113), we have that
[TABLE]
Thus we have the following chain of inequalities
[TABLE]
Step (a) follows from (110). Step (b) follows from (114). Step (c) follows from (109). ∎
-E Proof of Property 4 part b)
Proof of Property 4 part b): We set
[TABLE]
Then we have
[TABLE]
For each , let be a probability distribution that attains . By definition we have
[TABLE]
For , we set where The quantities and are probability and conditional probability distributions induced by . By the convex property of
[TABLE]
with respect to , we have that
[TABLE]
Then we have the following two chains of inequalities:
[TABLE]
Steps (a) and (c) follow from (116). Steps (b) and (d) follow from the definition of From (-E) and (118), we have that
[TABLE]
Thus we have the following chain of inequalities
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] I. Csiszár and J. Körner, Information Theory: Coding Theorems for Discrete Memoryless Systems. Academic Press, New York, 1981.
- 2[2] S. Arimoto, “On the converse to the coding theorem for discrete memoryless channels,” IEEE Trans. Inform. Theory, vol. IT-19, no. 3, pp. 357–359, May 1973.
- 3[3] G. Dueck and J. Körner, “Reliability function of a discrete memoryless channel at rates above capacity,” IEEE Trans. Inform. Theory , vol. IT-25, no. 1, pp. 82–85, 1979.
- 4[4] Y. Oohama, “On two strong converse theorems for discrete memoryless channels,” IEICE Trans. Fundamentals , vol. 98, no. 12, pp. 2471–2475, 2015.
- 5[5] H. Nagaoka, “Strong converse theorems in quantum information theory,” Proceedings of ERATO Workshop on Quantum Information Science , p. 33, 2001.
- 6[6] M. Hayashi and H. Nagaoka, “General formulas for capacity of classical-quantum channels”, IEEE Trans. Inform Theory , vol. 49, no. 7, pp. 1753–1768, 2003.
- 7[7] T. S. Han, Information-Spectrum Methods in Information Theory. Springer-Verlag, Berlin, New York, 2002. The Japanese edition was published by Baifukan-publisher, Tokyo, 1998.
- 8[8] Y. Oohama, “Exponent function for one helper source coding problem at rates outside the rate region,” Proceedings of the 2015 IEEE International Symposium on Information Theory , pp. 1575–1579, Hong Kong, China, June 14-19, 2015.
