On Achievable Rates of AWGN Energy-Harvesting Channels with Block Energy Arrival and Non-Vanishing Error Probabilities
Silas L. Fong, Vincent Y. F. Tan, and Ayfer \"Ozg\"ur

TL;DR
This paper characterizes the achievable communication rates of an AWGN energy-harvesting channel with block energy arrivals, providing first- and second-order asymptotic expansions and proposing adaptive strategies for different energy arrival regimes.
Contribution
It offers the first comprehensive analysis of the Δ-capacity and second-order terms for AWGN EH channels with block energy arrivals, introducing an adaptive save-and-transmit strategy.
Findings
First-order Δ-capacity characterized for various block lengths.
Second-order term scales as â(L/n) for fixed or sublinear L.
Adaptive save-and-transmit strategy improves performance for linear L growth.
Abstract
This paper investigates the achievable rates of an additive white Gaussian noise (AWGN) energy-harvesting (EH) channel with an infinite battery. The EH process is characterized by a sequence of blocks of harvested energy, which is known causally at the source. The harvested energy remains constant within a block while the harvested energy across different blocks is characterized by a sequence of independent and identically distributed (i.i.d.) random variables. The blocks have length , which can be interpreted as the coherence time of the energy arrival process. If is a constant or grows sublinearly in the blocklength , we fully characterize the first-order term in the asymptotic expansion of the maximum transmission rate subject to a fixed tolerable error probability . The first-order term is known as the -capacity. In addition, we obtain lower andâŠ
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On Achievable Rates of AWGN Energy-Harvesting Channels with Block Energy Arrival and Non-Vanishing Error Probabilities
Silas L. Fong, Vincent Y. F. Tan, and Ayfer ĂzgĂŒr S. L. Fong is with the Department of Electrical and Computer Engineering, National University of Singapore, Singapore 117583 (e-mail: [email protected]). V. Y. F. Tan is with the Department of Electrical and Computer Engineering, National University of Singapore, Singapore 117583, and also with the Department of Mathematics, National University of Singapore, Singapore 119076 (e-mail: [email protected]).A. ĂzgĂŒr is with the Department of Electrical Engineering, Stanford University, CA 94305, USA (email: [email protected]).
Abstract
This paper investigates the achievable rates of an additive white Gaussian noise (AWGN) energy-harvesting (EH) channel with an infinite battery. The EH process is characterized by a sequence of blocks of harvested energy, which is known causally at the source. The harvested energy remains constant within a block while the harvested energy across different blocks is characterized by a sequence of independent and identically distributed (i.i.d.) random variables. The blocks have length , which can be interpreted as the coherence time of the energy arrival process. If is a constant or grows sublinearly in the blocklength , we fully characterize the first-order term in the asymptotic expansion of the maximum transmission rate subject to a fixed tolerable error probability . The first-order term is known as the -capacity. In addition, we obtain lower and upper bounds on the second-order term in the asymptotic expansion, which reveal that the second order term scales as for any less than . The lower bound is obtained through analyzing the save-and-transmit strategy. If grows linearly in , we obtain lower and upper bounds on the -capacity, which coincide whenever the cumulative distribution function (cdf) of the EH random variable is continuous and strictly increasing. In order to achieve the lower bound, we have proposed a novel adaptive save-and-transmit strategy, which chooses different save-and-transmit codes across different blocks according to the energy variation across the blocks.
Index Terms:
Achievable rates, block energy arrival, energy-harvesting, non-vanishing error probabilities, save-and-transmit
I Introduction
The energy-harvesting (EH) channel consists of one source equipped with an energy buffer (also called battery), and one destination. For simplicity, we assume in this paper that the buffer has infinite capacity. At each discrete time , a random amount of energy arrives at the buffer and the source transmits a symbol such that
[TABLE]
This implies that the total harvested energy must be no smaller than the energy of the codeword at every discrete time for transmission to take place successfully. The knowledge of is available at the source at time before encoding , and the destination has no access to the energy-arrival process.
We assume that arrive at the buffer in a block-by-block manner as follows: For each , let
[TABLE]
such that is the index of the first channel use within the block of energy arrival, where denotes the length of each block. In other words, the block of energy arrival starts at the channel use. The EH random variables that mark the starting points of the blocks (i.e., ) are assumed to be independent and identically distributed (i.i.d.) random variables where111If the constraint is replaced with the less stringent one , all the achievability results in this paper continue to hold. In fact, the only place that requires is the use of the Berry-Esséen theorem in Section VI-B in the course of proving the converse of Theorem 1. and for some . A large class of distributions of practical interests satisfy the third-moment assumption including those with well-defined moment generating function. In addition, we assume
[TABLE]
for all . In other words, the harvested energy in each channel use within a block remains constant while the harvested energy across different blocks is characterized by a sequence of i.i.d. random variables with mean equal to . This block-by-block energy-arrival assumption is useful for modeling practical scenarios when the energy-arrival process evolves at a slower timescale compared to the transmission process [1, Sec. II-C]. This is often the case for most natural energy processes, such as solar energy or wind energy. The block i.i.d. EH process can model, for example, a solar panel which harvests energy from the sun, and the appearance of clouds can change randomly and block certain amounts of sunshine for a certain period of time. Similarly, this is a good model for a device which harvests RF energy from other transmitting devices in its environment. Such transmitting devices typically transmit continuously for certain periods of time and are silent for the remaining periods (as in TDMA, for example), which warrants a block i.i.d. model. Most importantly, the block i.i.d. model provides a simple way to study the impact of correlations in the EH process on the system capacity. Such block i.i.d. models are popularly used in wireless communication as a means to capture correlations in the channel fading process by a simple model. In that context, the block length  is called the coherence time of the channel, which corresponds to the time duration over which the channel remains approximately constant [2, Sec. 2.3]. Analogously, we refer to as the coherence time of the energy arrival process.
The channel noise of the EH channel is modeled as an additive white Gaussian noise (AWGN), which is described as follows. In each time slot , after the source has transmitted , the destination receives
[TABLE]
where are i.i.d. standard normal random variables. The above EH channel is referred to as the AWGN EH channel. It was shown by Ozel and Ulukus [3] that the capacity of the AWGN EH channel for the case is
[TABLE]
where is the expectation of the harvested energy for each energy arrival. In this paper, we assume that can grow with and would like to investigate how the growth rate of affects the first- and second-order terms of the asymptotic expansion of the maximum transmission rate. The first-order term is also known as the -capacity [4, Sec. 3.4], and the second-order term divided by an appropriate scaling (which is in many cases including the AWGN channel) is known as the second-order coding rate [5]. The following two cases regarding the growth rate of will be investigated in this paper:
- (i)
is a constant or grows sublinearly in . The latter statement means
[TABLE] 2. (ii)
grows linearly in .
Note that in practice and are two independent parameters. The first one is dictated by the nature of the EH process and the second one is a design parameter typically dictated by the delay and reliability requirements of the application and the complexity constraints at the transmitter and the receiver. Depending on how fast the EH process changes over time, can be significantly smaller than or comparable to . In order to reveal the impact of the interplay between these two parameters on the second-order term, we couple these parameters in different ways, say , and study the limiting case when both and approach for different couplings, i.e., different values of in where captures how large is with respect to . This allows us to identify how and together impact the second-order term. In particular, we conclude that it is the ratio of the two that determines the second-order term. The impact of correlation in the EH process can be interpreted as effectively decreasing the blocklength by a factor of . Note that keeping constant while taking to infinity, i.e., considering only the special case , would lead to a degenerate regime where the correlation in the EH process does not play a significant role. The approach we take here has been extensively used in the wireless information theory literature to obtain asymptotic results in multi-parameter problems where the problem involves multiple independent parameters that can be large or small with respect to each other. (See for example the notion of generalized degrees of freedom in [6] and follow-up work, or Section 3.1 of [7] for a detailed discussion of a similar formulation in the context of scaling laws for wireless networks.)
I-A Main Contribution
We use , , and denote standard asymptotic Bachmann-Landau notations except our convention that they must be non-negative. The contributions of this paper are summarized in the following:
Case (i): When is a constant or grows sublinearly inÂ
First, we prove an achievable finite blocklength bound based on the save-and-transmit strategy of [3]. During the saving phase of the save-and-transmit strategy, energy is saved for a certain number of time slots. During this period, no information is transmitted. Subsequently, during the transmission phase, the source uses a Gaussian codebook to send information. In order to analyze the performance of this save-and-transmit strategy, we construct a single sequence of random variables that characterizes the probability of the available energy being insufficient to support the Gaussian codeword (i.e., for all ) and derive a concentration bound related to the random sequence. Our analysis reveals that the backoff from capacity for the optimal length- code with error probability less than is no larger than O\Big{(}\sqrt{\frac{L}{n}}\Big{)}. More specifically, the maximum alphabet size of the message we can transmit over channel uses with average probability of error no larger than , denoted by , satisfies
[TABLE]
where is some constant that does not depend on . We also identify the implied constant in Theorem 1 in Section III. The qualitative interpretation of will be given in Remark 3 in Section III-C. The lower bound (7) is obtained by choosing the lengths of the saving phase and transmission phase to be and respectively, which is illustrated in Figure 1 where the accumulated harvested energy is always above the accumulated transmitted energy due to the EH constraints (1). 2. 2.
Second, we prove a non-asymptotic upper bound on achievable rates by simplifying the type-II error of a carefully chosen binary hypothesis test. The first-order term of the upper bound is and the second-order term is proportional to for all . More specifically, for all , we have
[TABLE]
where is some constant that does not depend on . We also identify the implied constant in Theorem 1 in Section III. The qualitative interpretation of will be given in Remark 4 in Section III-C. Note that (7) and (8) together reveal that the back-off from for the optimal length- code with error probability less than is of the order . Therefore, the impact of correlation in the EH process can be interpreted as effectively decreasing the blocklength by a factor of . In other words, to achieve the same reliability, one needs to increase the blocklength by a factor equal to the coherence time of the EH process.
It is readily seen from (7) and (8) that for any fixed , the -capacity is and the second-order term in the asymptotic expansion for the maximum achievable rate is proportional to . In addition, define
[TABLE]
to be the second-order coding rate [5]. We can see from (7) and (8) that for any fixed , the second-order coding rate is sandwiched between and .
Case (ii): When grows linearly inÂ
First, we prove a lower bound on the -capacity, as shown in Theorem 2 in Section III, based on a modified version of the save-and-transmit strategy called the adaptive save-and-transmit strategy. Under the adaptive save-and-transmit strategy which is described in Section VII-A, different save-and-transmit codes are used across different blocks. In each block , the coding rate is adapted to the corresponding EH random variable so that it is close to . In addition, the lengths of the saving phase and transmission phase for block are chosen to be and respectively as illustrated in Figure 2. 2. 2.
Second, we prove an upper bound on the -capacity (Theorem 2). We do so by considering a typical set of sequences of EH random variables followed by simplifying the type-II error of a binary hypothesis test conditioned on the aforementioned typical set.
For any EH process whose EH random variable has a continuous and strictly increasing cumulative density function (cdf), the upper and lower bounds in Theorem 2 coincide and hence the -capacity is fully characterized. See Remark 6 in Section III-C for a detailed discussion. Case (ii) is useful for modeling the scenario where the energy-harvesting rate changes slowly such that the number of energy-arrival blocks stays constant as increases. Since the number of energy-arrival blocks stays constant and the length of each energy-arrival block grows with , it is first-order optimal to choose an appropriate save-and-transmit scheme that achieves the maximum coding rate for each block according to the energy level in that block. Therefore, we need an adaptive save-and-transmit scheme rather than the conventional non-adaptive one to achieve the overall maximum coding rate.
I-B Related Work
The channel capacity was characterized for the AWGN channel with an i.i.d. EH process in [3] and with a stationary ergodic EH process in [8]. The aforementioned studies showed that with an unlimited battery, the capacity of the AWGN channel with stochastic energy constraints is equal to the capacity of the same channel under an average power constraint as long as the average power equals the average recharge rate of the battery. In this paper, we focus on the AWGN channel with a block EH process, where the energy arrivals remain constant for a block of duration and are independent across blocks drawn from an arbitrary distribution. A similar block i.i.d. EH model has been recently considered in [9, 10] concurrently with the current paper. However, these papers focus on the power control problem for EH communications with finite battery at the transmitter. In this paper, we rather consider the information-theoretic capacity of the channel and with infinite battery at the transmitter. Characterizing the information theoretic capacity of the channel with a finite battery is known to be a difficult problem even for an i.i.d. model for the energy arrivals and in general remains an open problem. It has been studied in several recent works [11, 12, 13, 14] and the most recent ones [13, 14] characterize the capacity within a constant gap. Due to the lack of a complete characterization of the capacity under a finite battery assumption, in this paper we focus on the AWGN EH channel with infinite battery and develop bounds on the first- and second-order terms in the asymptotic expansion of the maximum transmission rate.
For a fixed tolerable error probability , Fong et al. [15] recently performed a finite blocklength analysis of save-and-transmit schemes proposed in [3] and obtained a non-asymptotic achievable rate for the AWGN channel with an i.i.d. EH process. The first-, second- and third-order terms of the non-asymptotic achievable rate presented in [15, Th. 1] are equal to the capacity, and respectively where and are some positive constants that do not depend on and . Subsequently, Shenoy and Sharma [16] refined the analysis in [15] and improved the second-order term to where is some positive constant that does not depend on and . This paper further improves the second-order term to for any where is some positive constant that does not depend on and (see Remark 2). The aforementioned improvements are due to better analyses of the âenergy outageâ probability for the same save-and-transmit strategy, where the âenergy outageâ occurs when the source cannot output the desired codeword due to energy shortage.
I-C Paper Outline
This paper is organized as follows. The notation used in this paper is described in the next subsection. Section II states the formulation of the AWGN EH channel with block energy arrival. Section III presents our two main results â the first result fully characterizes the -capacity and provides lower and upper bounds on the second-order coding rate when is a constant or grows sublinearly in ; the second result presents lower and upper bounds on the -capacity when grows linearly in , where the two bounds coincide for random variables with continuous and strictly increasing cdf. In Section IV, we present the proof of the first main result, which relies on a save-and-transmit achievability lemma and a converse lemma. The proofs of the achievability and converse lemmas are provided respectively in Sections V and VI, which are briefly described as follows. Section V describes the save-and-transmit strategy which is the key to the achievability part of the first result. More specifically, we use Shannonâs achievability bound [17] to prove a non-asymptotic achievable rate for the save-and-transmit strategy. Section VI proves the converse part of the first result, and the proof technique involves simplifying a non-asymptotic bound derived from the type-II error of a binary hypothesis test. In Section VII, we provide the proof of the second result when grows linearly in . Concluding remarks are provided in Section VIII.
I-D Notation
The sets of natural, real and non-negative real numbers are denoted by , and respectively. We let be the indicator function of the set . An arbitrary (discrete or continuous) random variable is denoted by an upper case letter (e.g., ), and the realization and alphabet of the random variable are denoted by the corresponding lower case letter (e.g., ) and calligraphic letter (e.g., ) respectively. We use to denote the random tuple .
The following notations are used for any arbitrary random variables and and any real-valued function with domain . We let and denote the probability distribution of and the conditional probability distribution of given respectively. More specifically, is the Radon-Nikodym derivative of a measure with respect to the Lebesgue measure in an appropriate Euclidean space. We let and be the evaluations of and respectively at . To make the dependence on the distribution explicit, we let denote for any set . The expectation and the variance of are denoted as and respectively. We let denote the probability density function of a Gaussian random variable whose mean and variance are and respectively, i.e.,
[TABLE]
The cdf of the standard normal distribution is denoted by , i.e.,
[TABLE]
We will take all logarithms to base throughout this paper unless specified otherwise. The logarithm function to base is denoted by , and the natural logirhtm function is denoted by .
II Additive White Gaussian Noise Energy-Harvesting Channel with Block Energy Arrival
II-A Problem formulation
The AWGN EH channel consists of one source and one destination, denoted by and respectively. Node transmits information to node in time slots as follows. Node chooses message and sends to node , where is uniformly distributed over for some that denotes the message size. Then for each , node transmits and node receives in time slot . Let be i.i.d. random variables that satisfy ( was defined in (2)),
[TABLE]
and (hence ) for some . Each other is equal to the nearest preceding according to (3). In other words, for each and all ,
[TABLE]
The knowledge of is available at the source at time before encoding , and the destination has no access to the energy-arrival process. The length of each energy-arrival block is assumed to remain constant, grow sublinearly in , or grow linearly in . We assume the following for each :
- (I)
and are independent when conditioned on , i.e.,
[TABLE] 2. (II)
Every codeword transmitted by must satisfy the harvested energy constraint
[TABLE]
for each and each .
Assumption (I) is a mathematical statement of the following fact due to the block i.i.d. EH process: If is the first energy-arrival random variable in a block, then it is independent of any random variables that are generated before time . Otherwise, equals . In both cases, and are independent when conditioned on .
After time slots, node declares to be the transmitted based on . The standard definitions are formally stated in the following subsection.
II-B Standard definitions
Definition 1
An -code consists of the following:
A message set at node . Message is uniform on . 2. 2.
*A sequence of encoding functions for each , where is the encoding function for node at time slot for encoding such that and (15) holds. * 3. 3.
A decoding function for decoding at node where the message estimate is produced by setting .
Definition 2
The AWGN EH channel is characterized by . The distribution induced by any -code used for the AWGN EH channel follows the channel law below: For each ,
[TABLE]
*where *
[TABLE]
for all and . Since does not depend on by (17), the channel is stationary.
For any -code defined on the AWGN EH channel, let be the joint distribution induced by the code. We can use Definition 1, (14) and (16) to factorize as follows:
[TABLE]
Definition 3
For an -code defined on the AWGN EH channel, we can calculate, according to (18), the average probability of decoding error defined as {\mathrm{Pr}}\big{\{}\hat{W}\neq W\big{\}}. We call an -code with average probability of decoding error no larger than an -code.
Definition 4
Let be a real number. A rate is -achievable for the AWGN EH channel if there exists a sequence of -codes222Although always depends on , it is not explicitly indicated to simplify notation. such that
[TABLE]
Definition 5
Let be a real number. The -capacity of the AWGN EH channel, denoted by , is defined to be C_{\varepsilon}\triangleq\sup\{R\,|R\text{ is \varepsilon-achievable}\}.
III Main Results
Section III-A contains the first main result in this paper, which concerns the -capacity and the second-order coding rate when is a constant or grows sublinearly in . Section III-B contains the second main result in this paper, which concerns the -capacity when grows linearly in .
III-A When is a constant or grows sublinearly inÂ
In this section, we assume that is a constant or so that . Our goal in this section is to formalize the results in (7) and (8). Before presenting the first main result, we define the second-order achievable rate as follows.
Definition 6
Let . A real number is said to be a second-order -achievable rate if there exists a sequence of -codes such that333Although can depend on , it is not explicitly indicated to simplify notation.
[TABLE]
The justification of the choice of in (20) will be explained after the following definition concerning the second-order coding rate is presented.
Definition 7
Let . The -second-order coding rate is defined as
[TABLE]
The choice of in (20) can be justified as follows by inspecting (29) in the main theorem. More specifically, if we replace in (20) with any such that and define as in Definition 7, it will then follow from (29) that
[TABLE]
Our choice of in (20) is analogous to the choice of in [18, Sec. II-D] which studies the -second-order coding rate of channels with states.
We are ready to present the first main result in this paper.
Theorem 1
Suppose is a constant or . Fix any . Recalling the definition of in (5), we have
[TABLE]
In addition, define
[TABLE]
[TABLE]
and
[TABLE]
Then, the -second-order coding rate satisfies
[TABLE]
In other words, if we let
[TABLE]
be the maximum alphabet size of the message we can transmit using an -code, then
[TABLE]
Theorem 1 presents a complicated lower bound on as stated in (25). The following corollary presents a simpler lower bound, which implies that scales as -O\Big{(}\sqrt{\log\frac{1}{\varepsilon}}\Big{)}. Since the proof of Corollary 1 is straightforward, it is relegated to Appendix A.
Corollary 1
Fix an . Following the definitions in Theorem 1, if we define
[TABLE]
then
[TABLE]
The following corollary presents an explicit bound on , whose proof relies on Corollary 1 and is relegated to Appendix B.
Corollary 2
Fix an (note that ). Following the definitions in Theorem 1, we have
[TABLE]
This work does not intend to optimize the bound in (32), which can be arbitrarily large as approaches infinity or approaches [math].
III-B When grows linearly inÂ
Before presenting the second main result, we make some necessary definitions. Fix an arbitrary and assume that
[TABLE]
Define
[TABLE]
and
[TABLE]
to be the quotient and remainder respectively resulting from dividing by . The following theorem is our second main result, which provides lower and upper bounds on . The proof of the lower and upper bounds will be given in Sections VII-A and VII-B respectively.
Theorem 2
Suppose grows linearly in according to (33) for some constant . Let and be as defined in (34) and (35) respectively, and recall that . Define
[TABLE]
and
[TABLE]
Then, we have
[TABLE]
for all .
The following corollary identifies a sufficient condition under which can be fully characterized by Theorem 2. The proof of Corollary 3 is straightforward and hence relegated to Appendix C.
Corollary 3
Under the setting of Theorem 2, if we further assume that has a continuous and strictly increasing cdf, then
[TABLE]
holds for all where is the unique threshold that satisfies
[TABLE]
III-C Remarks on Theorem 1 and Corollary 1
Remark 1
It is already known [15, Remark 1] that
[TABLE]
for all under an i.i.d. EH process. In other words, the AWGN EH channel admits the strong converse property [4, Ch. 3] for , meaning that does not depend on . It follows from (23) in Theorem 1 that (41) remains to hold under a block i.i.d. EH process when . An intuitive explanation about why the strong converse property holds when is as follows. When , since the number of energy-arrival blocks grows to infinity, it follows from the strong law of large numbers that the received power converges to with probability , which leads to a strong converse proof.
Remark 2
Consider the special case where and fix any . Clearly, both in (25) and in (26) are negative. Therefore, it follows from (29) in Theorem 1 that the second-order term in the asymptotic expansion of scales as . In particular, it follows from Corollary 1 that , meaning that the second-order term scales as . This improves the previous findings in [15, Th. 1] and [16] which established that the second-order term scales as -O\Big{(}\sqrt{\frac{\log n}{n}}\Big{)} and -O\Big{(}\frac{1}{\sqrt{n\varepsilon}}\Big{)} respectively.
Remark 3
Suppose and fix any . Clearly, in (25) is negative. In addition, the left hand side (LHS) of (29) which involves is the rate achievable by the save-and-transmit strategy (whose details can be found in Section V-B). For a fixed and a fixed , since is negative, it follows from the LHS of (29) that the rate achievable by the save-and-transmit strategy will increase at a slower rate if approaches infinity at a faster rate. This can be explained by the fact that block i.i.d. EH processes with longer result in higher probabilities of âenergy outageâ â the source cannot output the desired codeword due to energy shortage. Similarly for a fixed , since increases as the variance increases, it follows that block i.i.d. EH processes with larger variance result in higher probabilities of âenergy outageâ.
Remark 4
Suppose and fix any . Clearly, in (26) is negative. For a fixed , since is negative, it follows that the right hand side (RHS) of (29) increases at a slower rate if the following holds:
- (*)
* approaches infinity at a faster rate or is increased.*
In addition, it was shown in the previous remark that the LHS of (29) increases at a slower rate if () holds. Consequently, both the LHS and RHS of (29) increase at slower rates if () holds, which implies that the maximum rate achievable by an -code increases at a slower rate if () holds.*
Remark 5
Suppose . The achievability proof of Theorem 1 is based on analyzing the save-and-transmit strategy, which was illustrated in Figure 1 and will be formally discussed in Section V. Equation (25) in Theorem 1 is indeed a lower bound on the second-order coding rate achieved by the save-and-transmit strategy. By inspecting (25) we see that the two components that dominate the lower bound achieved by save-and-transmit are the saving period (contributed by the two terms with in (25)) and the Gaussian noise (contributed by the term with in (25)). If is a constant, both components contribute to the rate loss of the lower bound on the second-order coding rate achieved by save-and-transmit because the length of the saving period is and the minimum rate backoff needed to overcome the Gaussian noise is , which correspond to the quantities and in (25) respectively. If , the term vanishes and the resultant lower bound achieved by save-and-transmit is the quantity in (25), meaning that the length of the saving period dominates the lower bound.
III-D Remarks on Theorem 2 and Corollary 3
Remark 6
Suppose grows linearly in and has a continuous and strictly increasing cdf. Using the formula of the -capacity provided by Corollary 3, we conclude that is strictly increasing on , implying that the strong converse property ceases to hold. An intuitive explanation about why the strong converse property does not hold is as follows: Since the number of energy-arrival blocks remains constant and the cdf of is continuous and strictly increasing, the received power does not converge (with probability ) to a constant, which leads to the impossibility of a strong converse.
Remark 7
Consider the special case where and the cdf of is continuous and strictly increasing. Let be the cdf of . It then follows from Corollary 3 with the identifications , and that
[TABLE]
for all , which is analogous to the -capacities (outage capacities) of slow fading channels as stated in [19, Sec. 23.3.1], the -capacities of channels with mixed states as stated in [18, Example 1], and the -capacities of mixed channels as stated in [4, Example 3.4.2].
Remark 8
Suppose grows linearly in . The achievability proof of Theorem 2 is based on designing an adaptive save-and-transmit code that enables the source to adjust the transmission rate for each energy-arrival block according to the changes of harvested energy across different energy-arrival blocks. The adaptive save-and-transmit code was illustrated in Figure 2 and will be formally discussed in Section VII-A. Equation (36) in Theorem 2 is the coding rate achievable by the adaptive save-and-transmit strategy. By inspecting (36), we see that the main event that dominates the coding rate achievable by adaptive save-and-transmit is the âslow fadingâ behavior of the EH process â the energy-harvesting rate changes slowly such that the number of energy-arrival blocks stays constant as increases.
Remark 9
*Suppose grows linearly in . The converse proof of Theorem 2 is proved by considering a typical set of energy-arrival sequences followed by simplifying the conditional type-II errors of some binary hypothesis tests where the type-II errors are conditioned on the sequences in the typical set. In particular, the typical set is defined through (221) in the converse proof in Section VII-B, and the energy-arrival sequence falls into the set with high probability by (226). *
IV Proof of Theorem 1
The achievability proof of Theorem 1 relies on the following lemma, whose proof will be presented in Section V.
Lemma 4
Fix any , and such that
[TABLE]
Recall the definition of in (24). Then for all sufficiently large , there exist a natural number
[TABLE]
and an -code such that
[TABLE]
for some constant . More specifically, is defined as
[TABLE]
where
[TABLE]
In addition, equation (45) holds for any sufficiently large that satisfies
[TABLE]
[TABLE]
and
[TABLE]
and can be chosen to satisfy
[TABLE]
Remark 10
Lemma 4 guarantees the existence of a carefully designed save-and-transmit scheme with the saving phase being no greater than the RHS of (51) and the message size being no less than the RHS of (45). Here, specifies the probability of energy outage induced by energy shortage and specifies the probability of decoding error induced by noise for the save-and-transmit scheme. As indicated by (51), the designed saving phase has to be increased as decreases. In addition, as indicated by (45), the designed message size has to be decreased as decreases.
The following corollary is a direct consequence of Lemma 4. The proof of Corollary 5 is given in Appendix D for completeness.
Corollary 5
Fix any . For any and that satisfy , there exists a sequence of -codes such that
[TABLE]
The converse proof of Theorem 1 relies on the following lemma, whose proof will be presented in Section VI.
Lemma 6
Fix any . For any sufficiently large and any -code, we have
[TABLE]
for some . More specifically, is defined as
[TABLE]
where
[TABLE]
In addition, equation (53) holds for any sufficiently large that satisfies
[TABLE]
Remark 11
For any , since is negative, it follows from Corollary 5 and Lemma 6 that the second-order term in the asymptotic expansion of is -O\bigl{(}\sqrt{L(n+L)}\bigr{)}=-O\bigl{(}\sqrt{Ln}\bigr{)}.
We are now ready to prove Theorem 1.
Proof:
For any , the left inequality of (29) follows directly from Corollary 5 and the definition of in (25). The right inequality of (29) follows directly from (53) in Lemma 6 and the definition of in (26). Using (29) and Definition 6, we obtain (27) as well as (23). â
Remark 12
Theorem 1 no longer holds when for some . As we can see above, the proof of Theorem 1 hinges on the achievability and converse results stated in Lemmas 4 and 6 respectively. However, when , both Lemmas 4 and 6 do not yield the desired respective achievability and converse bounds. This is due to the fact that the length of the saving period guaranteed by (51) in Lemma 4 grows linearly with when and hence the overall rate achievable by save-and-transmit does not converge to the desired . In addition, the upper bound (53) in Lemma 4 does not converge to the desired when .
V Proof of Lemma 4 via the Save-and-Transmit Strategy
In this section, we investigate the save-and-transmit scheme proposed in [3, Sec. IV] in the finite blocklength regime. We will use this achievability scheme to prove Lemma 4.
V-A Prerequisites
The following lemma is useful for obtaining a lower bound on the length of the energy-saving phase. The proof is deferred to Appendix E.
Lemma 7
Let and be two natural numbers. Suppose and are two sequences of i.i.d. random variables such that and are independent,
[TABLE]
[TABLE]
and
[TABLE]
Suppose there exists a sufficiently small such that and
[TABLE]
and we define
[TABLE]
Then,
[TABLE]
In order to adapt Lemma 7 to the block energy arrival setting, we define the following quantities for each and each (cf. (60) and (61)):
[TABLE]
[TABLE]
and
[TABLE]
The following corollary adapts Lemma 7 to the block energy arrival setting. Since the proof of the corollary is tedious, it is deferred to Appendix F.
Corollary 8
Fix a natural number . Suppose is a sequence of i.i.d. random variables where , and suppose is a sequence of random variables that are distributed according to (13) (in an i.i.d.-block manner with block size ). Fix an and define
[TABLE]
If
[TABLE]
and
[TABLE]
then
[TABLE]
The following lemma [17] is standard for proving achievability results in the finite blocklength regime and its proof can be found in [4, Th. 3.8.1].
Lemma 9** (Implied by Shannonâs bound [17])**
Let be the probability distribution of a pair of random variables . Let be a sequence of i.i.d. random variables where . For each and each , we have
[TABLE]
V-B Proof of Lemma 4
Fix any , and fix any and such that
[TABLE]
Define , , and as in (63), (64), (65) and (66) respectively. Fix a sufficiently large such that (48), (49) and (50) hold. Since (48) holds, it follows from the definition of in (64) that (67) also holds. Define
[TABLE]
which satisfies (68) and specifies the number of time slots which are used for saving energy. Consider the random code that uses the channel times as follows:
**Save-and-Transmit Random Codebook Construction
**Let denote the length- zero tuple. Define the distribution as
[TABLE]
In addition, define the distribution as . Construct i.i.d. random tuples denoted by such that is distributed according to , where will be carefully chosen later when we evaluate the probability of decoding error. Define
[TABLE]
for each and construct the random codebook
[TABLE]
The codebook is revealed to both the encoder and the decoder. To facilitate discussion, we let and denote the symbols in and respectively for each . Since the first symbols of each random codeword are zeros by (74), the source will just transmit [math] with probability until time slot when the amount of energy is available for encoding .
**Encoding under the EH Constraints
**For each , recalling that is the element of , we construct recursively for the random variable
[TABLE]
To send message which is uniformly distributed on , the source transmits in time slot for each . Note that the source transmits [math] with probability in the first times slots by (74) and (76), and the transmitted codeword satisfies the EH constraints (15) by (76).
**Threshold Decoding
**Upon receiving
[TABLE]
where
[TABLE]
denotes the transmitted tuple specified in (76) and by the channel law (cf. (17)), the destination constructs its subtuple denoted by by keeping only the last symbols of . Recalling that denotes the channel law and , we define the joint distribution
[TABLE]
and define the joint distribution as
[TABLE]
Then, the decoder declares (with a slight abuse of notation, we write instead of ) to be the transmitted message where is the decoding function defined as follows: If there exists a unique index such that
[TABLE]
then is assigned the value . Otherwise, is assigned a random value uniformly distributed on .
**Calculating the Probability of Violating the EH Constraints
**Defining to be the tuple containing the last symbols of , we obtain from (74), (76) and (78) that
[TABLE]
Using Corollary 8 and noting that and are independent by construction, we obtain
[TABLE]
[TABLE]
**Calculating the Probability of Decoding Error
**Defining to be the tuple containing the last symbols of and recalling and are the tuples containing the last symbols of and respectively, we obtain from (77) and (84) that
[TABLE]
where and are independent and . Following (81) and (85), we define the events
[TABLE]
and consider the following chain of inequalities for each :
[TABLE]
where
- (a)
follows from symmetry of the random codebook construction and the union bound. 2. (b)
follows from Lemma 9 and (86). 3. (c)
follows from the fact that and are independent copies of and respectively by construction.
**Applying the Berry-Esséen Theorem
**Using (79), (73) and (17) we conclude that and are independent, , and
[TABLE]
In order to ensure the first term in (90) can be bounded above by a simple term, we first define the mean , the variance and the third absolute moment of as follows: , and
[TABLE]
where the derivation of the last inequality is relegated to Appendix G-A. Clearly,
[TABLE]
After defining , and , we choose to be the unique integer that satisfies
[TABLE]
where
[TABLE]
Following (90), we obtain the following inequality where the random variables are distributed according to :
[TABLE]
where
- (a)
follows from (86) and (95). 2. (b)
follows from the Berry-Esséen theorem for i.i.d random variables [20], i.e., for all where , and denote the mean, the variance and the third absolute moment of respectively.
We are ready to compute the probability of decoding error as follows, where the random variables are distributed according to :
[TABLE]
where (a) follows from the threshold decoding rule (cf. (81) and (86)), (90) and (98).
**Obtaining a Lower Bound on the Message SizeÂ
**Using (95), (102) and the simple fact that , we conclude that the constructed code is an -code that satisfies
[TABLE]
Using Taylorâs theorem together with the fact by (50) that , we obtain
[TABLE]
where the derivation of (104) is relegated to Appendix G-B. Combining (103) and (105) and recalling the definition of in (46), we have
[TABLE]
**Obtaining an Upper Bound on the Length of Saving Phase
**Since the constructed -code satisfies (45) by (106), it remains to show that satisfies (51). To this end, recall the definition of in (72) and consider the following bounds on , and :
[TABLE]
[TABLE]
and
[TABLE]
In order to obtain an upper bound on , consider the following chain of inequalities:
[TABLE]
where
- (a)
follows from (115) and the definition of in (64). 2. (b)
follows from (112) and the definition of in (64).
Consequently, the constructed -code satisfies (45) and (51) by (106) and (118) respectively. This completes the proof.
VI Proof of Lemma 6 via Binary Hypothesis Testing
VI-A Prerequisites
The following definition concerning the non-asymptotic fundamental limits of a simple binary hypothesis test is standard. See for example [21, Section III-E].
Definition 8
Let and be two probability distributions defined on some common alphabet . Let
[TABLE]
be the set of randomized binary hypothesis tests between and where indicates the test chooses , and let be a real number. The minimum type-II error in a simple binary hypothesis test between and with type-I error no larger than is defined as
[TABLE]
The existence of a minimizing test is guaranteed by the Neyman-Pearson lemma.
We state in the following lemma and proposition some important properties of , which are crucial for the proof of Theorem 1. The proof of the two statements in the following lemma can be found in [22, Lemma 1] and [23, Sec. 2.3] respectively.
Lemma 10
Let and be two probability distributions defined on some , and let be a function whose domain contains . Then, the following two statements hold:
(Data processing inequality (DPI)) . 2. 2.
For all , .
The proof of the following proposition is similar to Lemma 3 in [22] and therefore omitted.
Proposition 11
Let and be two probability distributions defined on and respectively for some , and let be the marginal distributions of . Suppose is the uniform distribution, and let
[TABLE]
be a real number in . Then,
[TABLE]
VI-B Proof of Lemma 6
Fix an , an which is larger than the RHS of (56) and an -code for the AWGN EH channel. Using Definition 1, we have
[TABLE]
for the -code. Define
[TABLE]
to be the smallest positive integer such that is a multiple of . Then, we can always construct an -code by appending carefully chosen to each transmitted sequence generated by the -code such that
[TABLE]
The technique of transforming the peak power inequality constraint (123) to a power equality constraint (125) by appending an extra symbol has been employed in [21, Lemma 39] and [24, Theorem 4.4] (and is called the Yaglom-map trick). To simplify notation, we let
[TABLE]
where is a multiple of and satisfies (56).
**Obtaining a Lower Bound on the Error Probability in Terms of the Type-II Error of a Hypothesis Test
**Let be the probability distribution induced by the -code constructed above, where can be expressed according to (18). In view of (125), we assume without loss of generality that
[TABLE]
for all Borel measurable . All the probability and expectation terms in the rest of this proof are evaluated according to unless specified otherwise. Define
[TABLE]
where
[TABLE]
It follows from Proposition 11 and Definition 1 with the identifications , , , and that
[TABLE]
**Using the DPI to Introduce the Channel Inputs and Outputs
**Using the DPI of in Lemma 10, we have
[TABLE]
Fix a to be specified later. Since
[TABLE]
it follows from (130), the definition of in (128), (131) and Lemma 10 that
[TABLE]
**Simplifying the Non-Asymptotic Bound
**Combining (17) and (129), we have
[TABLE]
for each . Due to the power equality constraint imposed on the codewords, we have
[TABLE]
Letting
[TABLE]
for each , we obtain from (134) and (135) that
[TABLE]
Combining (133) and (137), we have
[TABLE]
**Evaluating the Distribution of the Sum of Random Variables
**In order to simplify (138), we now investigate the distribution of the sum of random variables . We will show in the following that the distribution of can be evaluated in closed form. Define the function
[TABLE]
We begin evaluating the distribution of by examining the distribution of (cf. (136)) as follows. Let
[TABLE]
be the characteristic function of where denotes the imaginary unit. In order to evaluate a closed-form expression for (140), we write
[TABLE]
In order to simplify the RHS of (142), we consider the following chain of equalities for each :
[TABLE]
Since is a standard normal random variable which is independent of by the channel law in (17), it follows from straightforward calculations based on (139) that the following equality holds with probability for each :
[TABLE]
Using (144) and (146), we have the following equality that holds with probability for each :
[TABLE]
Combining (142) and (147), we obtain
[TABLE]
Let be independent copies of the standard normal random variable. Straightforward calculations reveal that
[TABLE]
Therefore,
[TABLE]
by (148) and (149), i.e., the characteristic functions of and are equal. Consequently, the probability distributions of and are equal almost everywhere, which implies from (136) and (139) that the probability distributions of and are equal almost everywhere, which then implies from (138) that
[TABLE]
We recall from (124) that is a multiple of and define
[TABLE]
for each (cf. the definition of in (2)). Then, equation (151) can be rewritten as
[TABLE]
**Applying the Berry-Esséen Theorem
**Using the facts that are i.i.d., are i.i.d. and
[TABLE]
for each , we conclude in view of (152) that are i.i.d. where
[TABLE]
In order to invoke the Berry-Esséen Theorem to bound the probability term in (153), we define the following quantities related to :
[TABLE]
[TABLE]
and
[TABLE]
where the derivation of the last inequality is relegated to Appendix H-A. Recalling the definition of in (55), we use the Berry-Esséen theorem for i.i.d. random variables [20] to obtain
[TABLE]
where the argument of satisfies
[TABLE]
Following (153) and letting
[TABLE]
we can express (153) as
[TABLE]
which implies from the definition of in (158), the inequality in (162) and the definition of in (152) that
[TABLE]
Using Taylorâs theorem together with the fact by (50) that , we obtain
[TABLE]
whose derivation is relegated to Appendix H-B. Combining (166) and (167) and recalling the definition of in (54), we have
[TABLE]
Using (168) and the fact by (124) and (126) that , we have
[TABLE]
This completes the proof.
VII When the Length of Each Energy Arrival Block Grows Linearly in Blocklength
This section focuses on the scenario for some real constant . Define
[TABLE]
to be the number of length- energy-arrival blocks. The total number of energy-arrival blocks is where the length of each of the first energy-arrival blocks equals and the length of the energy-arrival block equals
[TABLE]
The following proposition gives us a lower bound and an upper bound on the length of the energy-arrival block, which are useful for the achievability and the converse proofs of Theorem 2 respectively. The proof of Proposition 12 is straightforward and is deferred to Appendix I.
Proposition 12
For all sufficiently large ,
[TABLE]
and
[TABLE]
where is the length of the energy-arrival block and and were defined in (34) and (35) respectively.
VII-A Achievability proof of Theorem 2
In this section, we propose an adaptive save-and-transmit code which will be used to prove the achievability part of Theorem 2. The adaptive save-and-transmit code enables the source to transmit information at a rate close to for each block . For each block , since the destination does not know the EH random variables , the source first needs to quantize and convey the quantized version to the destination before adjusting the transmission rate. To facilitate discussion, we let
[TABLE]
and define the set of quantization points
[TABLE]
In addition, define the quantization mapping such that is the unique quantization point that satisfies
[TABLE]
In order to enable communication at a rate close to and with error probability in block for each , we propose to use an adaptive save-and-transmit code in each block so that node can adapt the coding rate to the EH process.
Definition 9
An -adaptive code consists of the following:
A message alphabet . The message is a sequence of i.i.d. uniform bits. 2. 2.
An adaptive encoding function which depends on such that
[TABLE]
and
[TABLE]
for each . 3. 3.
A decoding function for decoding at node such that the message estimate is produced by setting . Define the mapping as
[TABLE]
Then, the probability of decoding error adapted to , which is defined as
[TABLE]
is no larger than .
By Definition 9, node can use an -adaptive code to transmit bits to node with small error probability in each length- energy-arrival block. We use âadaptiveâ to describe the code because the number of bits that can be conveyed by the code changes with . We will prove the achievability part of Theorem 2 by using an adaptive code that has the following two features for every :
- (i)
Each of the first symbols in the block sent by node is the constant symbol so that with probability larger than , the destination can estimate correctly. 2. (ii)
In the remaining symbols in the block, node intends to use a Gaussian codebook with average power to transmit i.i.d. uniform bits at a rate close to and with error probability .
Feature (i) is based on Proposition 13 to be presented later. Feature (ii) will be established through proving the existence of an adaptive code with the desired properties in Lemma 15. The proof of the following proposition is simple and thus deferred to Appendix J.
Proposition 13
[TABLE]
The following lemma is useful for proving the achievability part of Theorem 2. Since Lemma 14 is a direct consequence of [15, Th. 1], its proof is relegated to Appendix K.
Lemma 14
The following statement holds for any sufficiently large . Fix an arbitrary and suppose holds with probability . Then, there exists an -code such that
[TABLE]
To facilitate discussion, we call the -code an -code.
The following lemma is based on Proposition 13 and Lemma 14.
Lemma 15
For any sufficiently large , there exists an -adaptive code.
Proof:
We construct an -adaptive code in two steps as follows.
In each of the first time slots in the length- block, node sends the constant symbol , which is always possible because by (176). Upon receiving , node produces an estimate of , denoted by , by setting
[TABLE]
It follows from the definition of in (175), Proposition 13 and (183) that
[TABLE] 2. 2.
In the remaining time slots, node will choose an -code based on the knowledge of as follows: Node calculates and transmits i.i.d. uniform bits using a predetermined -code whose existence is guaranteed by Lemma 14. The encoding strategy of is known to node , which will decode the bits using the decoder of the -code predetermined by node and output the bits estimate . By the definition of the codes,
[TABLE]
For the adaptive code described above, it follows from (183) and (185) together with the union bound that
[TABLE]
which implies that the adaptive code is an -adaptive code. â
We are ready to prove the achievability part of Theorem 2.
Proof:
Fix an . Our goal is to prove
[TABLE]
It suffices to show that
[TABLE]
for all . Fix an arbitrary . By the definition of in (36), we have
[TABLE]
for some . Let be a sufficiently large number such that
[TABLE]
We want to show that there exists a sequence of -codes such that
[TABLE]
which will then imply (188). To this end, fix a sufficiently large such that (172) holds, (173) holds,
[TABLE]
and
[TABLE]
where is as defined in (174). The number of i.i.d. uniform bits that can be transmitted by the code is chosen to be
[TABLE]
In the rest of the proof, we are devoted to constructing an -code followed by showing that the error probability is bounded above by .
**Construction of an -code:
**Recall that the length of each of the first blocks is . To facilitate discussion, define
[TABLE]
to be the length of the block for each , and define
[TABLE]
to be a lower bound on the length of the block (due to (173)). Since by construction, we will construct an -code by concatenating blocks of length- adaptive codes and one block of length- adaptive code as described below. The message of the -code is a sequence of i.i.d. uniform bits denoted by . Then for each block , node uses an -adaptive code to transmit i.i.d. uniform bits. A decoding error is declared if one of the following cases occurs:
- (i)
The total number of transmitted bits is less than , i.e., the following event occurs:
[TABLE]
- (ii)
Provided that occurs, the bits estimates output by denoted by are not equal to the transmitted bits, i.e., the following event occurs:
[TABLE]
**Analysis of the Error Probability:
**In the rest of the proof, all the probability terms are evaluated according to the distribution induced by the -code constructed above. Since the -code is a concatenation of blocks of adaptive codes and one block of adaptive code, it follows from Definition 9 and the union bound that
[TABLE]
which together with (172) and (197) implies that the error probability of the -code is bounded above as
[TABLE]
In order to obtain an upper bound on the last term in (200) in terms of , we consider
[TABLE]
where
- (a)
follows from the definition of in (176) and the fact that
[TABLE] 2. (b)
follows from (190) and the union bound.
On the other hand, combining (189) with the fact that are i.i.d., we have
[TABLE]
which implies that
[TABLE]
which then together with (195), (196) and (194) implies that
[TABLE]
Using (209), (190) and the union bound, we obtain
[TABLE]
Combining (205), (210) and (193), we have
[TABLE]
Using (200), (192) and (211), we have
[TABLE]
Therefore, the constructed -code is an -code where satisfies (194). Consequently, for any , there exists a sequence of -codes where satisfies (194) such that (191) holds, which then implies (188). Since is arbitrary, we have (187). â
VII-B Converse Proof of Theorem 2
Fix an . Our goal is to prove
[TABLE]
It suffices to show that
[TABLE]
for all . Fix an arbitrary and an -achievable rate . By the definition of in (37),
[TABLE]
for some . Let be a sufficiently large number such that
[TABLE]
In addition, since is -achievable, it follows from Definition 4 that there exists a sequence of -codes such that
[TABLE]
Fix a sufficiently large such that (172) holds, (173) holds and
[TABLE]
and fix the corresponding -code. Let be the probability distribution induced by the -code. Unless specified otherwise, all the probability, expectation and variance terms are evaluated according to . Since are i.i.d. by assumption, it follows from (216) that
[TABLE]
Define
[TABLE]
and
[TABLE]
Since the average error probability of the code is no larger than , we have
[TABLE]
Consider
[TABLE]
where (a) follows from (222) and Markovâs inequality, which implies that
[TABLE]
**Obtaining a Lower Bound on the Error Probability in Terms of the Type-II Error of a Hypothesis Test
**Define
[TABLE]
for all where
[TABLE]
It follows from Proposition 11 and Definition 1 with the identifications , , , and that
[TABLE]
for all .
**Using the DPI to Introduce the Channel Inputs and Outputs
**Using the DPI of in Lemma 10, we have
[TABLE]
for all . For each , fix a to be specified later. Since
[TABLE]
it follows from (229), the definition of in (227), (230) and Lemma 10 that
[TABLE]
for all .
**Simplifying the Non-Asymptotic Bound
**Combining (17) and (228), we have
[TABLE]
for each and each . Let
[TABLE]
for each . Then, it follows from (234) and the energy-harvesting constraints (15) that
[TABLE]
Combining (233) and (236), we have for each
[TABLE]
In order to simplify the RHS of (237), we choose
[TABLE]
recall the definition of in (221) and rewrite (237) as
[TABLE]
**Applying Chebyshevâs inequality
**Following (239), we evaluate for each
[TABLE]
and
[TABLE]
where (a) and (b) are due to the definition of in (235) and the following fact: are i.i.d. standard normal random variables that are independent of . Using (240), (244) and Chebyshevâs inequality, we have for each
[TABLE]
Combining (239) and (246), we have
[TABLE]
Since the number of energy-arrival blocks of length equals by (172) and the length of the last block is no larger than by (173), it follows from (247) that
[TABLE]
In order to simplify the first term in (249), we define as
[TABLE]
and consider the following chain of inequalities where the sets and are assumed to be Borel measurable:
[TABLE]
Combining (247) and (255), we obtain
[TABLE]
which together with (217) implies that
[TABLE]
Since is arbitrary and is an arbitrary -achievable rate, the inequality (213) follows from (257).
VIII Concluding Remarks and Future Work
This paper studies the -capacity and the second-order coding rate for the AWGN EH channel with an infinite battery under the assumption that the error probabilities do not vanish as the blocklength increases. The EH process is assumed to be block i.i.d. where the blocks have length .
For the case where is a constant or grows sublinearly in the blocklength , we have the following two findings stated in Theorem 1: (i) The -capacity is the same for all , i.e., the strong converse holds; (ii) A lower bound and an upper bound on the second-order coding rate have been obtained. where the lower bound is obtained by analyzing the conventional save-and-transmit strategy [3].
For the case where grows linearly in , we prove in Theorem 2 a lower bound and an upper bound on the -capacity.
Two interesting directions for future research are obtaining the full characterization of the -capacity and good approximations on the second-order coding rate for , i.e., a strengthening of Theorem 2. In addition, while this work investigates only optimal codes which have high decoding complexity, future research may compare the performances between an industrial low-complexity yet (first-order) optimal code for the AWGN channel and its (adaptive) save-and-transmit counterpart for the AWGN EH channel (as performed for the binary-input EH channel in [25]). Finally, one may explore analogies among AWGN EH channels, slow fading channels and channels with mixed states for the case in which grows linearly in .
Appendix A Proof of Corollary 1
In view of Theorem 1, it suffices to prove that for the case where is a constant. To this end, we let be a fixed constant and consider the following chain of inequalities:
[TABLE]
where
- (a)
is due to the easily verified fact that
[TABLE] 2. (b)
is due to the fact that \varepsilon<1/2<\min\big{\{}1-\frac{1}{\sqrt{2\pi\mathrm{e}}},\frac{1}{\sqrt{e}}\big{\}}.
Appendix B Proof of Corollary 2
Fix any . Using the easily verified fact that , we obtain that
[TABLE]
which together with (31) and (30) implies that (32) holds. The rest of the proof is dedicated to showing . Let . Since due to the assumption that , we have , which then implies that .
Appendix C Proof of Corollary 3
Suppose has a continuous and strictly increasing cdf (i.e., the mapping is continuous and strictly increasing on ). It follows that
[TABLE]
is continuous in and strictly increasing, which then implies that
[TABLE]
which together with Theorem 2 implies that (39) holds for all .
Appendix D Proof of Corollary 5
To facilitate discussion, let denote the RHS of (51), and simple calculations reveal that
[TABLE]
For each , let be the unique natural number that satisfies
[TABLE]
It is clear from (272) and (273) that
[TABLE]
By Lemma 4, there exists for each sufficiently large an -code such that (45) holds, which implies from the left inequality in (273) that for each sufficiently large , there exists an -code such that
[TABLE]
which then implies from the right inequality in (273) that
[TABLE]
Combining the facts that
[TABLE]
[TABLE]
and
[TABLE]
we conclude that
[TABLE]
This completes the proof.
Appendix E Proof of Lemma 7
In this proof, all the probability, expectation and variance terms are evaluated according to . In order to obtain an upper bound on , we construct the following sequence denoted by . For each , define recursively444The construction of is inspired by a standard proof of Kolmogorovâs inequality. as
[TABLE]
By inspecting (282), we have
[TABLE]
which implies that
[TABLE]
Define for each
[TABLE]
Following (284), we consider the following chain of inequalities for any :
[TABLE]
where (a) follows from Markovâs inequality. In order to simplify the RHS of (289), we consider the following chain of inequalities for each :
[TABLE]
where (a) follows from the independence between and due to the independence between and . Combining (289) and (294), we have
[TABLE]
Since
[TABLE]
by (286), it follows from (295) that
[TABLE]
Since and are independent, we can rewrite (297) as
[TABLE]
In order to simplify the RHS of (298), we use the following two facts, whose proofs can be found in [15, Appendix]: For any ,
[TABLE]
and
[TABLE]
Fix a sufficiently small such that and . Following (298), we use (299), (300) and (58) to obtain
[TABLE]
and
[TABLE]
which implies that
[TABLE]
Define and as in (60) and (61) respectively. It then follows from (301) and (303) that
[TABLE]
and
[TABLE]
Combining (284), (298), (304), (305), (299) and (300), we obtain
[TABLE]
Appendix F Proof of Corollary 8
Since is distributed according to an i.i.d.-block manner with block size , we cannot apply Lemma 7 directly for to bound . In the following, we will construct two sequences based on and so that Lemma 7 can be applied to the resultant sequences. Define
[TABLE]
and
[TABLE]
Let be a sequence of i.i.d. random variables where , and let be a sequence of random variables that are distributed according to (13). Since and , we have
[TABLE]
To simplify notation, define
[TABLE]
for each , and define
[TABLE]
for each . For each , define to be the unique integer in that satisfies , i.e., is the index of the information block that contains the symbol of . Then for each ,
[TABLE]
and
[TABLE]
with probability . Therefore,
[TABLE]
for each , which implies that
[TABLE]
which then implies from (310) that
[TABLE]
By construction, is a sequence of i.i.d. random variables with
[TABLE]
and
[TABLE]
and is a sequence of i.i.d. Gaussian random variables with
[TABLE]
[TABLE]
and
[TABLE]
Combining (318) and (321) and (322), we have
[TABLE]
and
[TABLE]
Since (323) holds, we can apply Lemma 7 to and if the following two statements hold for (which was defined in (66)):
[TABLE]
and
[TABLE]
To this end, we suppose and satisfy (67) and (68) respectively. Recalling the definition of in (64), we obtain from the definition of in (66) and (67) that
[TABLE]
which implies from (324) that
[TABLE]
In addition,
[TABLE]
Consequently, and are two sequences of i.i.d. random variables that satisfy (323), (328) and (330). Therefore, we have the following inequality due to Lemma 7 together with the definitions of and in (63) and (65) respectively and the equalities (318), (319) and (322):
[TABLE]
In order to simplify the RHS of (331), we consider
[TABLE]
Following (331), we consider
[TABLE]
where (a) and (b) follow from the fact due to (66) and (308) that . Combining (331) and (337), we have
[TABLE]
which implies from (317) that (69) holds.
Appendix G Simple derivations in the proof of Lemma 4
G-A Derivation of (93)
Consider the following chain of inequalities:
[TABLE]
where (a) follows from (91) and the triangle inequality for the -norm.
G-B Derivation of (104)
By Taylorâs theorem, we have
[TABLE]
for some real number
[TABLE]
Since
[TABLE]
and
[TABLE]
by (10) and (343) respectively, it follows that
[TABLE]
Consequently, (104) follows from (342) and (346).
Appendix H Simple derivations in the proof of Lemma 6
H-A Derivation of (160)
Consider the following chain of inequalities:
[TABLE]
where (a) follows from the triangle inequality for the -norm.
H-B Derivation of (167)
By Taylorâs theorem, we have
[TABLE]
for some real number
[TABLE]
Since
[TABLE]
by (10) and
[TABLE]
by (350), it follows that
[TABLE]
Appendix I Proof of Proposition 12
Fix any sufficiently large such that
[TABLE]
Then,
[TABLE]
In addition,
[TABLE]
Combining (358) and (362), we have
[TABLE]
It remains to prove (173). Using (363) and the definition of in (33), we have
[TABLE]
which together with the definition of in (35) implies (173).
Appendix J Proof of Proposition 13
Consider the following chain of inequalities where all the probability and expectation terms are evaluated with respect to :
[TABLE]
where (a) follows from Chebyshevâs inequality.
Appendix K Proof of Lemma 14
Since by assumption, it follows that . Therefore, for any and any sufficiently large that satisfies
[TABLE]
and
[TABLE]
we can use [15, Th. 1] to conclude that there exists an -code such that
[TABLE]
where
[TABLE]
denotes the length of the initial saving period before any transmission occurs and denotes the length of the actual transmission period. Let and fix a sufficiently large that satisfies
[TABLE]
[TABLE]
and
[TABLE]
Since (368) and (369) hold by (373) and (374), it follows from (371) and (372) that there exists an -code such that
[TABLE]
Acknowledgements
The authors would like to thank Associate Editor Michele Wigger and the three anonymous reviewers for the useful comments that greatly improve the presentation of this work.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] F. Zhang and V. K. N. Lau, âClosed-form delay-optimal power control for energy harvesting wireless system with finite energy storage,â IEEE Trans. Signal Process. , vol. 62, no. 21, pp. 5706â5715, 2014.
- 2[2] D. Tse and P. Viswanath, Fundamentals of Wireless Communication . Cambridge, U.K.: Cambridge University Press, 2005.
- 3[3] O. Ozel and S. Ulukus, âAchieving AWGN capacity under stochastic energy harvesting,â IEEE Trans. Inf. Theory , vol. 58, no. 10, pp. 6471â6483, 2012.
- 4[4] T. S. Han, Information-Spectrum Methods in Information Theory . Springer Berlin Heidelberg, 2003.
- 5[5] M. Hayashi, âInformation spectrum approach to second-order coding rate in channel coding,â IEEE Trans. Inf. Theory , vol. 55, no. 11, pp. 4947â4966, 2009.
- 6[6] R. H. Etkin, D. N. C. Tse, and H. Wang, âGaussian interference channel capacity to within one bit,â IEEE Trans. Inf. Theory , vol. 54, no. 12, pp. 5534â5562, Dec. 2008.
- 7[7] A. ĂzgĂŒr, O. LĂ©vĂȘque, and D. Tse, âOperating regimes of large wireless networks,â Foundations and TrendsÂź in Networking , vol. 5, no. 1, pp. 1â107, 2011. [Online]. Available: http://dx.doi.org/10.1561/1300000016
- 8[8] R. Rajesh, V. Sharma, and P. Viswanath, âCapacity of Gaussian channels with energy harvesting and processing cost,â IEEE Trans. Inf. Theory , vol. 60, no. 5, pp. 2563â2575, 2014.
