On Achievable Rates of AWGN Energy-Harvesting Channels with Block Energy Arrival and Non-Vanishing Error Probabilities

Silas L. Fong, Vincent Y. F. Tan, and Ayfer Özgür

arXiv:1701.02088·cs.IT·September 5, 2017

TL;DR

This paper characterizes the achievable communication rates of an AWGN energy-harvesting channel with block energy arrivals, providing first- and second-order asymptotic expansions and proposing adaptive strategies for different energy arrival regimes.

Contribution

It characterizes the ε-capacity and bounds the second-order term for AWGN EH channels with block energy arrivals when L is constant or sublinear in n, and it introduces an adaptive save-and-transmit strategy for the regime where L grows linearly in n.

Findings

01. The ε-capacity (first-order term) is fully characterized when L is constant or grows sublinearly in n.

02. The second-order term scales as √(L/n) for constant or sublinear L and any ε < 1/2.

03. When L grows linearly in n, an adaptive save-and-transmit strategy achieves the lower bound on the ε-capacity.

Abstract

This paper investigates the achievable rates of an additive white Gaussian noise (AWGN) energy-harvesting (EH) channel with an infinite battery. The EH process is characterized by a sequence of blocks of harvested energy, which is known causally at the source. The harvested energy remains constant within a block while the harvested energy across different blocks is characterized by a sequence of independent and identically distributed (i.i.d.) random variables. The blocks have length $L$ , which can be interpreted as the coherence time of the energy arrival process. If $L$ is a constant or grows sublinearly in the blocklength $n$ , we fully characterize the first-order term in the asymptotic expansion of the maximum transmission rate subject to a fixed tolerable error probability $ε$ . The first-order term is known as the $ε$ -capacity. In addition, we obtain lower and…

Equations

\sum_{i=1}^{k}X_{i}^{2}\leq\sum_{i=1}^{k}E_{i}\quad\text{almost surely.}

b_{\ell}\triangleq(\ell-1)L

E_{b_{\ell}+1}=E_{b_{\ell}+2}=\dots=E_{b_{\ell}+L}

Y_{k}=X_{k}+Z_{k}

\mathrm{C}(P)\triangleq\frac{1}{2}\log(1+P)\quad\text{bits per channel use,}

\omega(1)=L=o(n).

\frac{1}{n}\log M_{n,\varepsilon}^{*}\geq\mathrm{C}(P)+V_{\varepsilon}^{-}\sqrt{\frac{L}{n}}-o\bigg(\sqrt{\frac{L}{n}}\bigg)

\frac{1}{n}\log M_{n,\varepsilon}^{*}\leq\mathrm{C}(P)+V_{\varepsilon}^{+}\sqrt{\frac{L}{n}}+o\bigg(\sqrt{\frac{L}{n}}\bigg)

V_{\varepsilon}\triangleq\sup\bigg\{S\in\mathbb{R}\,\bigg|\,\liminf_{n\to\infty}\frac{\log M_{n,\varepsilon}^{*}-n\mathrm{C}(P)}{\sqrt{Ln}}\geq S\bigg\}

\mathcal{N}(z;\mu,\sigma^{2})\triangleq\frac{1}{\sqrt{2\pi\sigma^{2}}}e^{-\frac{(z-\mu)^{2}}{2\sigma^{2}}}.

\Phi(a)\triangleq\int_{-\infty}^{a}\mathcal{N}(z;0,1)\,\mathrm{d}z.

\mathbb{E}[E_{1}]=P

p_{E_{k}|E^{k-1}}(e_{k}|e^{k-1})=\begin{cases}p_{E_{1}}(e_{k})&\text{if }k=b_{\ell}+1\text{ for some }\ell\in\mathbb{N},\\ \boldsymbol{1}\{e_{k}=e_{k-1}\}&\text{otherwise.}\end{cases}

p_{W,E^{k},X^{k-1},Y^{k-1}}=p_{E_{k}|E^{k-1}}\,p_{W,E^{k-1},X^{k-1},Y^{k-1}}.

\Pr\bigg\{\sum_{i=1}^{k}X_{i}^{2}\leq\sum_{i=1}^{k}E_{i}\,\bigg|\,E^{n}=e^{n},W=w\bigg\}=1

p_{W,E^{k},X^{k},Y^{k}}=p_{W,E^{k},X^{k},Y^{k-1}}\,p_{Y_{k}|X_{k}}

p_{Y_{k}|X_{k}}(y_{k}|x_{k})=q_{Y|X}(y_{k}|x_{k})=\mathcal{N}(y_{k}-x_{k};0,1)

p_{W,E^{n},X^{n},Y^{n},\hat{W}}

\liminf_{n\to\infty}\frac{1}{n}\log M\geq R.

\liminf_{n\to\infty}\frac{\log M-nC_{\varepsilon}}{\sqrt{Ln}}\geq S.

V_{\varepsilon}\triangleq\sup\{S\in\mathbb{R}\,|\,S\text{ is a second-order }\varepsilon\text{-achievable rate}\}.

V_{\varepsilon}^{*}=\begin{cases}0&\text{if }\lim_{n\to\infty}\frac{\sqrt{Ln}}{f(n)}=0,\\ +\infty\text{ or }-\infty&\text{if }\lim_{n\to\infty}\frac{\sqrt{Ln}}{f(n)}=\infty.\end{cases}

C_{\varepsilon}=\mathrm{C}(P).

\varrho\triangleq2\bigg(\frac{\mathbb{E}[E_{1}^{2}]}{P^{2}}+1\bigg),

V_{\varepsilon}^{-}\triangleq\begin{cases}-\mathrm{C}(P)\sqrt{\varrho\log\frac{1}{\varepsilon}}&\text{if }\omega(1)=L=o(n),\\ \sup\limits_{(\varepsilon_{1},\varepsilon_{2})\in(0,1)^{2}:\,\varepsilon_{1}+\varepsilon_{2}=\varepsilon}\bigg\{-\mathrm{C}(P)\sqrt{\varrho\log\frac{1}{\varepsilon_{1}}}+\sqrt{\frac{P(\log e)^{2}}{L(1+P)}}\,\Phi^{-1}(\varepsilon_{2})\bigg\}&\text{if }L\text{ is a constant,}\end{cases}

V_{\varepsilon}^{+}\triangleq\frac{\log e}{2(1+P)}\sqrt{2P^{2}+\mathbb{E}[E_{1}^{2}]}\,\Phi^{-1}(\varepsilon).

V_{\varepsilon}^{-}\leq V_{\varepsilon}\leq V_{\varepsilon}^{+}.

M_{n,\varepsilon}^{*}\triangleq\sup\{M\in\mathbb{N}\,|\,\text{there exists an }(n,M,\varepsilon)\text{-code}\}

\mathrm{C}(P)+V_{\varepsilon}^{-}\sqrt{\frac{L}{n}}-o\bigg(\sqrt{\frac{L}{n}}\bigg)\leq\frac{1}{n}\log M_{n,\varepsilon}^{*}\leq\mathrm{C}(P)+V_{\varepsilon}^{+}\sqrt{\frac{L}{n}}+o\bigg(\sqrt{\frac{L}{n}}\bigg).

V_{\varepsilon}^{--}\triangleq\begin{cases}-\mathrm{C}(P)\sqrt{\varrho\log\frac{1}{\varepsilon}}&\text{if }\omega(1)=L=o(n),\\ -\bigg(\mathrm{C}(P)\sqrt{2\varrho}+\sqrt{\frac{4P\log e}{1+P}}\bigg)\sqrt{\log\frac{1}{\varepsilon}}&\text{if }L\text{ is a constant,}\end{cases}

Full text

Silas L. Fong, Vincent Y. F. Tan, and Ayfer Özgür

S. L. Fong is with the Department of Electrical and Computer Engineering, National University of Singapore, Singapore 117583 (e-mail: [email protected]). V. Y. F. Tan is with the Department of Electrical and Computer Engineering, National University of Singapore, Singapore 117583, and also with the Department of Mathematics, National University of Singapore, Singapore 119076 (e-mail: [email protected]). A. Özgür is with the Department of Electrical Engineering, Stanford University, CA 94305, USA (e-mail: [email protected]).

Abstract

This paper investigates the achievable rates of an additive white Gaussian noise (AWGN) energy-harvesting (EH) channel with an infinite battery. The EH process is characterized by a sequence of blocks of harvested energy, which is known causally at the source. The harvested energy remains constant within a block while the harvested energy across different blocks is characterized by a sequence of independent and identically distributed (i.i.d.) random variables. The blocks have length $L$, which can be interpreted as the coherence time of the energy arrival process. If $L$ is a constant or grows sublinearly in the blocklength $n$, we fully characterize the first-order term in the asymptotic expansion of the maximum transmission rate subject to a fixed tolerable error probability $\varepsilon$. The first-order term is known as the $\varepsilon$-capacity. In addition, we obtain lower and upper bounds on the second-order term in the asymptotic expansion, which reveal that the second-order term scales as $\sqrt{\frac{L}{n}}$ for any $\varepsilon$ less than $1/2$. The lower bound is obtained by analyzing the save-and-transmit strategy. If $L$ grows linearly in $n$, we obtain lower and upper bounds on the $\varepsilon$-capacity, which coincide whenever the cumulative distribution function (cdf) of the EH random variable is continuous and strictly increasing. In order to achieve the lower bound, we propose a novel adaptive save-and-transmit strategy, which chooses different save-and-transmit codes across different blocks according to the energy variation across the blocks.

Index Terms:

Achievable rates, block energy arrival, energy-harvesting, non-vanishing error probabilities, save-and-transmit

I Introduction

The energy-harvesting (EH) channel consists of one source equipped with an energy buffer (also called battery), and one destination. For simplicity, we assume in this paper that the buffer has infinite capacity. At each discrete time $k\in\{1,2,\ldots\}$ , a random amount of energy $E_{k}\in[0,\infty)$ arrives at the buffer and the source transmits a symbol $X_{k}\in(-\infty,\infty)$ such that

$$\sum_{i=1}^{k}X_{i}^{2}\leq\sum_{i=1}^{k}E_{i}\quad\text{almost surely.}\tag{1}$$

This implies that the total harvested energy $\sum_{i=1}^{k}E_{i}$ must be no smaller than the energy of the codeword $\sum_{i=1}^{k}X_{i}^{2}$ at every discrete time $k$ for transmission to take place successfully. The knowledge of $E_{k}$ is available at the source at time $k$ before encoding $X_{k}$ , and the destination has no access to the energy-arrival process.
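The energy constraint above must hold at every time $k$, not only at the end of the codeword. A minimal Python sketch of this prefix check follows; the function name `satisfies_energy_constraint` and the two example sequences are illustrative assumptions, not taken from the paper.

```python
from itertools import accumulate

def satisfies_energy_constraint(x, e):
    """Return True if sum_{i<=k} x_i^2 <= sum_{i<=k} e_i for every prefix length k."""
    if len(x) != len(e):
        raise ValueError("x and e must have the same length")
    spent = accumulate(xi * xi for xi in x)   # running codeword energy
    harvested = accumulate(e)                 # running harvested energy
    return all(s <= h for s, h in zip(spent, harvested))

# In both examples the total harvested energy covers the total codeword energy,
# but the second sequence spends energy before it has arrived.
ok = satisfies_energy_constraint([0.0, 1.0, 1.0], [1.0, 0.5, 0.5])
too_early = satisfies_energy_constraint([1.5, 0.0, 0.0], [1.0, 1.0, 1.0])
```

The second example shows why the constraint is stated for every $k$: a check on the totals alone would accept a sequence that is infeasible at time $k=1$.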

We assume that $\{E_{i}\}_{i=1}^{\infty}$ arrive at the buffer in a block-by-block manner as follows: For each $\ell\in\mathbb{N}$ , let

$$b_{\ell}\triangleq(\ell-1)L\tag{2}$$

such that $b_{\ell}+1$ is the index of the first channel use within the $\ell^{\text{th}}$ block of energy arrival, where $L$ denotes the length of each block. In other words, the $\ell^{\text{th}}$ block of energy arrival starts at the $(b_{\ell}+1)^{\text{th}}$ channel use. The EH random variables that mark the starting points of the blocks (i.e., $\{E_{b_{\ell}+1}\}_{\ell=1}^{\infty}$) are assumed to be independent and identically distributed (i.i.d.) random variables where ${\mathbb{E}}[E_{1}^{3}]<+\infty$ and ${\mathbb{E}}[E_{1}]=P$ for some $P>0$. (Footnote 1: If the constraint ${\mathbb{E}}[E_{1}^{3}]<+\infty$ is replaced with the less stringent one ${\mathbb{E}}[E_{1}^{2}]<+\infty$, all the achievability results in this paper continue to hold. In fact, the only place that requires ${\mathbb{E}}[E_{1}^{3}]<+\infty$ is the use of the Berry-Esséen theorem in Section VI-B in the course of proving the converse of Theorem 1.) A large class of distributions of practical interest satisfies the third-moment assumption, including those with a well-defined moment generating function. In addition, we assume

$$E_{b_{\ell}+1}=E_{b_{\ell}+2}=\dots=E_{b_{\ell}+L}\tag{3}$$

for all $\ell\in\mathbb{N}$ . In other words, the harvested energy in each channel use within a block remains constant while the harvested energy across different blocks is characterized by a sequence of i.i.d. random variables with mean equal to $P$ . This block-by-block energy-arrival assumption is useful for modeling practical scenarios when the energy-arrival process evolves at a slower timescale compared to the transmission process [1, Sec. II-C]. This is often the case for most natural energy processes, such as solar energy or wind energy. The block i.i.d. EH process can model, for example, a solar panel which harvests energy from the sun, and the appearance of clouds can change randomly and block certain amounts of sunshine for a certain period of time. Similarly, this is a good model for a device which harvests RF energy from other transmitting devices in its environment. Such transmitting devices typically transmit continuously for certain periods of time and are silent for the remaining periods (as in TDMA, for example), which warrants a block i.i.d. model. Most importantly, the block i.i.d. model provides a simple way to study the impact of correlations in the EH process on the system capacity. Such block i.i.d. models are popularly used in wireless communication as a means to capture correlations in the channel fading process by a simple model. In that context, the block length $L$ is called the coherence time of the channel, which corresponds to the time duration over which the channel remains approximately constant [2, Sec. 2.3]. Analogously, we refer to $L$ as the coherence time of the energy arrival process.
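The block i.i.d. arrival model described above can be sketched in a few lines of Python. The helper name `block_energy_arrivals`, the seed, and the choice of an exponential distribution with mean $P=1$ (which has a finite third moment) are assumptions made only for illustration.

```python
import random

def block_energy_arrivals(n, L, draw, rng):
    """Return [E_1, ..., E_n]: one i.i.d. draw per block, held constant for L channel uses."""
    energies = []
    while len(energies) < n:
        level = draw(rng)              # E_{b_l + 1}, the first arrival of block l
        energies.extend([level] * L)   # the remaining arrivals in the block copy it
    return energies[:n]                # the last block may be cut off at n

rng = random.Random(0)
# Assumption: exponential arrivals with mean P = 1.
e = block_energy_arrivals(10, 4, lambda r: r.expovariate(1.0), rng)
```

With $n=10$ and $L=4$ the sketch produces two full blocks and one truncated block of length two.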

The channel noise of the EH channel is modeled as an additive white Gaussian noise (AWGN), which is described as follows. In each time slot $k\in\mathbb{N}$ , after the source has transmitted $X_{k}$ , the destination receives

$$Y_{k}=X_{k}+Z_{k}\tag{4}$$

where $\{Z_{k}\}_{k=1}^{\infty}$ are i.i.d. standard normal random variables. The above EH channel is referred to as the AWGN EH channel. It was shown by Ozel and Ulukus [3] that the capacity of the AWGN EH channel for the case $L=1$ is

$$\mathrm{C}(P)\triangleq\frac{1}{2}\log(1+P)\quad\text{bits per channel use,}\tag{5}$$

where $P={\mathbb{E}}[E_{1}]$ is the expectation of the harvested energy for each energy arrival. In this paper, we assume that $L$ can grow with $n$ and would like to investigate how the growth rate of $L$ affects the first- and second-order terms of the asymptotic expansion of the maximum transmission rate. The first-order term is also known as the $\varepsilon$ -capacity [4, Sec. 3.4], and the second-order term divided by an appropriate scaling (which is $\frac{1}{\sqrt{n}}$ in many cases including the AWGN channel) is known as the second-order coding rate [5]. The following two cases regarding the growth rate of $L$ will be investigated in this paper:

(i) $L$ is a constant or $L$ grows sublinearly in $n$. The latter statement means

$$\omega(1)=L=o(n).\tag{6}$$

(ii) $L$ grows linearly in $n$.

Note that in practice $L$ and $n$ are two independent parameters. The first one is dictated by the nature of the EH process and the second one is a design parameter typically dictated by the delay and reliability requirements of the application and the complexity constraints at the transmitter and the receiver. Depending on how fast the EH process changes over time, $L$ can be significantly smaller than $n$ or comparable to $n$ . In order to reveal the impact of the interplay between these two parameters on the second-order term, we couple these parameters in different ways, say $L=n^{\gamma}$ , and study the limiting case when both $L$ and $n$ approach $\infty$ for different couplings, i.e., different values of $\gamma$ in $[0,1]$ where $\gamma$ captures how large $L$ is with respect to $n$ . This allows us to identify how $L$ and $n$ together impact the second-order term. In particular, we conclude that it is the ratio of the two that determines the second-order term. The impact of correlation in the EH process can be interpreted as effectively decreasing the blocklength by a factor of $L$ . Note that keeping $L$ constant while taking $n$ to infinity, i.e., considering only the special case $\gamma=0$ , would lead to a degenerate regime where the correlation in the EH process does not play a significant role. The approach we take here has been extensively used in the wireless information theory literature to obtain asymptotic results in multi-parameter problems where the problem involves multiple independent parameters that can be large or small with respect to each other. (See for example the notion of generalized degrees of freedom in [6] and follow-up work, or Section 3.1 of [7] for a detailed discussion of a similar formulation in the context of scaling laws for wireless networks.)
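To make the coupling $L=n^{\gamma}$ concrete, the following Python sketch evaluates the capacity $\mathrm{C}(P)$ and the scale $\sqrt{L/n}$ of the second-order term for a few values of $\gamma$. The function names and the value $n=10{,}000$ are illustrative assumptions, and $L$ is treated as a real number rather than rounded to an integer.

```python
import math

def capacity(P):
    """C(P) = (1/2) log2(1 + P), in bits per channel use."""
    return 0.5 * math.log2(1.0 + P)

def second_order_scale(n, gamma):
    """sqrt(L / n) under the coupling L = n**gamma."""
    L = n ** gamma
    return math.sqrt(L / n)

n = 10_000
scales = {gamma: second_order_scale(n, gamma) for gamma in (0.0, 0.25, 0.5, 0.75, 1.0)}
```

For $\gamma=0$ the scale reduces to $1/\sqrt{n}$, and it grows with $\gamma$ until it no longer vanishes at $\gamma=1$, which matches the separate treatment of the linear regime in case (ii).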

I-A Main Contribution

We use $O(\cdot)$, $o(\cdot)$, $\omega(\cdot)$ and $\Theta(\cdot)$ to denote the standard asymptotic Bachmann-Landau notations, with the additional convention that the quantities they describe must be non-negative. The contributions of this paper are summarized as follows:

Case (i): When $L$ is a constant or grows sublinearly in $n$

First, we prove an achievable finite blocklength bound based on the save-and-transmit strategy of [3]. During the saving phase of the save-and-transmit strategy, energy is saved for a certain number of time slots. During this period, no information is transmitted. Subsequently, during the transmission phase, the source uses a Gaussian codebook to send information. In order to analyze the performance of this save-and-transmit strategy, we construct a single sequence of random variables that characterizes the probability of the available energy being insufficient to support the Gaussian codeword (i.e., $\sum_{i=1}^{k}E_{i}<\sum_{i=1}^{k}X_{i}^{2}$ for some $k$) and derive a concentration bound related to the random sequence. Our analysis reveals that the backoff from capacity $\mathrm{C}(P)$ for the optimal length-$n$ code with error probability less than $\varepsilon$ is no larger than $O\Big(\sqrt{\frac{L}{n}}\Big)$. More specifically, the maximum size of the message alphabet that we can transmit over $n$ channel uses with average probability of error no larger than $\varepsilon\in(0,1/2)$, denoted by $M_{n,\varepsilon}^{*}$, satisfies

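$$\frac{1}{n}\log M_{n,\varepsilon}^{*}\geq\mathrm{C}(P)+V_{\varepsilon}^{-}\sqrt{\frac{L}{n}}-o\bigg(\sqrt{\frac{L}{n}}\bigg)\tag{7}$$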

where $V_{\varepsilon}^{-}<0$ is some constant that does not depend on $n$. We also identify the implied constant $V_{\varepsilon}^{-}$ in Theorem 1 in Section III. The qualitative interpretation of $V_{\varepsilon}^{-}$ will be given in Remark 3 in Section III-C. The lower bound (7) is obtained by choosing the lengths of the saving phase and transmission phase to be $\Theta(\sqrt{Ln})$ and $n-\Theta(\sqrt{Ln})$ respectively, which is illustrated in Figure 1 where the accumulated harvested energy is always above the accumulated transmitted energy due to the EH constraints (1).

Second, we prove a non-asymptotic upper bound on achievable rates by simplifying the type-II error of a carefully chosen binary hypothesis test. The first-order term of the upper bound is $\mathrm{C}(P)$ and the second-order term is proportional to $-\sqrt{\frac{L}{n}}$ for all $\varepsilon\in(0,1/2)$ . More specifically, for all $\varepsilon\in(0,1/2)$ , we have

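$$\frac{1}{n}\log M_{n,\varepsilon}^{*}\leq\mathrm{C}(P)+V_{\varepsilon}^{+}\sqrt{\frac{L}{n}}+o\bigg(\sqrt{\frac{L}{n}}\bigg)\tag{8}$$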

where $V_{\varepsilon}^{+}<0$ is some constant that does not depend on $n$ . We also identify the implied constant $V_{\varepsilon}^{+}$ in Theorem 1 in Section III. The qualitative interpretation of $V_{\varepsilon}^{+}$ will be given in Remark 4 in Section III-C. Note that (7) and (8) together reveal that the back-off from $\mathrm{C}(P)$ for the optimal length- $n$ code with error probability less than $\varepsilon$ is of the order $\sqrt{\frac{L}{n}}$ . Therefore, the impact of correlation in the EH process can be interpreted as effectively decreasing the blocklength by a factor of $L$ . In other words, to achieve the same reliability, one needs to increase the blocklength by a factor equal to the coherence time of the EH process.

It is readily seen from (7) and (8) that for any fixed $\varepsilon\in(0,1/2)$ , the $\varepsilon$ -capacity is $\mathrm{C}(P)$ and the second-order term in the asymptotic expansion for the maximum achievable rate is proportional to $-\sqrt{\frac{L}{n}}$ . In addition, define

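$$V_{\varepsilon}\triangleq\sup\bigg\{S\in\mathbb{R}\,\bigg|\,\liminf_{n\to\infty}\frac{\log M_{n,\varepsilon}^{*}-n\mathrm{C}(P)}{\sqrt{Ln}}\geq S\bigg\}\tag{9}$$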

to be the second-order coding rate [5]. We can see from (7) and (8) that for any fixed $\varepsilon\in(0,1/2)$ , the second-order coding rate is sandwiched between $V_{\varepsilon}^{-}$ and $V_{\varepsilon}^{+}$ .
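The role of the saving phase in the save-and-transmit strategy can be illustrated with a small Monte Carlo sketch in Python. It is not the paper's analysis: the function name `outage_probability`, the constant `c` in the saving-phase length $\lceil c\sqrt{Ln}\rceil$, the exponential energy distribution with mean $P$, and the choice of i.i.d. Gaussian symbols with variance exactly $P$ are all assumptions made for illustration. An "outage" is counted whenever the battery would go negative at some time $k$.

```python
import math
import random

def outage_probability(n, L, P, c, trials, seed=0):
    """Estimate Pr{energy runs out at some k} for a save-then-transmit schedule."""
    rng = random.Random(seed)
    m = min(n, math.ceil(c * math.sqrt(L * n)))   # saving-phase length (assumed form)
    outages = 0
    for _ in range(trials):
        battery, level, failed = 0.0, 0.0, False
        for k in range(n):
            if k % L == 0:
                level = rng.expovariate(1.0 / P)  # new block level with mean P (assumption)
            battery += level                      # energy arrives before the symbol is sent
            if k >= m:                            # transmission phase
                x = rng.gauss(0.0, math.sqrt(P))  # Gaussian codeword symbol (assumption)
                battery -= x * x
                if battery < 0.0:
                    failed = True
                    break
        outages += failed
    return outages / trials

p_no_saving = outage_probability(200, 4, 1.0, 0.0, 200)
p_with_saving = outage_probability(200, 4, 1.0, 3.0, 200)
```

In this toy setting, transmitting from the first slot leaves no buffer against fluctuations of the harvested energy, whereas a saving phase of order $\sqrt{Ln}$ absorbs them at the cost of fewer transmission slots. This trade-off is what the $\Theta(\sqrt{Ln})$ choice in the text balances.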

Case (ii): When $L$ grows linearly in $n$

First, we prove a lower bound on the $\varepsilon$-capacity, as shown in Theorem 2 in Section III, based on a modified version of the save-and-transmit strategy called the adaptive save-and-transmit strategy. Under the adaptive save-and-transmit strategy, which is described in Section VII-A, different save-and-transmit codes are used across different blocks. In each block $\ell$, the coding rate is adapted to the corresponding EH random variable $E_{b_{\ell}+1}$ so that it is close to $\mathrm{C}(E_{b_{\ell}+1})$. In addition, the lengths of the saving phase and transmission phase for block $\ell$ are chosen to be $\Theta(\sqrt{L})$ and $L-\Theta(\sqrt{L})$ respectively, as illustrated in Figure 2.

Second, we prove an upper bound on the $\varepsilon$ -capacity (Theorem 2). We do so by considering a typical set of sequences of EH random variables followed by simplifying the type-II error of a binary hypothesis test conditioned on the aforementioned typical set.

For any EH process whose EH random variable has a continuous and strictly increasing cumulative distribution function (cdf), the upper and lower bounds in Theorem 2 coincide and hence the $\varepsilon$-capacity is fully characterized. See Remark 6 in Section III-C for a detailed discussion. Case (ii) is useful for modeling the scenario where the energy-harvesting rate changes slowly such that the number of energy-arrival blocks stays constant as $n$ increases. Since the number of energy-arrival blocks stays constant and the length of each energy-arrival block grows with $n$, it is first-order optimal to choose an appropriate save-and-transmit scheme that achieves the maximum coding rate for each block according to the energy level in that block. Therefore, we need an adaptive save-and-transmit scheme rather than the conventional non-adaptive one to achieve the overall maximum coding rate.
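The per-block rate adaptation can be tallied with a short Python sketch. This is first-order bookkeeping only: it ignores decoding errors and is not the expression in Theorem 2. The function name `adaptive_rate_sketch`, the constant `c` in the saving-phase length $\lceil c\sqrt{L}\rceil$, and the example energy levels are hypothetical.

```python
import math

def capacity(P):
    """C(P) = (1/2) log2(1 + P), in bits per channel use."""
    return 0.5 * math.log2(1.0 + P)

def adaptive_rate_sketch(block_levels, L, c=1.0):
    """Average bits per channel use when block l targets rate C(E_l) after a saving phase."""
    s = min(L, math.ceil(c * math.sqrt(L)))   # saving-phase length per block (assumed form)
    n = L * len(block_levels)                 # total blocklength
    bits = sum((L - s) * capacity(e) for e in block_levels)
    return bits / n

# Two blocks of length 100 with energy levels 1 and 3 (hypothetical values).
rate = adaptive_rate_sketch([1.0, 3.0], 100, c=1.0)
```

Because the saving phase takes only about $\sqrt{L}$ of the $L$ slots in each block, its share of the block vanishes as $L$ grows, so the tallied rate approaches the average of $\mathrm{C}(E_{b_{\ell}+1})$ over the blocks.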

I-B Related Work

The channel capacity was characterized for the AWGN channel with an i.i.d. EH process in [3] and with a stationary ergodic EH process in [8]. The aforementioned studies showed that with an unlimited battery, the capacity of the AWGN channel with stochastic energy constraints is equal to the capacity of the same channel under an average power constraint as long as the average power equals the average recharge rate of the battery. In this paper, we focus on the AWGN channel with a block EH process, where the energy arrivals remain constant for a block of duration $L$ and are independent across blocks drawn from an arbitrary distribution. A similar block i.i.d. EH model has been recently considered in [9, 10] concurrently with the current paper. However, these papers focus on the power control problem for EH communications with finite battery at the transmitter. In this paper, we rather consider the information-theoretic capacity of the channel and with infinite battery at the transmitter. Characterizing the information theoretic capacity of the channel with a finite battery is known to be a difficult problem even for an i.i.d. model for the energy arrivals and in general remains an open problem. It has been studied in several recent works [11, 12, 13, 14] and the most recent ones [13, 14] characterize the capacity within a constant gap. Due to the lack of a complete characterization of the capacity under a finite battery assumption, in this paper we focus on the AWGN EH channel with infinite battery and develop bounds on the first- and second-order terms in the asymptotic expansion of the maximum transmission rate.

For a fixed tolerable error probability $\varepsilon$ , Fong et al. [15] recently performed a finite blocklength analysis of save-and-transmit schemes proposed in [3] and obtained a non-asymptotic achievable rate for the AWGN channel with an i.i.d. EH process. The first-, second- and third-order terms of the non-asymptotic achievable rate presented in [15, Th. 1] are equal to the capacity, $-c_{1}\sqrt{\frac{\log n}{n}}$ and $-c_{2}\sqrt{\frac{2+\varepsilon}{n\varepsilon}}$ respectively where $c_{1}$ and $c_{2}$ are some positive constants that do not depend on $n$ and $\varepsilon$ . Subsequently, Shenoy and Sharma [16] refined the analysis in [15] and improved the second-order term to $-\frac{c_{3}}{\sqrt{n\varepsilon}}$ where $c_{3}$ is some positive constant that does not depend on $n$ and $\varepsilon$ . This paper further improves the second-order term to $-c_{4}\sqrt{\frac{\log(1/\varepsilon)}{n}}$ for any $\varepsilon\in(0,1/2)$ where $c_{4}$ is some positive constant that does not depend on $n$ and $\varepsilon$ (see Remark 2). The aforementioned improvements are due to better analyses of the “energy outage” probability for the same save-and-transmit strategy, where the “energy outage” occurs when the source cannot output the desired codeword due to energy shortage.
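The difference between these second-order terms is easiest to see numerically. The Python sketch below evaluates the three shapes with all constants $c_{1},c_{3},c_{4}$ set to $1$, which is purely an assumption for illustration since the text does not give their values; the function name `backoff_shapes` is also hypothetical.

```python
import math

def backoff_shapes(n, eps):
    """Magnitudes of the second-order terms, with every constant set to 1 (assumption)."""
    return {
        "sqrt(log n / n)": math.sqrt(math.log2(n) / n),               # shape in [15]
        "1 / sqrt(n eps)": 1.0 / math.sqrt(n * eps),                  # shape in [16]
        "sqrt(log(1/eps) / n)": math.sqrt(math.log2(1.0 / eps) / n),  # shape in this paper
    }

shapes = backoff_shapes(n=10_000, eps=1e-3)
```

For small $\varepsilon$ the dependence $\sqrt{\log(1/\varepsilon)}$ grows far more slowly than $1/\sqrt{\varepsilon}$, which is the sense in which the energy-outage analysis is tightened.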

I-C Paper Outline

This paper is organized as follows. The notation used in this paper is described in the next subsection. Section II states the formulation of the AWGN EH channel with block energy arrival. Section III presents our two main results — the first result fully characterizes the $\varepsilon$ -capacity and provides lower and upper bounds on the second-order coding rate when $L$ is a constant or grows sublinearly in $n$ ; the second result presents lower and upper bounds on the $\varepsilon$ -capacity when $L$ grows linearly in $n$ , where the two bounds coincide for random variables with continuous and strictly increasing cdf. In Section IV, we present the proof of the first main result, which relies on a save-and-transmit achievability lemma and a converse lemma. The proofs of the achievability and converse lemmas are provided respectively in Sections V and VI, which are briefly described as follows. Section V describes the save-and-transmit strategy which is the key to the achievability part of the first result. More specifically, we use Shannon’s achievability bound [17] to prove a non-asymptotic achievable rate for the save-and-transmit strategy. Section VI proves the converse part of the first result, and the proof technique involves simplifying a non-asymptotic bound derived from the type-II error of a binary hypothesis test. In Section VII, we provide the proof of the second result when $L$ grows linearly in $n$ . Concluding remarks are provided in Section VIII.

I-D Notation

The sets of natural, real and non-negative real numbers are denoted by $\mathbb{N}$ , $\mathbb{R}$ and $\mathbb{R}_{+}$ respectively. We let $\boldsymbol{1}\{\mathcal{E}\}$ be the indicator function of the set $\mathcal{E}$ . An arbitrary (discrete or continuous) random variable is denoted by an upper case letter (e.g., $X$ ), and the realization and alphabet of the random variable are denoted by the corresponding lower case letter (e.g., $x$ ) and calligraphic letter (e.g., $\mathcal{X}$ ) respectively. We use $X^{n}$ to denote the random tuple $(X_{1},X_{2},\ldots,X_{n})$ .

The following notations are used for any arbitrary random variables $X$ and $Y$ and any real-valued function $g$ with domain $\mathcal{X}$ . We let $p_{X,Y}$ and $p_{Y|X}$ denote the probability distribution of $(X,Y)$ and the conditional probability distribution of $Y$ given $X$ respectively. More specifically, $p_{X,Y}$ is the Radon-Nikodym derivative of a measure with respect to the Lebesgue measure in an appropriate Euclidean space. We let $p_{X,Y}(x,y)$ and $p_{Y|X}(y|x)$ be the evaluations of $p_{X,Y}$ and $p_{Y|X}$ respectively at $(X,Y)=(x,y)$ . To make the dependence on the distribution explicit, we let ${\mathrm{Pr}}_{p_{X}}\{g(X)\in\mathcal{A}\}$ denote $\int_{\mathcal{X}}p_{X}(x)\mathbf{1}\{g(x)\in\mathcal{A}\}\,\mathrm{d}x$ for any set $\mathcal{A}\subseteq\mathbb{R}$ . The expectation and the variance of $g(X)$ are denoted as ${\mathbb{E}}_{p_{X}}[g(X)]$ and ${\mathrm{Var}}_{p_{X}}[g(X)]$ respectively. We let $\mathcal{N}(\,\cdot\,;\mu,\sigma^{2}):\mathbb{R}\rightarrow[0,\infty)$ denote the probability density function of a Gaussian random variable whose mean and variance are $\mu$ and $\sigma^{2}$ respectively, i.e.,

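$$\mathcal{N}(z;\mu,\sigma^{2})\triangleq\frac{1}{\sqrt{2\pi\sigma^{2}}}e^{-\frac{(z-\mu)^{2}}{2\sigma^{2}}}.\tag{10}$$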

The cdf of the standard normal distribution is denoted by $\Phi$ , i.e.,

$$\Phi(a)\triangleq\int_{-\infty}^{a}\mathcal{N}(z;0,1)\,\mathrm{d}z.\tag{11}$$

We will take all logarithms to base $2$ throughout this paper unless specified otherwise. The logarithm function to base $2$ is denoted by $\log$, and the natural logarithm function is denoted by $\ln$.
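The Gaussian density and the standard normal cdf defined above can be evaluated with the Python standard library. The following is a minimal sketch; the function names `normal_pdf` and `std_normal_cdf` are illustrative, and the inverse cdf is taken from `statistics.NormalDist`.

```python
import math
from statistics import NormalDist

def normal_pdf(z, mu=0.0, var=1.0):
    """N(z; mu, var): Gaussian density with mean mu and variance var > 0."""
    return math.exp(-((z - mu) ** 2) / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)

def std_normal_cdf(a):
    """Phi(a): integral of N(z; 0, 1) from -infinity to a, via the error function."""
    return 0.5 * (1.0 + math.erf(a / math.sqrt(2.0)))

# Phi^{-1}(eps) is negative for eps < 1/2, the range of eps used in the second-order bounds.
inv_phi_at_tenth = NormalDist().inv_cdf(0.1)
```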

II Additive White Gaussian Noise Energy-Harvesting Channel with Block Energy Arrival

II-A Problem formulation

The AWGN EH channel consists of one source and one destination, denoted by $\mathrm{s}$ and $\mathrm{d}$ respectively. Node $\mathrm{s}$ transmits information to node $\mathrm{d}$ in $n$ time slots as follows. Node $\mathrm{s}$ chooses message $W$ and sends $W$ to node $\mathrm{d}$ , where $W$ is uniformly distributed over $\{1,2,\ldots,M\}$ for some $M$ that denotes the message size. Then for each $k\in\{1,2,\ldots,n\}$ , node $\mathrm{s}$ transmits $X_{k}\in\mathbb{R}$ and node $\mathrm{d}$ receives $Y_{k}\in\mathbb{R}$ in time slot $k$ . Let $\{E_{b_{\ell}+1}\}_{\ell=1}^{\infty}$ be i.i.d. random variables that satisfy ${\mathrm{Pr}}\{E_{1}<0\}=0$ ( $b_{\ell}$ was defined in (2)),

$${\mathbb{E}}[E_{1}]=P\tag{12}$$

and ${\mathbb{E}}[E_{1}^{3}]<\infty$ (hence ${\mathbb{E}}[E_{1}^{2}]<\infty$ ) for some $P>0$ . Each other $E_{k}$ is equal to the nearest preceding $E_{b_{\ell}+1}$ according to (3). In other words, for each $k\in\{1,2,\ldots,n\}$ and all $e^{k}\in\mathbb{R}_{+}^{k}$ ,

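$$p_{E_{k}|E^{k-1}}(e_{k}|e^{k-1})=\begin{cases}p_{E_{1}}(e_{k})&\text{if }k=b_{\ell}+1\text{ for some }\ell\in\mathbb{N},\\ \boldsymbol{1}\{e_{k}=e_{k-1}\}&\text{otherwise.}\end{cases}\tag{13}$$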

The knowledge of $E_{k}$ is available at the source at time $k$ before encoding $X_{k}$ , and the destination has no access to the energy-arrival process. The length of each energy-arrival block $L$ is assumed to remain constant, grow sublinearly in $n$ , or grow linearly in $n$ . We assume the following for each $k\in\{1,2,\ldots,n\}$ :

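(I) $E_{k}$ and $(W,X^{k-1},Y^{k-1})$ are independent when conditioned on $E^{k-1}$, i.e.,

$$p_{W,E^{k},X^{k-1},Y^{k-1}}=p_{E_{k}|E^{k-1}}\,p_{W,E^{k-1},X^{k-1},Y^{k-1}}.\tag{14}$$

(II) Every codeword $X^{n}$ transmitted by $\mathrm{s}$ must satisfy the harvested energy constraint

$${\mathrm{Pr}}\bigg\{\sum_{i=1}^{k}X_{i}^{2}\leq\sum_{i=1}^{k}E_{i}\,\bigg|\,E^{n}=e^{n},W=w\bigg\}=1\tag{15}$$

for each $e^{n}\in\mathbb{R}_{+}^{n}$ and each $w\in\mathcal{W}$.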

Assumption (I) is a mathematical statement of the following fact due to the block i.i.d. EH process: If $E_{k}$ is the first energy-arrival random variable in a block, then it is independent of any random variables that are generated before time $k$ . Otherwise, $E_{k}$ equals $E_{k-1}$ . In both cases, $E_{k}$ and $(W,X^{k-1},Y^{k-1})$ are independent when conditioned on $E^{k-1}$ .

After $n$ time slots, node $\mathrm{d}$ declares $\hat{W}$ to be the transmitted $W$ based on $Y^{n}$ . The standard definitions are formally stated in the following subsection.

II-B Standard definitions

Definition 1

An $(n,M)$ -code consists of the following:

1. A message set $\mathcal{W}\triangleq\{1,2,\ldots,M\}$ at node $\mathrm{s}$. Message $W$ is uniform on $\mathcal{W}$.

2. A sequence of encoding functions $f_{k}:\mathcal{W}\times\mathbb{R}_{+}^{k}\rightarrow\mathbb{R}$ for each $k\in\{1,2,\ldots,n\}$, where $f_{k}$ is the encoding function for node $\mathrm{s}$ at time slot $k$ for encoding $X_{k}$ such that $X_{k}=f_{k}(W,E^{k})$ and (15) holds.

3. A decoding function $\varphi:\mathbb{R}^{n}\rightarrow\mathcal{W}$ for decoding $W$ at node $\mathrm{d}$, where the message estimate $\hat{W}$ is produced by setting $\hat{W}\triangleq\varphi(Y^{n})$.

Definition 2

The AWGN EH channel is characterized by $q_{Y|X}\triangleq\mathcal{N}(y-x;0,1)$ . The distribution induced by any $(n,M)$ -code used for the AWGN EH channel follows the channel law below: For each $k\in\{1,2,\ldots,n\}$ ,

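$$p_{W,E^{k},X^{k},Y^{k}}=p_{W,E^{k},X^{k},Y^{k-1}}\,p_{Y_{k}|X_{k}}\tag{16}$$

where

$$p_{Y_{k}|X_{k}}(y_{k}|x_{k})=q_{Y|X}(y_{k}|x_{k})=\mathcal{N}(y_{k}-x_{k};0,1)\tag{17}$$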

for all $x_{k}$ and $y_{k}$ . Since $p_{Y_{k}|X_{k}}$ does not depend on $k$ by (17), the channel is stationary.

For any $(n,M)$ -code defined on the AWGN EH channel, let $p_{W,E^{n},X^{n},Y^{n},\hat{W}}$ be the joint distribution induced by the code. We can use Definition 1, (14) and (16) to factorize $p_{W,E^{n},X^{n},Y^{n},\hat{W}}$ as follows:

[TABLE]

Definition 3

For an $(n,M)$ -code defined on the AWGN EH channel, we can calculate, according to (18), the average probability of decoding error defined as ${\mathrm{Pr}}\big{\{}\hat{W}\neq W\big{\}}$ . We call an $(n,M)$ -code with average probability of decoding error no larger than $\varepsilon$ an $(n,M,\varepsilon)$ -code.

Definition 4

Let $\varepsilon\in(0,1)$ be a real number. A rate $R$ is $\varepsilon$ -achievable for the AWGN EH channel if there exists a sequence of $(n,M,\varepsilon)$ -codes (although $M$ always depends on $n$ , the dependence is not indicated explicitly, to simplify notation) such that

[TABLE]

Definition 5

Let $\varepsilon\in(0,1)$ be a real number. The $\varepsilon$ -capacity of the AWGN EH channel, denoted by $C_{\varepsilon}$ , is defined to be $C_{\varepsilon}\triangleq\sup\{R\,|\,R\text{ is }\varepsilon\text{-achievable}\}$ .

III Main Results

Section III-A contains the first main result in this paper, which concerns the $\varepsilon$ -capacity and the second-order coding rate when $L$ is a constant or grows sublinearly in $n$ . Section III-B contains the second main result in this paper, which concerns the $\varepsilon$ -capacity when $L$ grows linearly in $n$ .

III-A When $L$ is a constant or grows sublinearly in $n$

In this section, we assume that $L$ is a constant or $\omega(1)=L=o(n)$ so that $\lim\limits_{n\rightarrow\infty}\frac{L}{n}=0$ . Our goal in this section is to formalize the results in (7) and (8). Before presenting the first main result, we define the second-order achievable rate as follows.

Definition 6

Let $\varepsilon\in(0,1)$ . A real number $S$ is said to be a second-order $\varepsilon$ -achievable rate if there exists a sequence of $(n,M,\varepsilon)$ -codes (although $L$ can depend on $n$ , the dependence is not indicated explicitly, to simplify notation) such that

[TABLE]

The justification of the choice of $\sqrt{Ln}$ in (20) will be explained after the following definition concerning the second-order coding rate is presented.

Definition 7

Let $\varepsilon\in(0,1)$ . The $\varepsilon$ -second-order coding rate is defined as

[TABLE]

The choice of $\sqrt{Ln}$ in (20) can be justified as follows by inspecting (29) in the main theorem. More specifically, if we replace $\sqrt{Ln}$ in (20) with any $f(n)>0$ such that $\lim\limits_{n\rightarrow\infty}\frac{\sqrt{Ln}}{f(n)}\in\{0,\infty\}$ and define $V_{\varepsilon}^{*}$ as in Definition 7, it will then follow from (29) that

[TABLE]

Our choice of $\sqrt{Ln}$ in (20) is analogous to the choice of $n^{\beta}$ in [18, Sec. II-D] which studies the $\varepsilon$ -second-order coding rate of channels with states.

We are ready to present the first main result in this paper.

Theorem 1

Suppose $L$ is a constant or $\omega(1)=L=o(n)$ . Fix any $\varepsilon\in(0,1)$ . Recalling the definition of $\mathrm{C}(\cdot)$ in (5), we have

[TABLE]

In addition, define

[TABLE]

and

[TABLE]

Then, the $\varepsilon$ -second-order coding rate $V_{\varepsilon}$ satisfies

[TABLE]

In other words, if we let

[TABLE]

be the maximum alphabet size of the message we can transmit using an $(n,M,\varepsilon)$ -code, then

[TABLE]

Theorem 1 presents a complicated lower bound on $V_{\varepsilon}$ as stated in (25). The following corollary presents a simpler lower bound, which implies that $V_{\varepsilon}$ scales as $-O\Big{(}\sqrt{\log\frac{1}{\varepsilon}}\Big{)}$ . Since the proof of Corollary 1 is straightforward, it is relegated to Appendix A.

Corollary 1

Fix an $\varepsilon\in(0,1/2)$ . Following the definitions in Theorem 1, if we define

[TABLE]

then

[TABLE]

The following corollary presents an explicit bound on $V_{\varepsilon}^{+}-V_{\varepsilon}^{-}$ , whose proof relies on Corollary 1 and is relegated to Appendix B.

Corollary 2

Fix an $\varepsilon\in(0,\Phi(-1))$ (note that $\Phi(-1)\approx 0.1586$ ). Following the definitions in Theorem 1, we have

[TABLE]

This work does not intend to optimize the bound in (32), which can be arbitrarily large as $P$ approaches infinity or $\varepsilon$ approaches $0$ .

III-B When $L$ grows linearly in $n$

Before presenting the second main result, we make some necessary definitions. Fix an arbitrary $\lambda\in(0,1]$ and assume that

[TABLE]

Define

[TABLE]

and

[TABLE]

to be the quotient and remainder respectively resulting from dividing $1$ by $\lambda$ . The following theorem is our second main result, which provides lower and upper bounds on $C_{\varepsilon}$ . The proof of the lower and upper bounds will be given in Sections VII-A and VII-B respectively.
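As a small worked illustration, the quotient and remainder can be computed exactly with rational arithmetic. The sketch below assumes $q=\lfloor 1/\lambda\rfloor$ and $d=1-q\lambda$, which is how "quotient and remainder of dividing $1$ by $\lambda$" is read here; the function name is hypothetical.

```python
from fractions import Fraction

def quotient_and_remainder(lam):
    """Return (q, d) with 1 = q*lam + d, q a positive integer and 0 <= d < lam."""
    lam = Fraction(lam)
    if not 0 < lam <= 1:
        raise ValueError("lam must lie in (0, 1]")
    q = int(1 // lam)    # floor(1 / lam)
    d = 1 - q * lam      # exact remainder, no floating-point rounding
    return q, d
```

For example, $\lambda=2/5$ gives $q=2$ full energy-arrival blocks and a remainder $d=1/5$, while $\lambda=1$ gives $q=1$ and $d=0$.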

Theorem 2

Suppose $L$ grows linearly in $n$ according to (33) for some constant $\lambda\in(0,1]$ . Let $q$ and $d$ be as defined in (34) and (35) respectively, and recall that $p_{E_{1}}=p_{E_{2}}=\ldots=p_{E_{n}}$ . Define

[TABLE]

and

[TABLE]

Then, we have

[TABLE]

for all $\varepsilon\in(0,1)$ .

The following corollary identifies a sufficient condition under which $C_{\varepsilon}$ can be fully characterized by Theorem 2. The proof of Corollary 3 is straightforward and hence relegated to Appendix C.

Corollary 3

Under the setting of Theorem 2, if we further assume that $E_{1}$ has a continuous and strictly increasing cdf, then

[TABLE]

holds for all $\varepsilon\in(0,1)$ where $R_{\varepsilon}^{\text{thr}}$ is the unique threshold that satisfies

[TABLE]

III-C Remarks on Theorem 1 and Corollary 1

Remark 1

It is already known [15, Remark 1] that

[TABLE]

for all $\varepsilon\in(0,1)$ under an i.i.d. EH process. In other words, the AWGN EH channel admits the strong converse property [4, Ch. 3] for $L=1$ , meaning that $C_{\varepsilon}$ does not depend on $\varepsilon\in(0,1)$ . It follows from (23) in Theorem 1 that (41) continues to hold under a block i.i.d. EH process when $L=o(n)$ . An intuitive explanation for why the strong converse property holds when $L=o(n)$ is as follows. When $L=o(n)$ , since the number of energy-arrival blocks $n/L$ grows to infinity, it follows from the strong law of large numbers that the received power $\frac{1}{n}\sum_{k=1}^{n}E_{k}$ converges to ${\mathbb{E}}[E_{1}]=P$ with probability $1$ , which leads to a strong converse proof.
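The role of the number of blocks $n/L$ can be made concrete with a small exact computation. When $L$ divides $n$, the average $\frac{1}{n}\sum_{k=1}^{n}E_{k}$ is an average of $n/L$ independent block energies, so its variance is $\frac{L}{n}{\mathrm{Var}}[E_{1}]$. The sketch below checks this by brute-force enumeration for a finitely supported $E_{1}$; the function name and the two-point distribution in the usage note are illustrative assumptions, and the variance calculation only illustrates the concentration and does not replace the almost-sure argument.

```python
from itertools import product

def variance_of_average_energy(n, L, values, probs):
    """Exact variance of (1/n) * sum_k E_k for a block i.i.d. process, with n a multiple of L."""
    if n % L != 0:
        raise ValueError("this sketch assumes L divides n")
    blocks = n // L
    mean = second = 0.0
    # Enumerate every joint outcome of the n/L independent block energies.
    for outcome in product(range(len(values)), repeat=blocks):
        prob = 1.0
        total = 0.0
        for idx in outcome:
            prob *= probs[idx]
            total += L * values[idx]   # each block contributes L identical arrivals
        avg = total / n
        mean += prob * avg
        second += prob * avg * avg
    return second - mean * mean
```

With $E_{1}$ uniform on $\{0,2\}$ (so $P=1$ and ${\mathrm{Var}}[E_{1}]=1$) and $n=12$, the variance is $1/12$ for $L=1$, $1/4$ for $L=3$ and $1$ for $L=12$: it vanishes only when $L/n$ does.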

Remark 2

Consider the special case where $L=1$ and fix any $\varepsilon\in(0,1/2)$ . Clearly, both $V_{\varepsilon}^{-}$ in (25) and $V_{\varepsilon}^{+}$ in (26) are negative. Therefore, it follows from (29) in Theorem 1 that the second-order term in the asymptotic expansion of $\frac{1}{n}\log M_{n,\varepsilon}^{*}$ scales as $-\Theta\left(\sqrt{\frac{1}{n}}\right)$ . In particular, it follows from Corollary 1 that $V_{\varepsilon}^{--}\leq V_{\varepsilon}$ , meaning that the second-order term scales as $-O\left(\sqrt{\frac{\log(1/\varepsilon)}{n}}\right)$ . This improves the previous findings in [15, Th. 1] and [16] which established that the second-order term scales as $-O\Big{(}\sqrt{\frac{\log n}{n}}\Big{)}$ and $-O\Big{(}\frac{1}{\sqrt{n\varepsilon}}\Big{)}$ respectively.

Remark 3

Suppose $L=o(n)$ and fix any $\varepsilon\in(0,1/2)$ . Clearly, $V_{\varepsilon}^{-}$ in (25) is negative. In addition, the left hand side (LHS) of (29) which involves $V_{\varepsilon}^{-}$ is the rate achievable by the save-and-transmit strategy (whose details can be found in Section V-B). For a fixed $P$ and a fixed ${\mathbb{E}}[E_{1}^{2}]$ , since $V_{\varepsilon}^{-}$ is negative, it follows from the LHS of (29) that the rate achievable by the save-and-transmit strategy will increase at a slower rate if $L$ approaches infinity at a faster rate. This can be explained by the fact that block i.i.d. EH processes with longer $L$ result in higher probabilities of “energy outage” — the source cannot output the desired codeword due to energy shortage. Similarly for a fixed $P$ , since $|V_{\varepsilon}^{-}|$ increases as the variance ${\mathrm{Var}}[E_{1}]={\mathbb{E}}[E_{1}^{2}]-P^{2}$ increases, it follows that block i.i.d. EH processes with larger variance ${\mathrm{Var}}[E_{1}]$ result in higher probabilities of “energy outage”.

Remark 4

Suppose $L=o(n)$ and fix any $\varepsilon\in(0,1/2)$ . Clearly, $V_{\varepsilon}^{+}$ in (26) is negative. For a fixed $P$ , since $V_{\varepsilon}^{+}$ is negative, it follows that the right hand side (RHS) of (29) increases at a slower rate if the following holds:

(*)

$L$ approaches infinity at a faster rate or ${\mathrm{Var}}[E_{1}]$ is increased.

In addition, it was shown in the previous remark that the LHS of (29) increases at a slower rate if (*) holds. Consequently, both the LHS and RHS of (29) increase at slower rates if (*) holds, which implies that the maximum rate achievable by an $(n,M_{n,\varepsilon}^{*},\varepsilon)$ -code increases at a slower rate if (*) holds.

Remark 5

Suppose $L=o(n)$ . The achievability proof of Theorem 1 is based on analyzing the save-and-transmit strategy, which was illustrated in Figure 1 and will be formally discussed in Section V. Equation (25) in Theorem 1 is indeed a lower bound on the second-order coding rate achieved by the save-and-transmit strategy. By inspecting (25) we see that the two components that dominate the lower bound achieved by save-and-transmit are the saving period (contributed by the two terms with $-\mathrm{C}(P)$ in (25)) and the Gaussian noise (contributed by the term with $\varepsilon_{2}$ in (25)). If $L$ is a constant, both components contribute to the rate loss of the lower bound on the second-order coding rate achieved by save-and-transmit because the length of the saving period is $\Theta(\sqrt{L(n+L)})=\Theta(\sqrt{Ln})$ and the minimum rate backoff needed to overcome the Gaussian noise is $\Theta(\sqrt{n})$ , which correspond to the quantities $-\mathrm{C}(P)\sqrt{\varrho\log\frac{1}{\varepsilon_{1}}}$ and $\sqrt{\frac{P(\log\mathrm{e})^{2}}{L(1+P)}}\Phi^{-1}(\varepsilon_{2})$ in (25) respectively. If $L=\omega(1)$ , the term $\sqrt{\frac{P(\log\mathrm{e})^{2}}{L(1+P)}}\Phi^{-1}(\varepsilon_{2})$ vanishes and the resultant lower bound achieved by save-and-transmit is the quantity $-\mathrm{C}(P)\sqrt{\varrho\log\frac{1}{\varepsilon}}$ in (25), meaning that the length of the saving period dominates the lower bound.
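The Gaussian-noise component quoted above, $\sqrt{\frac{P(\log\mathrm{e})^{2}}{L(1+P)}}\Phi^{-1}(\varepsilon_{2})$, is easy to evaluate numerically. The following sketch assumes logarithms to base 2 (the base is not fixed in this section) and uses a hypothetical function name.

```python
from math import e, log2, sqrt
from statistics import NormalDist

def gaussian_noise_term(P, L, eps2):
    """sqrt(P (log e)^2 / (L (1+P))) * Phi^{-1}(eps2), with logarithms to base 2."""
    log_e = log2(e)
    return sqrt(P * log_e ** 2 / (L * (1 + P))) * NormalDist().inv_cdf(eps2)
```

For $\varepsilon_{2}<1/2$ the term is negative, and its magnitude shrinks like $1/\sqrt{L}$, which is why it vanishes when $L=\omega(1)$ and only the saving-period term remains.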

III-D Remarks on Theorem 2 and Corollary 3

Remark 6

Suppose $L$ grows linearly in $n$ and $E_{1}$ has a continuous and strictly increasing cdf. Using the formula of the $\varepsilon$ -capacity provided by Corollary 3, we conclude that $C_{\varepsilon}$ is strictly increasing on $(0,1)$ , implying that the strong converse property ceases to hold. An intuitive explanation about why the strong converse property does not hold is as follows: Since the number of energy-arrival blocks $n/L$ remains constant and the cdf of $E_{1}$ is continuous and strictly increasing, the received power $\frac{1}{n}\sum_{k=1}^{n}E_{k}$ does not converge (with probability $1$ ) to a constant, which leads to the impossibility of a strong converse.

Remark 7

Consider the special case where $L=n$ and the cdf of $E_{1}$ is continuous and strictly increasing. Let $F_{E_{1}}(e)={\mathrm{Pr}}\{E_{1}\leq e\}$ be the cdf of $E_{1}$ . It then follows from Corollary 3 with the identifications $\lambda=1$ , $q=1$ and $d=0$ that

[TABLE]

for all $\varepsilon\in(0,1)$ , which is analogous to the $\varepsilon$ -capacities (outage capacities) of slow fading channels as stated in [19, Sec. 23.3.1], the $\varepsilon$ -capacities of channels with mixed states as stated in [18, Example 1], and the $\varepsilon$ -capacities of mixed channels as stated in [4, Example 3.4.2].

Remark 8

Suppose $L$ grows linearly in $n$ . The achievability proof of Theorem 2 is based on designing an adaptive save-and-transmit code that enables the source to adjust the transmission rate for each energy-arrival block according to the changes of harvested energy across different energy-arrival blocks. The adaptive save-and-transmit code was illustrated in Figure 2 and will be formally discussed in Section VII-A. Equation (36) in Theorem 2 is the coding rate achievable by the adaptive save-and-transmit strategy. By inspecting (36), we see that the main event that dominates the coding rate achievable by adaptive save-and-transmit is the “slow fading” behavior of the EH process — the energy-harvesting rate changes slowly such that the number of energy-arrival blocks stays constant as $n$ increases.

Remark 9

Suppose $L$ grows linearly in $n$ . The converse proof of Theorem 2 proceeds by considering a typical set of energy-arrival sequences and then simplifying the conditional type-II errors of some binary hypothesis tests, where the type-II errors are conditioned on the sequences in the typical set. In particular, the typical set is defined through (221) in the converse proof in Section VII-B, and the energy-arrival sequence falls into the set with high probability by (226).

IV Proof of Theorem 1

The achievability proof of Theorem 1 relies on the following lemma, whose proof will be presented in Section V.

Lemma 4

Fix any $\varepsilon\in(0,1)$ , $\varepsilon_{1}>0$ and $\varepsilon_{2}>0$ such that

[TABLE]

Recall the definition of $\varrho$ in (24). Then for all sufficiently large $n$ , there exist a natural number

[TABLE]

and an $(n+m,M,\varepsilon)$ -code such that

[TABLE]

for some constant $\kappa_{1}$ . More specifically, $\kappa_{1}$ is defined as

[TABLE]

where

[TABLE]

In addition, equation (45) holds for any sufficiently large $n\in\mathbb{N}$ that satisfies

[TABLE]

and

[TABLE]

and $m$ can be chosen to satisfy

[TABLE]

Remark 10

Lemma 4 guarantees the existence of a carefully designed save-and-transmit scheme with the saving phase being no greater than the RHS of (51) and the message size being no less than the RHS of (45). Here, $\varepsilon_{1}$ specifies the probability of energy outage induced by energy shortage and $\varepsilon_{2}$ specifies the probability of decoding error induced by noise for the save-and-transmit scheme. As indicated by (51), the designed saving phase has to be increased as $\varepsilon_{1}$ decreases. In addition, as indicated by (45), the designed message size has to be decreased as $\varepsilon_{2}$ decreases.

The following corollary is a direct consequence of Lemma 4. The proof of Corollary 5 is given in Appendix D for completeness.

Corollary 5

Fix any $\varepsilon\in(0,1)$ . For any $\varepsilon_{1}>0$ and $\varepsilon_{2}>0$ that satisfy $\varepsilon_{1}+\varepsilon_{2}=\varepsilon$ , there exists a sequence of $(n^{*},M,\varepsilon)$ -codes such that

[TABLE]

The converse proof of Theorem 1 relies on the following lemma, whose proof will be presented in Section VI.

Lemma 6

Fix any $\varepsilon\in(0,1)$ . For any sufficiently large $n$ and any $(n,M,\varepsilon)$ -code, we have

[TABLE]

for some $\kappa_{2}=O(L)$ . More specifically, $\kappa_{2}$ is defined as

[TABLE]

where

[TABLE]

In addition, equation (53) holds for any sufficiently large $n$ that satisfies

[TABLE]

Remark 11

For any $\varepsilon\in(0,1/2)$ , since $\Phi^{-1}(\varepsilon)$ is negative, it follows from Corollary 5 and Lemma 6 that the second-order term in the asymptotic expansion of $\log M_{n,\varepsilon}^{*}$ is $-O\bigl{(}\sqrt{L(n+L)}\bigr{)}=-O\bigl{(}\sqrt{Ln}\bigr{)}$ .

We are now ready to prove Theorem 1.

Proof:

For any $\varepsilon\in(0,1)$ , the left inequality of (29) follows directly from Corollary 5 and the definition of $V_{\varepsilon}^{-}$ in (25). The right inequality of (29) follows directly from (53) in Lemma 6 and the definition of $V_{\varepsilon}^{+}$ in (26). Using (29) and Definition 6, we obtain (27) as well as (23). ∎

Remark 12

Theorem 1 no longer holds when $L=\lfloor\lambda n\rfloor$ for some $\lambda\in(0,1]$ . As we can see above, the proof of Theorem 1 hinges on the achievability and converse results stated in Lemmas 4 and 6 respectively. However, when $L=\lfloor\lambda n\rfloor$ , neither Lemma 4 nor Lemma 6 yields the desired achievability or converse bound. This is due to the fact that the length of the saving period $m$ guaranteed by (51) in Lemma 4 grows linearly with $n$ when $L=\lfloor\lambda n\rfloor$ and hence the overall rate achievable by save-and-transmit $\frac{n\mathrm{C}(P)}{m+n}$ does not converge to the desired $\mathrm{C}(P)$ . In addition, the upper bound (53) in Lemma 6 does not converge to the desired $\mathrm{C}(P)$ when $L=\lfloor\lambda n\rfloor$ .
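The effect described in this remark can be seen numerically. The sketch below assumes $\mathrm{C}(P)=\frac{1}{2}\log_{2}(1+P)$ and a saving phase of length $m=\lceil c\sqrt{L(n+L)}\rceil$, where the constant `c` is a hypothetical stand-in for the constants in (51); only the scaling in $n$ and $L$ is meant to be illustrative.

```python
from math import ceil, log2, sqrt

def capacity(P):
    """C(P) = (1/2) log2(1 + P), in bits per channel use (assumed form of (5))."""
    return 0.5 * log2(1 + P)

def save_and_transmit_rate(n, L, P, c=1.0):
    """First-order rate n*C(P)/(m+n) with a saving phase m = ceil(c*sqrt(L*(n+L))).

    The constant c is a hypothetical placeholder, not a value from the paper.
    """
    m = ceil(c * sqrt(L * (n + L)))
    return n * capacity(P) / (m + n)
```

With $P=1$ and $L=1$, the rate at $n=10^{6}$ is already within $0.1\%$ of $\mathrm{C}(1)=0.5$, whereas with $L=n/2$ it stays near $0.268$ no matter how large $n$ becomes.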

V Proof of Lemma 4 via the Save-and-Transmit Strategy

In this section, we investigate the save-and-transmit scheme proposed in [3, Sec. IV] in the finite blocklength regime. We will use this achievability scheme to prove Lemma 4.

V-A Prerequisites

The following lemma is useful for obtaining a lower bound on the length of the energy-saving phase. The proof is deferred to Appendix E.

Lemma 7

Let $m$ and $n$ be two natural numbers. Suppose $\{X_{k}\}_{k=1}^{n}$ and $\{E_{k}\}_{k=1}^{m+n}$ are two sequences of i.i.d. random variables such that $X^{n}$ and $E^{m+n}$ are independent,

[TABLE]

and

[TABLE]

Suppose there exists a sufficiently small $t\in(0,1)$ such that ${\mathbb{E}}[X_{1}^{4}\mathrm{e}^{tX_{1}^{2}}]<\infty$ and

[TABLE]

and we define

[TABLE]

Then,

[TABLE]

In order to adapt Lemma 7 to the block energy arrival setting, we define the following quantities for each $t>0$ and each $L\in\mathbb{N}$ (cf. (60) and (61)):

[TABLE]

and

[TABLE]

The following corollary adapts Lemma 7 to the block energy arrival setting. Since the proof of the corollary is tedious, it is deferred to Appendix F.

Corollary 8

Fix a natural number $L$ . Suppose $\{X_{k}\}_{k=1}^{n}$ is a sequence of i.i.d. random variables where $X_{1}\sim\mathcal{N}(x_{1};0,P)$ , and suppose $\{E_{k}\}_{k=1}^{m+n}$ is a sequence of random variables that are distributed according to (13) (in an i.i.d.-block manner with block size $L$ ). Fix an $\varepsilon_{1}>0$ and define

[TABLE]

If

[TABLE]

and

[TABLE]

then

[TABLE]

The following lemma [17] is standard for proving achievability results in the finite blocklength regime and its proof can be found in [4, Th. 3.8.1].

Lemma 9 (Implied by Shannon’s bound [17])

Let $p_{X^{n},Y^{n}}$ be the probability distribution of a pair of random variables $(X^{n},Y^{n})$ . Let $\{X^{n}(w),Y^{n}(w)\}_{w=1}^{\infty}$ be a sequence of i.i.d. random variables where $(X^{n}(1),Y^{n}(1))\sim p_{X^{n},Y^{n}}$ . For each $\delta>0$ and each $M\in\mathbb{N}$ , we have

[TABLE]

V-B Proof of Lemma 4

Fix any $\varepsilon\in(0,1)$ , and fix any $\varepsilon_{1}>0$ and $\varepsilon_{2}>0$ such that

[TABLE]

Define $\alpha_{t}$ , $\beta_{0}$ , $\beta_{t}$ and $t_{n}$ as in (63), (64), (65) and (66) respectively. Fix a sufficiently large $n$ such that (48), (49) and (50) hold. Since (48) holds, it follows from the definition of $\beta_{0}$ in (64) that (67) also holds. Define

[TABLE]

which satisfies (68) and specifies the number of time slots which are used for saving energy. Consider the random code that uses the channel $m+n$ times as follows:

**Save-and-Transmit Random Codebook Construction**

Let $0^{m}$ denote the length- $m$ zero tuple. Define the distribution $p_{X}$ as

[TABLE]

In addition, define the distribution $p_{X^{n}}$ as $p_{X^{n}}(x^{n})\triangleq\prod_{k=1}^{n}p_{X}(x_{k})$ . Construct $M$ i.i.d. random tuples denoted by $X^{n}(1),X^{n}(2),\ldots,X^{n}(M)$ such that $X^{n}(1)$ is distributed according to $p_{X^{n}}$ , where $M$ will be carefully chosen later when we evaluate the probability of decoding error. Define

[TABLE]

for each $w\in\{1,2,\ldots,M\}$ and construct the random codebook

[TABLE]

The codebook is revealed to both the encoder and the decoder. To facilitate discussion, we let $X_{k}(w)$ and $\tilde{X}_{k}(w)$ denote the $k^{\text{th}}$ symbols in $X^{n}(w)$ and $\tilde{X}^{m+n}(w)$ respectively for each $w$ . Since the first $m$ symbols of each random codeword $\tilde{X}^{m+n}(w)$ are zeros by (74), the source will just transmit $0$ with probability $1$ until time slot $m+1$ when the amount of energy $\sum_{k=1}^{m+1}E_{k}$ is available for encoding $\tilde{X}_{m+1}(W)$ .

**Encoding under the EH Constraints**

For each $w\in\{1,2,\ldots,M\}$ , recalling that $\tilde{X}_{k}(w)$ is the $k^{\text{th}}$ element of $\tilde{X}^{m+n}(w)\stackrel{(74)}{=}(0^{m},X^{n}(w))$ , we construct recursively for $k=1,2,\ldots,m+n$ the random variable

[TABLE]

To send message $W$ which is uniformly distributed on $\{1,2,\ldots,M\}$ , the source transmits $\hat{X}_{k}(W,E^{k})$ in time slot $k$ for each $k\in\{1,2,\ldots,m+n\}$ . Note that the source transmits $0$ with probability $1$ in the first $m$ time slots by (74) and (76), and the transmitted codeword $(\hat{X}_{1}(W,E^{1}),\hat{X}_{2}(W,E^{2}),\ldots,\hat{X}_{m+n}(W,E^{m+n}))$ satisfies the EH constraints (15) by (76).
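The encoding step can be sketched in a few lines of Python. The sketch assumes the recursive rule sends the intended symbol whenever the energy harvested so far, minus the energy already spent, covers it, and sends $0$ otherwise; this is one natural reading of the construction and not necessarily the exact form of (76). The function name and the outage flag are illustrative.

```python
def save_and_transmit_encode(codeword, energies, m):
    """Causally encode (0^m, codeword) under the energy-harvesting constraint.

    codeword: the n symbols x_1..x_n of the chosen message
    energies: the m+n harvested energies e_1..e_{m+n}
    Returns the transmitted symbols and a flag telling whether any symbol was dropped.
    """
    padded = [0.0] * m + list(codeword)   # saving phase followed by the codeword
    if len(energies) != len(padded):
        raise ValueError("need one energy arrival per time slot")
    battery = 0.0                          # infinite-capacity battery, initially empty
    sent = []
    outage = False
    for x, e in zip(padded, energies):
        battery += e                       # energy harvested in this slot is usable now
        if x * x <= battery:
            sent.append(x)
            battery -= x * x
        else:
            sent.append(0.0)               # not enough energy: stay silent
            outage = True
    return sent, outage
```

By construction the output always satisfies the cumulative energy constraint; an outage only means that the transmitted tuple differs from the intended codeword, which is the event bounded in the next step of the proof.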

**Threshold Decoding**

Upon receiving

[TABLE]

where

[TABLE]

denotes the transmitted tuple specified in (76) and $Z^{m+n}\sim\prod_{k=1}^{m+n}\mathcal{N}(z_{k};0,1)$ by the channel law (cf. (17)), the destination constructs its subtuple denoted by $\bar{Y}^{n}$ by keeping only the last $n$ symbols of $\hat{Y}^{m+n}$ . Recalling that $q_{Y|X}$ denotes the channel law and $p_{X}(x)\equiv\mathcal{N}(x;0,P)$ , we define the joint distribution

[TABLE]

and define the joint distribution $p_{X^{n},Y^{n}}$ as

[TABLE]

Then, the decoder declares $\varphi(\bar{Y}^{n})\in\{1,2,\ldots,M\}$ (with a slight abuse of notation, we write $\varphi(\bar{Y}^{n})$ instead of $\varphi(\hat{Y}^{m+n})$ ) to be the transmitted message where $\varphi(\bar{Y}^{n})$ is the decoding function defined as follows: If there exists a unique index $j$ such that

[TABLE]

then $\varphi(\bar{Y}^{n})$ is assigned the value $j$ . Otherwise, $\varphi(\bar{Y}^{n})$ is assigned a random value uniformly distributed on $\{1,2,\ldots,M\}$ .
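A threshold decoder of this kind can be sketched as follows. The sketch assumes that the test in (81) compares the information density $\log\frac{p_{Y^{n}|X^{n}}(\bar{Y}^{n}|X^{n}(j))}{p_{Y^{n}}(\bar{Y}^{n})}$ with a threshold, written here as a free parameter `gamma`; natural logarithms are used, and all names are illustrative.

```python
import random
from math import log

def information_density(x, y, P):
    """log p(y|x)/p(y) in nats for unit-variance noise and output marginal N(0, 1+P)."""
    return sum(0.5 * log(1 + P) + yk * yk / (2 * (1 + P)) - (yk - xk) ** 2 / 2
               for xk, yk in zip(x, y))

def threshold_decode(codebook, y, P, gamma, rng):
    """Return the unique index j whose information density exceeds gamma, else a uniform guess."""
    hits = [j for j, x in enumerate(codebook, start=1)
            if information_density(x, y, P) > gamma]
    if len(hits) == 1:
        return hits[0]
    return rng.randint(1, len(codebook))   # no hit or several hits: guess uniformly
```

Raising `gamma` makes a false hit by a wrong codeword less likely but makes it more likely that the true codeword misses the threshold; the choice of $M$ later in the proof balances these two events.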

**Calculating the Probability of Violating the EH Constraints**

Defining $\bar{X}^{n}(W,E^{m+n})$ to be the tuple containing the last $n$ symbols of $\hat{X}^{m+n}(W,E^{m+n})$ , we obtain from (74), (76) and (78) that

[TABLE]

Using Corollary 8 and noting that $E^{m+n}$ and $(W,X^{n}(W))$ are independent by construction, we obtain

[TABLE]

Using (82) and (83), we have

[TABLE]

**Calculating the Probability of Decoding Error**

Defining $\bar{Z}^{n}$ to be the tuple containing the last $n$ symbols of $Z^{m+n}$ and recalling that $\bar{X}^{n}(W,E^{m+n})$ and $\bar{Y}^{n}$ are the tuples containing the last $n$ symbols of $\hat{X}^{m+n}(W,E^{m+n})$ and $\hat{Y}^{m+n}$ respectively, we obtain from (77) and (84) that

[TABLE]

where $X^{n}(W)$ and $\bar{Z}^{n}$ are independent and $\bar{Z}^{n}\sim\prod_{k=1}^{n}\mathcal{N}(\bar{z}_{k};0,1)$ . Following (81) and (85), we define the events

[TABLE]

and consider the following chain of inequalities for each $w\in\{1,2,\ldots,M\}$ :

[TABLE]

where

(a) follows from symmetry of the random codebook construction and the union bound.

(b) follows from Lemma 9 and (86).

(c) follows from the fact that $X^{n}(1)$ and $\bar{Z}^{n}$ consist of independent copies of $X_{1}(1)$ and $\bar{Z}_{1}$ respectively by construction.

**Applying the Berry-Esséen Theorem**

Using (79), (73) and (17) we conclude that $X\sim\mathcal{N}(x;0,P)$ and $Z\triangleq Y-X$ are independent, $Z\sim\mathcal{N}(z;0,1)$ , and

[TABLE]

In order to ensure the first term in (90) can be bounded above by a simple term, we first define the mean $\mu$ , the variance $\sigma^{2}$ and the third absolute moment $T$ of $\log\left(\frac{p_{Y|X}(Y|X)}{p_{Y}(Y)}\right)$ as follows: $\mu\triangleq\frac{1}{2}\log\left(1+P\right)$ , $\sigma\triangleq\sqrt{\frac{P(\log\mathrm{e})^{2}}{1+P}}$ and

[TABLE]

where the derivation of the last inequality is relegated to Appendix G-A. Clearly,

[TABLE]
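The values of $\mu$ and $\sigma$ stated above can be sanity-checked by simulation. The sketch below works in nats, so that $\log\mathrm{e}=1$, $\mu=\frac{1}{2}\ln(1+P)$ and $\sigma^{2}=\frac{P}{1+P}$; the sample size, the seed and the function name are arbitrary illustrative choices.

```python
import random
from math import log, sqrt

def information_density_samples(P, num_samples, rng):
    """Draw samples of log p(Y|X)/p(Y) in nats with X ~ N(0,P), Y = X + Z, Z ~ N(0,1)."""
    samples = []
    for _ in range(num_samples):
        x = rng.gauss(0.0, sqrt(P))
        z = rng.gauss(0.0, 1.0)
        y = x + z
        # log p(y|x) - log p(y) for p(y|x) = N(y-x; 0, 1) and p(y) = N(y; 0, 1+P)
        samples.append(0.5 * log(1 + P) + y * y / (2 * (1 + P)) - z * z / 2)
    return samples

P = 1.0
samples = information_density_samples(P, 100_000, random.Random(0))
mean = sum(samples) / len(samples)
var = sum((s - mean) ** 2 for s in samples) / len(samples)
# mean should be close to 0.5*ln(1+P) and var close to P/(1+P)
```

The empirical mean and variance should agree with the closed forms up to Monte Carlo error, which is of order $10^{-3}$ to $10^{-2}$ at this sample size.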

After defining $\mu$ , $\sigma$ and $T$ , we choose $M$ to be the unique integer that satisfies

[TABLE]

where

[TABLE]

Following (90), we obtain the following inequality where the random variables are distributed according to $\prod_{k=1}^{n}p_{X_{k}(1)}p_{\bar{Z}_{k}}$ :

[TABLE]

where

(a) follows from (86) and (95).

(b) follows from the Berry-Esséen theorem for i.i.d. random variables [20], i.e., $\left|{\mathrm{Pr}}\left\{\frac{\sum_{k=1}^{n}V_{k}-n\mu}{\sqrt{n\sigma^{2}}}\leq a\right\}-\Phi(a)\right|\leq\frac{T}{\sigma^{3}\sqrt{n}}$ for all $a\in\mathbb{R}$ where $\mu$ , $\sigma^{2}$ and $T$ denote the mean, the variance and the third absolute moment of $V_{k}$ respectively.
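The Berry-Esséen inequality in the form quoted above can be checked exactly on a toy example. The sketch below uses sums of i.i.d. Bernoulli($p$) variables rather than the information density of the proof, because their distribution is available in closed form; $T$ is taken to be the third absolute central moment ${\mathbb{E}}|V_{1}-\mu|^{3}$, and the function name is illustrative.

```python
from math import comb, sqrt
from statistics import NormalDist

def berry_esseen_gap_and_bound(n, p):
    """Return (sup_a |Pr{(S_n - n*mu)/sqrt(n*sigma^2) <= a} - Phi(a)|, T/(sigma^3 sqrt(n)))
    for S_n a sum of n i.i.d. Bernoulli(p) random variables."""
    mu = p
    sigma = sqrt(p * (1 - p))
    T = p * (1 - p) * ((1 - p) ** 2 + p ** 2)   # E|V - mu|^3 for V ~ Bernoulli(p)
    phi = NormalDist().cdf
    gap = 0.0
    cdf_left = 0.0                               # Pr{S_n <= k-1}
    for k in range(n + 1):
        cdf_right = cdf_left + comb(n, k) * p ** k * (1 - p) ** (n - k)
        a = (k - n * mu) / (sigma * sqrt(n))
        # The cdf of S_n jumps at k, so the supremum is approached from either side of the jump.
        gap = max(gap, abs(cdf_left - phi(a)), abs(cdf_right - phi(a)))
        cdf_left = cdf_right
    return gap, T / (sigma ** 3 * sqrt(n))
```

In such examples the true gap is typically well below the bound; the proof only needs the $O(1/\sqrt{n})$ decay that the bound guarantees.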

We are ready to compute the probability of decoding error as follows, where the random variables are distributed according to $p_{W,X^{n}(W)}p_{\bar{Z}^{n}}p_{\bar{Y}^{n}|W,X^{n}(W),\bar{Z}^{n}}$ :

[TABLE]

where (a) follows from the threshold decoding rule (cf. (81) and (86)), (90) and (98).

**Obtaining a Lower Bound on the Message Size $\boldsymbol{M}$**

Using (95), (102) and the simple fact that $\log(M+1)\leq\log M+1$ , we conclude that the constructed code is an $(n+m,M,\varepsilon)$ -code that satisfies

[TABLE]

Using Taylor’s theorem together with the fact by (50) that $\left[\varepsilon_{2}-\frac{T}{\sigma^{3}\sqrt{n}}-\frac{1}{\sqrt{n}},\varepsilon_{2}\right]\subseteq\left[\varepsilon_{2}^{2},\varepsilon_{2}\right]$ , we obtain

[TABLE]

where the derivation of (104) is relegated to Appendix G-B. Combining (103) and (105) and recalling the definition of $\kappa_{1}$ in (46), we have

[TABLE]

**Obtaining an Upper Bound on the Length of Saving Phase $\boldsymbol{m}$**

Since the constructed $(n+m,M,\varepsilon)$ -code satisfies (45) by (106), it remains to show that $m$ satisfies (51). To this end, recall the definition of $m$ in (72) and consider the following bounds on $t_{n}$ , $\alpha_{t_{n}}$ and $\beta_{t_{n}}$ :

[TABLE]

and

[TABLE]

In order to obtain an upper bound on $m$ , consider the following chain of inequalities:

[TABLE]

where

(a) follows from (115) and the definition of $\beta_{0}$ in (64).

(b) follows from (112) and the definition of $\beta_{0}$ in (64).

Consequently, the constructed $(n+m,M,\varepsilon)$ -code satisfies (45) and (51) by (106) and (118) respectively. This completes the proof.

VI Proof of Lemma 6 via Binary Hypothesis Testing

VI-A Prerequisites

The following definition concerning the non-asymptotic fundamental limits of a simple binary hypothesis test is standard. See for example [21, Section III-E].

Definition 8

Let $p_{X}$ and $q_{X}$ be two probability distributions defined on some common alphabet $\mathcal{X}$ . Let

[TABLE]

be the set of randomized binary hypothesis tests between $p_{X}$ and $q_{X}$ where $\{Z=0\}$ indicates the test chooses $q_{X}$ , and let $\delta\in[0,1]$ be a real number. The minimum type-II error in a simple binary hypothesis test between $p_{X}$ and $q_{X}$ with type-I error no larger than $1-\delta$ is defined as

[TABLE]

The existence of a minimizing test $r_{Z|X}$ is guaranteed by the Neyman-Pearson lemma.

We state in the following lemma and proposition some important properties of $\beta_{\delta}(p_{X}\|q_{X})$ , which are crucial for the proof of Theorem 1. The proof of the two statements in the following lemma can be found in [22, Lemma 1] and [23, Sec. 2.3] respectively.

Lemma 10

Let $p_{X}$ and $q_{X}$ be two probability distributions defined on some $\mathcal{X}$ , and let $g$ be a function whose domain contains $\mathcal{X}$ . Then, the following two statements hold:

1. (Data processing inequality (DPI)) $\beta_{\delta}(p_{X}\|q_{X})\leq\beta_{\delta}(p_{g(X)}\|q_{g(X)})$ .

2. For all $\xi>0$ , $\beta_{\delta}(p_{X}\|q_{X})\geq\frac{1}{\xi}\left(\delta-\int_{\mathcal{X}}p_{X}(x)\boldsymbol{1}\left\{\frac{p_{X}(x)}{q_{X}(x)}\geq\xi\right\}\mathrm{d}x\right)$ .
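For finite alphabets, $\beta_{\delta}(p_{X}\|q_{X})$ can be computed directly from the Neyman-Pearson lemma, which also gives a quick numerical check of the second statement of Lemma 10. The sketch below assumes the convention that $\{Z=1\}$ selects $p_{X}$, so that the type-I constraint reads $\sum_{x}p_{X}(x)r_{Z|X}(1|x)\geq\delta$ and the type-II error is $\sum_{x}q_{X}(x)r_{Z|X}(1|x)$; the function names are illustrative.

```python
def likelihood_ratio(px, qx):
    """p(x)/q(x), with the conventions 0/q = 0 and p/0 = infinity for p > 0."""
    if px == 0:
        return 0.0
    return float("inf") if qx == 0 else px / qx

def beta(delta, p, q):
    """Minimum of sum_x q(x) r(1|x) over randomized tests with sum_x p(x) r(1|x) >= delta."""
    # Accept symbols in decreasing order of likelihood ratio (Neyman-Pearson).
    order = sorted(range(len(p)), key=lambda x: likelihood_ratio(p[x], q[x]), reverse=True)
    need = delta                        # p-mass the acceptance region still has to cover
    type_two = 0.0
    for x in order:
        if need <= 0 or p[x] == 0:
            break
        frac = min(1.0, need / p[x])    # randomize on the boundary symbol
        type_two += frac * q[x]
        need -= frac * p[x]
    return type_two

def lemma10_lower_bound(delta, p, q, xi):
    """(1/xi) * (delta - Pr_p{p(X)/q(X) >= xi}), the second statement of Lemma 10."""
    mass = sum(px for px, qx in zip(p, q) if likelihood_ratio(px, qx) >= xi)
    return (delta - mass) / xi
```

For instance, with $p_{X}=(0.9,0.1)$ and $q_{X}=(0.1,0.9)$, the optimal test at $\delta=0.95$ accepts the first symbol and half of the second, giving $\beta_{0.95}=0.55$, and the lower bound stays below this value for every $\xi>0$.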

The proof of the following proposition is similar to Lemma 3 in [22] and therefore omitted.

Proposition 11

Let $p_{U,V}$ and $s_{V}$ be two probability distributions defined on $\mathcal{W}\times\mathcal{W}$ and $\mathcal{W}$ respectively for some $\mathcal{W}$ , and let $p_{U}$ be the marginal distribution of $p_{U,V}$ . Suppose $p_{U}$ is the uniform distribution, and let

[TABLE]

be a real number in $[0,1)$ . Then,

[TABLE]

VI-B Proof of Lemma 6

Fix an $\varepsilon\in(0,1)$ , an $\bar{n}\in\mathbb{N}$ which is larger than the RHS of (56) and an $(\bar{n},M,\varepsilon)$ -code for the AWGN EH channel. Using Definition 1, we have

[TABLE]

for the $(\bar{n},M,\varepsilon)$ -code. Define

[TABLE]

to be the smallest positive integer such that $\bar{n}+\Delta$ is a multiple of $L$ . Then, we can always construct an $(\bar{n}+\Delta,M,\varepsilon)$ -code by appending carefully chosen $X_{\bar{n}+1},X_{\bar{n}+2},\ldots,X_{\bar{n}+\Delta}$ to each transmitted sequence $X^{\bar{n}}$ generated by the $(\bar{n},M,\varepsilon)$ -code such that

[TABLE]

The technique of transforming the peak power inequality constraint (123) to a power equality constraint (125) by appending an extra symbol has been employed in [21, Lemma 39] and [24, Theorem 4.4] (and is called the Yaglom-map trick). To simplify notation, we let

[TABLE]

where $n$ is a multiple of $L$ and satisfies (56).
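The padding length $\Delta$ described above has a one-line closed form. The sketch below follows the verbal definition (the smallest positive integer such that $\bar{n}+\Delta$ is a multiple of $L$); the function name is illustrative.

```python
def padding_length(n_bar, L):
    """Smallest positive integer delta such that n_bar + delta is a multiple of L."""
    return L - (n_bar % L)
```

Because $\Delta$ is required to be positive, a full block of $L$ symbols is appended even when $\bar{n}$ is already a multiple of $L$, so that $1\leq\Delta\leq L$ in all cases.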

**Obtaining a Lower Bound on the Error Probability in Terms of the Type-II Error of a Hypothesis Test**

Let $p_{W,E^{n},X^{n},Y^{n},\hat{W}}$ be the probability distribution induced by the $(n,M,\varepsilon)$ -code constructed above, where $p_{W,E^{n},X^{n},Y^{n},\hat{W}}$ can be expressed according to (18). In view of (125), we assume without loss of generality that

[TABLE]

for all Borel measurable $\mathcal{A}\subseteq\mathcal{W}\times\mathbb{R}_{+}^{n}\times\mathbb{R}^{n}\times\mathbb{R}^{n}$ . All the probability and expectation terms in the rest of this proof are evaluated according to $p_{W,E^{n},X^{n},Y^{n},\hat{W}}$ unless specified otherwise. Define

[TABLE]

where

[TABLE]

It follows from Proposition 11 and Definition 1 with the identifications $U\equiv W$ , $V\equiv\hat{W}$ , $p_{U,V}\equiv p_{W,\hat{W}}$ , $|\mathcal{W}|\equiv M$ and $\alpha\equiv{\mathrm{Pr}}\{W\neq\hat{W}\}\leq\varepsilon$ that

[TABLE]

**Using the DPI to Introduce the Channel Inputs and Outputs**

Using the DPI of $\beta_{1-\varepsilon}$ in Lemma 10, we have

[TABLE]

Fix a $\xi_{n}>0$ to be specified later. Since

[TABLE]

it follows from (130), the definition of $s_{Y^{n},\hat{W}}$ in (128), (131) and Lemma 10 that

[TABLE]

**Simplifying the Non-Asymptotic Bound**

Combining (17) and (129), we have

[TABLE]

for each $k\in\{1,2,\ldots,n\}$ . Due to the power equality constraint imposed on the codewords, we have

[TABLE]

Letting

[TABLE]

for each $k\in\{1,2,\ldots,n\}$ , we obtain from (134) and (135) that

[TABLE]

Combining (133) and (137), we have

[TABLE]

**Evaluating the Distribution of the Sum of Random Variables $\sum_{k=1}^{n}U_{k}$**

In order to simplify (138), we now investigate the distribution of the sum of random variables $\sum_{k=1}^{n}U_{k}$ . We will show in the following that the distribution of $\sum_{k=1}^{n}U_{k}$ can be evaluated in closed form. Define the function $\lambda:\mathbb{R}_{+}\times\mathbb{R}\times\mathbb{R}\rightarrow\mathbb{R}$ as

[TABLE]

We begin evaluating the distribution of $\sum_{k=1}^{n}U_{k}$ by examining the distribution of $\sum_{k=1}^{n}\lambda(E_{k},X_{k},Y_{k})$ (cf. (136)) as follows. Let

[TABLE]

be the characteristic function of $\sum_{k=1}^{n}\lambda(E_{k},X_{k},Y_{k})$ where $i$ denotes the imaginary unit. In order to evaluate a closed-form expression for (140), we write

[TABLE]

In order to simplify the RHS of (142), we consider the following chain of equalities for each $r\in\{2,3,\ldots,n\}$ :

[TABLE]

Since $Y_{r}-X_{r}$ is a standard normal random variable which is independent of $(E^{n},X^{r-1},Y^{r-1})$ by the channel law in (17), it follows from straightforward calculations based on (139) that the following equality holds with probability $1$ for each $r\in\{2,3,\ldots,n\}$ :

[TABLE]

Using (144) and (146), we have the following equality that holds with probability $1$ for each $r\in\{2,3,\ldots,n\}$ :

[TABLE]

Combining (142) and (147), we obtain

[TABLE]

Let $\{Z_{k}\}_{k=1}^{n}$ be $n$ independent copies of the standard normal random variable. Straightforward calculations reveal that

[TABLE]

Therefore,

[TABLE]

by (148) and (149), i.e., the characteristic functions of $\sum_{k=1}^{n}\lambda(E_{k},X_{k},Y_{k})$ and $\sum_{k=1}^{n}(-PZ_{k}^{2}+2\sqrt{E_{k}}Z_{k}+E_{k})$ are equal. Consequently, the probability distributions of $\sum_{k=1}^{n}\lambda(E_{k},X_{k},Y_{k})$ and $\sum_{k=1}^{n}(-PZ_{k}^{2}+2\sqrt{E_{k}}Z_{k}+E_{k})$ are equal almost everywhere, which implies from (136) and (139) that the probability distributions of $\sum_{k=1}^{n}\frac{\log\mathrm{e}}{2(1+P)}(-PZ_{k}^{2}+2\sqrt{E_{k}}Z_{k}+E_{k})$ and $\sum_{k=1}^{n}U_{k}$ are equal almost everywhere, which then implies from (138) that

[TABLE]

We recall from (124) that $n=\bar{n}+\Delta$ is a multiple of $L$ and define

[TABLE]

for each $\ell\in\{1,2,\ldots,n/L\}$ (cf. the definition of $b_{\ell}$ in (2)). Then, equation (151) can be rewritten as

[TABLE]

**Applying the Berry-Esséen Theorem**

Using the facts that $\{Z_{k}\}_{k=1}^{n}$ are i.i.d., $\{E_{b_{\ell}+1}\}_{\ell=1}^{n/L}$ are i.i.d. and

[TABLE]

for each $\ell\in\{1,2,\ldots,n/L\}$ , we conclude in view of (152) that $\{\tilde{V}_{\ell}\}_{\ell=1}^{n/L}$ are i.i.d. where

[TABLE]

In order to invoke the Berry-Esséen Theorem to bound the probability term in (153), we define the following quantities related to $\tilde{V}_{1}$ :

[TABLE]

and

[TABLE]

where the derivation of the last inequality is relegated to Appendix H-A. Recalling the definition of $\tau_{2}$ in (55), we use the Berry-Esséen theorem for i.i.d. random variables [20] to obtain

[TABLE]

where the argument of $\Phi^{-1}$ satisfies

[TABLE]
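As a rough numerical illustration of the Berry-Esséen step, the sketch below evaluates the classical bound $\sup_{x}|F_{N}(x)-\Phi(x)|\leq C\rho/(\sigma^{3}\sqrt{N})$ for a normalized sum of $N$ i.i.d. terms, where $\sigma^{2}$ is the variance and $\rho$ the third absolute central moment of one term. The function name and the default constant $C=0.56$ are assumptions made for illustration; the constant used in the paper's displayed bound is not reproduced here. In the present proof the number of i.i.d. terms would be $n/L$.

```python
import math


def berry_esseen_gap(third_abs_moment: float, std_dev: float,
                     num_terms: int, constant: float = 0.56) -> float:
    """Classical Berry-Esseen bound on the Kolmogorov distance between the
    normalized sum of `num_terms` i.i.d. variables and the standard normal.

    `constant` is an assumed universal constant (0.56 is a conservative
    published value); it is not taken from the paper.
    """
    if std_dev <= 0 or num_terms <= 0:
        raise ValueError("std_dev and num_terms must be positive")
    return constant * third_abs_moment / (std_dev ** 3 * math.sqrt(num_terms))


if __name__ == "__main__":
    # Hypothetical moments of one block-level term; n/L blocks in total.
    n, L = 10_000, 10
    print(berry_esseen_gap(third_abs_moment=2.0, std_dev=1.0, num_terms=n // L))
```

The bound decays as $\sqrt{L/n}$, which is consistent with the $\tau_{2}\sqrt{L/n}$ correction appearing in the argument of $\Phi^{-1}$.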

Following (153) and letting

[TABLE]

we can express (153) as

[TABLE]

which implies from the definition of $\sigma$ in (158), the inequality in (162) and the definition of $\tilde{V}_{\ell}$ in (152) that

[TABLE]

Using Taylor’s theorem together with the fact by (50) that $\left[\varepsilon,\varepsilon+2\tau_{2}\sqrt{\frac{L}{n}}\right]\subseteq\left[\varepsilon,\varepsilon+(1-\varepsilon)^{2}\right]$ , we obtain

[TABLE]

whose derivation is relegated to Appendix H-B. Combining (166) and (167) and recalling the definition of $\kappa_{2}$ in (54), we have

[TABLE]

Using (168) and the fact by (124) and (126) that $n\leq\bar{n}+L$ , we have

[TABLE]

This completes the proof.

VII When the Length of Each Energy Arrival Block Grows Linearly in Blocklength

This section focuses on the scenario $L=\lfloor\lambda n\rfloor$ for some real constant $\lambda\in(0,1]$ . Define

[TABLE]

to be the number of length- $L$ energy-arrival blocks. The total number of energy-arrival blocks is $\rho+1$ where the length of each of the first $\rho$ energy-arrival blocks equals $L$ and the length of the $(\rho+1)^{\text{th}}$ energy-arrival block equals

[TABLE]

The following proposition gives us a lower bound and an upper bound on the length of the $(\rho+1)^{\text{th}}$ energy-arrival block, which are useful for the achievability and the converse proofs of Theorem 2 respectively. The proof of Proposition 12 is straightforward and is deferred to Appendix I.

Proposition 12

For all sufficiently large $n\in\mathbb{N}$ ,

[TABLE]

and

[TABLE]

where $n-\rho L$ is the length of the $(\rho+1)^{\text{th}}$ energy-arrival block and $q$ and $d$ were defined in (34) and (35) respectively.
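The block structure in the linear regime can be made concrete with a short sketch. It assumes that $\rho=\lfloor n/L\rfloor$ (the display defining $\rho$ is not reproduced in this text), while the last-block length $n-\rho L$ is as stated above. The function name `block_layout` is hypothetical.

```python
import math


def block_layout(n: int, lam: float) -> tuple[int, int, int]:
    """Return (L, rho, last_len) for blocklength n and lam in (0, 1].

    L        = floor(lam * n), the length of each full energy-arrival block
    rho      = floor(n / L), assumed number of full blocks
    last_len = n - rho * L, the length of the (rho + 1)-th block (may be 0)
    """
    if not 0.0 < lam <= 1.0:
        raise ValueError("lam must lie in (0, 1]")
    L = math.floor(lam * n)
    if L < 1:
        raise ValueError("n is too small for this lam")
    rho = n // L
    return L, rho, n - rho * L


if __name__ == "__main__":
    print(block_layout(103, 0.25))  # L = 25, four full blocks, last block of length 3
```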

VII-A Achievability proof of Theorem 2

In this section, we propose an adaptive save-and-transmit code which will be used to prove the achievability part of Theorem 2. The adaptive save-and-transmit code enables the source to transmit information at a rate close to $\mathrm{C}(E_{b_{\ell}+1})$ for each block $\ell\in\{1,2,\ldots,\rho+1\}$ . For each block $\ell$ , since the destination does not know the EH random variable $E_{b_{\ell}+1}$ , the source first needs to quantize $E_{b_{\ell}+1}$ and convey the quantized version to the destination before adjusting the transmission rate. To facilitate discussion, we let

[TABLE]

and define the set of quantization points

[TABLE]

In addition, define the quantization mapping $g^{\Delta}:\mathbb{R}_{+}\rightarrow\Gamma$ such that $g^{\Delta}(a)$ is the unique quantization point that satisfies

[TABLE]
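A minimal sketch of such a quantizer follows. It assumes a uniform grid $\Gamma=\{0,\Delta,2\Delta,\ldots\}$ and rounding down, so that $g^{\Delta}(a)\leq a<g^{\Delta}(a)+\Delta$; the exact definitions in (175) and (176) are not reproduced in this text, and the only property relied on later is $g^{\Delta}(E_{1})\leq E_{1}$. The name `quantize_down` is hypothetical.

```python
import math


def quantize_down(a: float, delta: float) -> float:
    """Map a >= 0 to the largest multiple of delta not exceeding a.

    Assumption: the quantization points form the uniform grid
    {0, delta, 2*delta, ...}; the paper's grid may be truncated or
    parameterized differently.
    """
    if a < 0 or delta <= 0:
        raise ValueError("need a >= 0 and delta > 0")
    return math.floor(a / delta) * delta


if __name__ == "__main__":
    # The quantized energy never exceeds the harvested energy, so the
    # source can always afford symbols of power quantize_down(E, delta).
    print(quantize_down(1.7, 0.5))
```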

In order to enable communication at a rate close to $\mathrm{C}(g^{\Delta}(E_{b_{\ell}+1}))$ and with error probability $O(\frac{1}{L^{1/6}})$ in block $\ell$ for each $\ell\in\{1,2,\ldots,\rho+1\}$ , we propose to use an adaptive save-and-transmit code in each block so that node $\mathrm{s}$ can adapt the coding rate to the EH process.

Definition 9

An $(L,\Delta,\varepsilon)$ -adaptive code consists of the following:

A message alphabet $\mathcal{U}^{\infty}\triangleq\{0,1\}^{\infty}$ . The message $U^{\infty}$ is a sequence of i.i.d. uniform bits.

An adaptive encoding function $f:\Gamma\times\mathcal{U}^{\infty}\rightarrow\mathbb{R}^{L}$ which depends on $g^{\Delta}(E_{1})$ such that

[TABLE]

and

[TABLE]

for each $k\in\{1,2,\ldots,L\}$ .

A decoding function $\varphi:\mathbb{R}^{L}\rightarrow\mathcal{U}^{\infty}$ for decoding $W$ at node $\mathrm{d}$ such that the message estimate $\hat{U}^{\infty}$ is produced by setting $\hat{U}^{\infty}\triangleq\varphi(Y^{L})$ . Define the mapping $\gamma:\mathbb{N}\times\mathbb{R}_{+}$ as

[TABLE]

Then, the probability of decoding error adapted to $g^{\Delta}(E_{1})$ , which is defined as

[TABLE]

is no larger than $\varepsilon$ .

By Definition 9, node $\mathrm{s}$ can use an $(L,\Delta,\varepsilon)$ -adaptive code to transmit $\gamma(L,E_{1})$ bits to node $\mathrm{d}$ with small error probability in each length- $L$ energy-arrival block. We use “adaptive” to describe the code because the number of bits that can be conveyed by the code changes with $E_{1}$ . We will prove the achievability part of Theorem 2 by using an adaptive code that has the following two features for every $\ell\in\{1,2,\ldots,\rho\}$ :

(i)

Each of the first $\left\lceil\sqrt{L}\right\rceil$ symbols in the $\ell^{\text{th}}$ block sent by node $\mathrm{s}$ is the constant symbol $\sqrt{g^{\Delta}(E_{b_{\ell}+1})}$ so that with probability larger than $1-\frac{1}{L^{1/6}}$ , the destination can estimate $g^{\Delta}(E_{b_{\ell}+1})$ correctly.

(ii)

In the remaining $L-\left\lceil\sqrt{L}\right\rceil$ symbols in the $\ell^{\text{th}}$ block, node $\mathrm{s}$ intends to use a Gaussian codebook with average power $g^{\Delta}(E_{b_{\ell}+1})$ to transmit i.i.d. uniform bits at a rate close to $\mathrm{C}(g^{\Delta}(E_{b_{\ell}+1}))$ and with error probability $\leq\frac{1}{L^{1/6}}$ .
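Feature (i) can be illustrated with a small Monte Carlo sketch. The estimator below averages the received pilot symbols, squares the (non-negative part of the) average and rounds to the nearest multiple of $\Delta$; this is an assumed estimator chosen for illustration, not necessarily the rule in (183). The channel is $Y_{k}=X_{k}+Z_{k}$ with standard normal noise, and all function names are hypothetical.

```python
import math
import random


def nearest_grid_point(value: float, delta: float) -> float:
    """Round a non-negative value to the nearest multiple of delta."""
    return max(0, round(value / delta)) * delta


def estimate_quantized_energy(received: list[float], delta: float) -> float:
    """Assumed estimator: square the sample mean of the pilot outputs and
    snap the result to the uniform grid {0, delta, 2*delta, ...}."""
    mean = sum(received) / len(received)
    return nearest_grid_point(max(mean, 0.0) ** 2, delta)


def simulate_pilot_error(g: float, delta: float, L: int,
                         trials: int, seed: int = 0) -> float:
    """Fraction of trials in which the destination misidentifies g when the
    source repeats sqrt(g) over ceil(sqrt(L)) unit-variance AWGN uses."""
    rng = random.Random(seed)
    m = math.ceil(math.sqrt(L))
    amplitude = math.sqrt(g)
    errors = 0
    for _ in range(trials):
        y = [amplitude + rng.gauss(0.0, 1.0) for _ in range(m)]
        if not math.isclose(estimate_quantized_energy(y, delta), g, abs_tol=1e-9):
            errors += 1
    return errors / trials


if __name__ == "__main__":
    for L in (10**2, 10**4, 10**6):
        print(L, simulate_pilot_error(g=1.5, delta=0.5, L=L, trials=2000))
```

With a fixed $\Delta$ the empirical error rate shrinks as $L$ grows, because the noise in the sample mean has variance $1/\lceil\sqrt{L}\rceil$. In the paper $\Delta$ itself depends on $L$ through (174), which this sketch does not model.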

Feature (i) is based on Proposition 13 to be presented later. Feature (ii) will be established through proving the existence of an adaptive code with the desired properties in Lemma 15. The proof of the following proposition is simple and thus deferred to Appendix J.

Proposition 13

[TABLE]

The following lemma is useful for proving the achievability part of Theorem 2. Since Lemma 14 is a direct consequence of [15, Th. 1], its proof is relegated to Appendix K.

Lemma 14

The following statement holds for any sufficiently large $L\in\mathbb{N}$ . Fix an arbitrary $\tilde{P}>0$ and suppose $E_{1}=E_{2}=\ldots=E_{L}=\tilde{P}$ holds with probability $1$ . Then, there exists an $\left(L-\left\lceil\sqrt{L}\right\rceil,M,\frac{1}{\sqrt{L}}\right)$ -code such that

[TABLE]

To facilitate discussion, we call the $\left(L-\left\lceil\sqrt{L}\right\rceil,M,\frac{1}{\sqrt{L}}\right)$ -code an $\left(L-\left\lceil\sqrt{L}\right\rceil,M,\tilde{P},\frac{1}{\sqrt{L}}\right)$ -code.

The following lemma is based on Proposition 13 and Lemma 14.

Lemma 15

For any sufficiently large $L\in\mathbb{N}$ , there exists an $\left(L,\Delta,\frac{1}{L^{1/6}}+\frac{1}{\sqrt{L}}\right)$ -adaptive code.

Proof:

We construct an $\left(L,\Delta,\frac{1}{L^{1/6}}+\frac{1}{\sqrt{L}}\right)$ -adaptive code in two steps as follows.

In each of the first $\left\lceil\sqrt{L}\right\rceil$ time slots in the length- $L$ block, node $\mathrm{s}$ sends the constant symbol $\sqrt{g^{\Delta}(E_{1})}$ , which is always possible because $g^{\Delta}(E_{1})\leq E_{1}$ by (176). Upon receiving $Y^{\left\lceil\sqrt{L}\right\rceil}$ , node $\mathrm{d}$ produces an estimate of $g^{\Delta}(E_{1})$ , denoted by $\hat{g}^{\Delta}(E_{1})$ , by setting

[TABLE]

It follows from the definition of $\Gamma$ in (175), Proposition 13 and (183) that

[TABLE]

In the remaining $L-\left\lceil\sqrt{L}\right\rceil$ time slots, node $\mathrm{s}$ will choose an $\left(L-\left\lceil\sqrt{L}\right\rceil,M,\frac{1}{\sqrt{L}}\right)$ -code based on the knowledge of $E_{1}$ as follows: Node $\mathrm{s}$ calculates $g^{\Delta}(E_{1})$ and transmits $\gamma(L,E_{1})$ i.i.d. uniform bits $U^{\gamma(L,E_{1})}$ using a predetermined $\left(L-\left\lceil\sqrt{L}\right\rceil,M,g^{\Delta}(E_{1}),\frac{1}{\sqrt{L}}\right)$ -code whose existence is guaranteed by Lemma 14. The encoding strategy of $\mathrm{s}$ is known to node $\mathrm{d}$ , which will decode the bits using the decoder of the $\left(L-\left\lceil\sqrt{L}\right\rceil,M,\hat{g}^{\Delta}(E_{1}),\frac{1}{\sqrt{L}}\right)$ -code predetermined by node $\mathrm{s}$ and output the bit estimates $\hat{U}^{\gamma(L,E_{1})}$ . By the definition of the codes,

[TABLE]

For the adaptive code described above, it follows from (183) and (185) together with the union bound that

[TABLE]

which implies that the adaptive code is an $\left(L,\Delta,\frac{1}{L^{1/6}}+\frac{1}{\sqrt{L}}\right)$ -adaptive code. ∎

We are ready to prove the achievability part of Theorem 2.

Proof:

Fix an $\varepsilon\in(0,1)$ . Our goal is to prove

[TABLE]

It suffices to show that

[TABLE]

for all $\eta>0$ . Fix an arbitrary $\eta>0$ . By the definition of $\underline{R}_{\varepsilon}$ in (36), we have

[TABLE]

for some $\delta>0$ . Let $\chi_{\delta}>0$ be a sufficiently large number such that

[TABLE]

We want to show that there exists a sequence of $(n,M_{n},\varepsilon)$ -codes such that

[TABLE]

which will then imply (188). To this end, fix a sufficiently large $n\in\mathbb{N}$ such that (172) holds, (173) holds,

[TABLE]

and

[TABLE]

where $\Delta$ is as defined in (174). The number of i.i.d. uniform bits that can be transmitted by the code is chosen to be

[TABLE]

In the rest of the proof, we construct an $(n,M_{n})$ -code and then show that its error probability is bounded above by $\varepsilon$ .

**Construction of an $(n,M_{n})$ -code:**

Recall that the length of each of the first $\rho$ blocks is $L=\lfloor\lambda n\rfloor$ . To facilitate discussion, define

[TABLE]

to be the length of the $\ell^{\text{th}}$ block for each $\ell\in\{1,2,\ldots,\rho\}$ , and define

[TABLE]

to be a lower bound on the length of the $(\rho+1)^{\text{th}}$ block (due to (173)). Since $\rho L+L_{\rho+1}\leq n$ by construction, we will construct an $(n,M_{n})$ -code by concatenating $\rho$ blocks of length- $L$ adaptive codes and one block of length- $L_{\rho+1}$ adaptive code as described below. The message of the $(n,M_{n})$ -code is a sequence of $\log M_{n}$ i.i.d. uniform bits denoted by $U^{\log M_{n}}$ . Then for each block $\ell\in\{1,2,\ldots,\rho+1\}$ , node $\mathrm{s}$ uses an $\left(L_{\ell},\Delta,\frac{1}{L_{\ell}^{1/6}}+\frac{1}{\sqrt{L_{\ell}}}\right)$ -adaptive code to transmit $\gamma(L,E_{b_{\ell}+1})$ i.i.d. uniform bits. A decoding error is declared if one of the following cases occurs:

(i)

The total number of transmitted bits is less than $\log M_{n}$ , i.e., the following event occurs:

[TABLE]

(ii)

Provided that $\mathcal{F}^{c}$ occurs, the bit estimates output by $\mathrm{d}$ denoted by $\hat{U}^{\log M_{n}}$ are not equal to the transmitted bits, i.e., the following event occurs:

[TABLE]
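The first error event, a shortfall in the total number of bits carried by the $\rho+1$ adaptive codes, can be sketched as follows. The per-block bit count used here, $\lfloor(L_{\ell}-\lceil\sqrt{L_{\ell}}\rceil)\,\mathrm{C}(g)\rfloor$ with $\mathrm{C}(x)=\frac{1}{2}\log_{2}(1+x)$, is a simplified stand-in for the mapping $\gamma$ in Definition 9, whose exact form (including any back-off terms) is not reproduced in this text. All names are hypothetical.

```python
import math


def capacity_bits(snr: float) -> float:
    """AWGN capacity in bits per channel use, C(x) = 0.5 * log2(1 + x)."""
    return 0.5 * math.log2(1.0 + snr)


def block_bits(block_len: int, quantized_energy: float) -> int:
    """Assumed stand-in for gamma: bits sent in one block after spending
    ceil(sqrt(block_len)) symbols on conveying the quantized energy."""
    pilot_len = math.ceil(math.sqrt(block_len))
    return math.floor((block_len - pilot_len) * capacity_bits(quantized_energy))


def shortfall_event(block_lens: list[int], quantized_energies: list[float],
                    target_bits: int) -> bool:
    """True if the blocks together carry fewer than target_bits bits,
    which corresponds to error case (i)."""
    total = sum(block_bits(length, g)
                for length, g in zip(block_lens, quantized_energies))
    return total < target_bits


if __name__ == "__main__":
    # Two blocks of length 100 with quantized energy 3 carry 90 bits each.
    print(shortfall_event([100, 100], [3.0, 3.0], target_bits=181))
```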

**Analysis of the Error Probability:**

In the rest of the proof, all the probability terms are evaluated according to the distribution induced by the $(n,M_{n})$ -code constructed above. Since the $(n,M_{n})$ -code is a concatenation of $\rho$ blocks of $\left(L,\Delta,\frac{1}{L^{1/6}}+\frac{1}{\sqrt{L}}\right)$ adaptive codes and one block of $\left(L_{\rho+1},\Delta,\frac{1}{L_{\rho+1}^{1/6}}+\frac{1}{\sqrt{L_{\rho+1}}}\right)$ adaptive code, it follows from Definition 9 and the union bound that

[TABLE]

which together with (172) and (197) implies that the error probability of the $(n,M_{n})$ -code is bounded above as

[TABLE]
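To see why the decoding-error contribution is negligible in the linear regime, the following sketch evaluates the union bound over $\rho$ blocks of length $L$ and one block of length $L_{\rho+1}$, each with per-block error $\frac{1}{L_{\ell}^{1/6}}+\frac{1}{\sqrt{L_{\ell}}}$ as in Lemma 15. It covers only the decoding errors of the adaptive codes, not the shortfall event $\mathcal{F}$, and the function names are hypothetical.

```python
def adaptive_block_error(block_len: int) -> float:
    """Per-block error level 1/L^(1/6) + 1/sqrt(L) from Lemma 15."""
    return block_len ** (-1.0 / 6.0) + block_len ** (-0.5)


def concatenated_error_bound(rho: int, L: int, L_last: int) -> float:
    """Union bound over rho blocks of length L and one block of length L_last."""
    return rho * adaptive_block_error(L) + adaptive_block_error(L_last)


if __name__ == "__main__":
    # With L = floor(lam * n), rho stays bounded (roughly 1/lam) while each
    # per-block term vanishes, so the bound tends to zero as n grows.
    for L in (10**3, 10**6, 10**9):
        print(L, concatenated_error_bound(rho=3, L=L, L_last=L // 2))
```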

In order to obtain an upper bound on the last term in (200) in terms of $\mathrm{C}(E_{b_{\ell}+1})$ , we consider

[TABLE]

where

(a)

follows from the definition of $g^{\Delta}$ in (176) and the fact that

[TABLE]

(b)

follows from (190) and the union bound.

On the other hand, combining (189) with the fact that $\{E_{b_{\ell}+1}\}_{\ell=1}^{n/L}$ are i.i.d., we have

[TABLE]

which implies that

[TABLE]

which then together with (195), (196) and (194) implies that

[TABLE]

Using (209), (190) and the union bound, we obtain

[TABLE]

Combining (205), (210) and (193), we have

[TABLE]

Using (200), (192) and (211), we have

[TABLE]

Therefore, the constructed $(n,M_{n})$ -code is an $(n,M_{n},\varepsilon)$ -code where $M_{n}$ satisfies (194). Consequently, for any $\eta>0$ , there exists a sequence of $(n,M_{n},\varepsilon)$ -codes where $M_{n}$ satisfies (194) such that (191) holds, which then implies (188). Since $\eta>0$ is arbitrary, we have (187). ∎

VII-B Converse Proof of Theorem 2

Fix an $\varepsilon\in(0,1)$ . Our goal is to prove

[TABLE]

It suffices to show that

[TABLE]

for all $\eta>0$ . Fix an arbitrary $\eta>0$ and an $\varepsilon$ -achievable rate $R$ . By the definition of $\overline{R}_{\varepsilon}$ in (37),

[TABLE]

for some $\delta>0$ . Let $\chi_{\delta}$ be a sufficiently large number such that

[TABLE]

In addition, since $R$ is $\varepsilon$ -achievable, it follows from Definition 4 that there exists a sequence of $(n,M,\varepsilon)$ -codes such that

[TABLE]

Fix a sufficiently large $n\in\mathbb{N}$ such that (172) holds, (173) holds and

[TABLE]

and fix the corresponding $(n,M,\varepsilon)$ -code. Let $p_{W,E^{n},X^{n},Y^{n},\hat{W}}$ be the probability distribution induced by the $(n,M,\varepsilon)$ -code. Unless specified otherwise, all the probability, expectation and variance terms are evaluated according to $p_{W,E^{n},X^{n},Y^{n},\hat{W}}$ . Since $\{E_{b_{\ell}+1}\}_{\ell=1}^{\infty}$ are i.i.d. by assumption, it follows from (216) that

[TABLE]

Define

[TABLE]

and

[TABLE]

Since the average error probability of the code is no larger than $\varepsilon$ , we have

[TABLE]

Consider

[TABLE]

where (a) follows from (222) and Markov’s inequality, which implies that

[TABLE]

**Obtaining a Lower Bound on the Error Probability in Terms of the Type-II Error of a Hypothesis Test**

Define

[TABLE]

for all $e^{n}\in\Psi_{\delta}$ where

[TABLE]

It follows from Proposition 11 and Definition 1 with the identifications $U\equiv W$ , $V\equiv\hat{W}$ , $p_{U,V}\equiv p_{W,\hat{W}|E^{n}=e^{n}}$ , $|\mathcal{W}|\equiv M$ and $\alpha\equiv{\mathrm{Pr}}\left\{\left.W\neq\hat{W}\right.|E^{n}=e^{n}\right\}\leq\varepsilon(e^{n})$ that

[TABLE]

for all $e^{n}\in\Psi_{\delta}$ .

**Using the DPI to Introduce the Channel Inputs and Outputs**

Using the DPI of $\beta_{1-\varepsilon(e^{n})}$ in Lemma 10, we have

[TABLE]

for all $e^{n}\in\Psi_{\delta}$ . For each $e^{n}\in\Psi_{\delta}$ , fix a $\xi(e^{n})>0$ to be specified later. Since

[TABLE]

it follows from (229), the definition of $s_{Y^{n},\hat{W}|E^{n}=e^{n}}$ in (227), (230) and Lemma 10 that

[TABLE]

for all $e^{n}\in\Psi_{\delta}$ .

**Simplifying the Non-Asymptotic Bound**

Combining (17) and (228), we have

[TABLE]

for each $e^{n}\in\Psi_{\delta}$ and each $k\in\{1,2,\ldots,n\}$ . Let

[TABLE]

for each $k\in\{1,2,\ldots,n\}$ . Then, it follows from (234) and the energy-harvesting constraints (15) that

[TABLE]

Combining (233) and (236), we have for each $e^{n}\in\Psi_{\delta}$

[TABLE]

In order to simplify the RHS of (237), we choose

[TABLE]

recall the definition of $\Psi_{\delta}$ in (221) and rewrite (237) as

[TABLE]

**Applying Chebyshev’s inequality**

Following (239), we evaluate for each $e^{n}\in\Psi_{\delta}$

[TABLE]

and

[TABLE]

where (a) and (b) are due to the definition of $U_{k}(e_{k})$ in (235) and the following fact: $\{Y_{k}-X_{k}\}_{k=1}^{n}$ are i.i.d. standard normal random variables that are independent of $(E^{n},X^{n})$ . Using (240), (244) and Chebyshev’s inequality, we have for each $e^{n}\in\Psi_{\delta}$

[TABLE]

Combining (239) and (246), we have

[TABLE]

Since the number of energy-arrival blocks of length $L$ equals $q$ by (172) and the length of the last block is no larger than $nd+\left\lfloor\frac{1}{\lambda}\right\rfloor$ by (173), it follows from (247) that

[TABLE]

In order to simplify the first term in (249), we define $\phi:\mathbb{R}_{+}^{n}\rightarrow\mathbb{R}$ as

[TABLE]

and consider the following chain of inequalities where the sets $\Psi$ and $\Omega$ are assumed to be Borel measurable:

[TABLE]

Combining (247) and (255), we obtain

[TABLE]

which together with (217) implies that

[TABLE]

Since $\eta>0$ is arbitrary and $R$ is an arbitrary $\varepsilon$ -achievable rate, the inequality (213) follows from (257).

VIII Concluding Remarks and Future Work

This paper studies the $\varepsilon$ -capacity and the second-order coding rate for the AWGN EH channel with an infinite battery under the assumption that the error probabilities do not vanish as the blocklength increases. The EH process is assumed to be block i.i.d. where the blocks have length $L$ .

For the case where $L$ is a constant or grows sublinearly in the blocklength $n$ , we have the following two findings stated in Theorem 1: (i) The $\varepsilon$ -capacity is the same for all $\varepsilon\in(0,1)$ , i.e., the strong converse holds; (ii) A lower bound and an upper bound on the second-order coding rate have been obtained, where the lower bound is obtained by analyzing the conventional save-and-transmit strategy [3].

For the case where $L$ grows linearly in $n$ , we prove in Theorem 2 a lower bound and an upper bound on the $\varepsilon$ -capacity.

Two interesting directions for future research are obtaining the full characterization of the $\varepsilon$ -capacity and good approximations of the second-order coding rate for $L=\lambda n$ , i.e., a strengthening of Theorem 2. In addition, while this work investigates only optimal codes which have high decoding complexity, future research may compare the performance of an industrial low-complexity yet (first-order) optimal code for the AWGN channel with that of its (adaptive) save-and-transmit counterpart for the AWGN EH channel (as performed for the binary-input EH channel in [25]). Finally, one may explore analogies among AWGN EH channels, slow fading channels and channels with mixed states for the case in which $L$ grows linearly in $n$ .

Appendix A Proof of Corollary 1

In view of Theorem 1, it suffices to prove that $V_{\varepsilon}^{--}\leq V_{\varepsilon}^{-}$ for the case where $L$ is a constant. To this end, we let $L$ be a fixed constant and consider the following chain of inequalities:

[TABLE]

where

(a)

is due to the easily verified fact that

[TABLE]

(b)

is due to the fact that $\varepsilon<1/2<\min\big\{1-\frac{1}{\sqrt{2\pi\mathrm{e}}},\frac{1}{\sqrt{\mathrm{e}}}\big\}$ .

Appendix B Proof of Corollary 2

Fix any $\varepsilon\in(0,\Phi(-1))$ . Using the easily verified fact that $\Phi^{-1}(\varepsilon)\leq-\sqrt{2\ln\frac{1}{\varepsilon}}$ , we obtain that

[TABLE]

which together with (31) and (30) implies that (32) holds. The rest of the proof is dedicated to showing $\Phi^{-1}(\varepsilon)\leq-\sqrt{2\ln\frac{1}{\varepsilon}}$ . Let $a=\Phi^{-1}(\varepsilon)$ . Since $a\leq-1$ due to the assumption that $\varepsilon\leq\Phi(-1)$ , we have $\varepsilon=\Phi(a)\leq\mathrm{e}^{-a^{2}/2}$ , which then implies that $a\leq-\sqrt{2\ln\frac{1}{\varepsilon}}$ .

Appendix C Proof of Corollary 3

Suppose $E_{1}$ has a continuous and strictly increasing cdf (i.e., the mapping $a\mapsto{\mathrm{Pr}}\{E_{1}\leq a\}$ is continuous and strictly increasing on $[0,\infty)$ ). It follows that

[TABLE]

is continuous in $r$ and strictly increasing, which then implies that

[TABLE]

which together with Theorem 2 implies that (39) holds for all $\varepsilon\in(0,1)$ .

Appendix D Proof of Corollary 5

To facilitate discussion, let $m_{n}$ denote the RHS of (51). Simple calculations reveal that

[TABLE]

For each $n^{*}\in\mathbb{N}$ , let $\tilde{n}$ be the unique natural number that satisfies

[TABLE]

It is clear from (272) and (273) that

[TABLE]

By Lemma 4, there exists for each sufficiently large $n\in\mathbb{N}$ an $(n+\lfloor m_{n}\rfloor,M,\varepsilon)$ -code such that (45) holds, which implies from the left inequality in (273) that for each sufficiently large $n^{*}\in\mathbb{N}$ , there exists an $(n^{*},M,\varepsilon)$ -code such that

[TABLE]

which then implies from the right inequality in (273) that

[TABLE]

Combining the facts that

[TABLE]

and

[TABLE]

we conclude that

[TABLE]

This completes the proof.

Appendix E Proof of Lemma 7

In this proof, all the probability, expectation and variance terms are evaluated according to $p_{X^{n}}p_{E^{m+n}}$ . In order to obtain an upper bound on ${\mathrm{Pr}}\left\{\bigcup_{k=1}^{n}\left\{\sum_{i=1}^{k}X_{i}^{2}\geq\sum_{i=1}^{m+k}E_{i}\right\}\right\}$ , we construct the following sequence denoted by $\{B_{k}\}_{k=1}^{m+n}$ . For each $k\in\{1,2,\ldots,m+n\}$ , define $B_{k}$ recursively (the construction of $\{B_{k}\}_{k=1}^{m+n}$ is inspired by a standard proof of Kolmogorov’s inequality) as

[TABLE]

By inspecting (282), we have

[TABLE]

which implies that

[TABLE]

Define for each $k\in\{1,2,\ldots,m+n\}$

[TABLE]

Following (284), we consider the following chain of inequalities for any $t>0$ :

[TABLE]

where (a) follows from Markov’s inequality. In order to simplify the RHS of (289), we consider the following chain of inequalities for each $i\in\{1,\ldots,n\}$ :

[TABLE]

where (a) follows from the independence between $(E_{m+i},X_{i})$ and $U^{m+i-1}$ due to the independence between $(E_{m+i},X_{i})$ and $(E^{m+i-1},X^{i-1})$ . Combining (289) and (294), we have

[TABLE]

Since

[TABLE]

by (286), it follows from (295) that

[TABLE]

Since $X_{1}$ and $E_{1}$ are independent, we can rewrite (297) as

[TABLE]

In order to simplify the RHS of (298), we use the following two facts, whose proofs can be found in [15, Appendix]: For any $y\geq 0$ ,

[TABLE]

and

[TABLE]

Fix a sufficiently small $t>0$ such that ${\mathbb{E}}[X_{1}^{4}\mathrm{e}^{tX_{1}^{2}}]<\infty$ and $P-\frac{t{\mathbb{E}}[E_{1}^{2}]}{2}>0$ . Following (298), we use (299), (300) and (58) to obtain

[TABLE]

and

[TABLE]

which implies that

[TABLE]

Define $a_{t}$ and $b_{t}$ as in (60) and (61) respectively. It then follows from (301) and (303) that

[TABLE]

and

[TABLE]

Combining (284), (298), (304), (305), (299) and (300), we obtain

[TABLE]
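The event bounded in this appendix, namely that the energy spent by the first $k$ transmitted symbols reaches the energy harvested over the $m$ saving slots plus the first $k$ transmission slots for some $k\leq n$, can be checked numerically. The sketch below tests the event for a given realization and estimates its probability by simulation. The exponential energy distribution with mean $P$ and all function names are assumptions made for illustration; Lemma 7 itself only requires suitable moment conditions.

```python
import math
import random


def violates_energy_constraint(x: list[float], e: list[float], m: int) -> bool:
    """True if sum_{i<=k} x_i^2 >= sum_{i<=m+k} e_i for some k in 1..len(x)."""
    if len(e) < m + len(x):
        raise ValueError("need at least m + len(x) energy arrivals")
    harvested = sum(e[:m])          # energy saved before transmission starts
    used = 0.0
    for k, symbol in enumerate(x):
        used += symbol * symbol
        harvested += e[m + k]
        if used >= harvested:
            return True
    return False


def estimate_violation_probability(n: int, m: int, power: float,
                                   trials: int, seed: int = 0) -> float:
    """Monte Carlo estimate with X_i ~ N(0, power) i.i.d. and, as an
    assumption, E_i i.i.d. exponential with mean `power`."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        x = [rng.gauss(0.0, math.sqrt(power)) for _ in range(n)]
        e = [rng.expovariate(1.0 / power) for _ in range(m + n)]
        hits += violates_energy_constraint(x, e, m)
    return hits / trials


if __name__ == "__main__":
    for m in (0, 20, 80):
        print(m, estimate_violation_probability(n=400, m=m, power=1.0, trials=500))
```

Increasing the saving period $m$ drives the estimated probability down, which is the qualitative behaviour the bound in Lemma 7 quantifies.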

Appendix F Proof of Corollary 8

Since $\{E_{k}\}_{k=1}^{m+n}$ is distributed in a block-i.i.d. manner with block size $L$ , we cannot apply Lemma 7 directly for $L>1$ to bound ${\mathrm{Pr}}_{p_{X^{n}}p_{E^{m+n}}}\left\{\bigcup_{k=1}^{n}\left\{\sum_{i=1}^{k}X_{i}^{2}\geq\sum_{i=1}^{m+k}E_{i}\right\}\right\}$ . In the following, we will construct two sequences based on $\{E_{k}\}_{k=1}^{m+n}$ and $\{X_{k}\}_{k=1}^{n}$ so that Lemma 7 can be applied to the resultant sequences. Define

[TABLE]

and

[TABLE]

Let $\{X_{k}\}_{k=1}^{\bar{n}}$ be a sequence of i.i.d. random variables where $X_{1}\sim\mathcal{N}(x_{1};0,P)$ , and let $\{E_{k}\}_{k=1}^{\bar{m}+\bar{n}}$ be a sequence of random variables that are distributed according to (13). Since $\bar{n}\geq n$ and $\bar{m}\leq m$ , we have

[TABLE]

To simplify notation, define

[TABLE]

for each $\ell\in\{1,2,\ldots,(\bar{m}+\bar{n})/L\}$ , and define

[TABLE]

for each $\ell\in\{1,2,\ldots,\bar{n}/L\}$ . For each $k\in\{1,2,\ldots,\bar{n}\}$ , define $\nu(k)$ to be the unique integer in $\{1,2,\ldots,\bar{n}/L\}$ that satisfies $k\in\{b_{\nu(k)}+1,b_{\nu(k)}+2,\ldots,b_{\nu(k)}+L\}$ , i.e., $\nu(k)$ is the index of the information block that contains the $k^{\text{th}}$ symbol of $X^{\bar{n}}$ . Then for each $k\in\{1,2,\ldots,\bar{n}\}$ ,

[TABLE]

and

[TABLE]

with probability $1$ . Therefore,

[TABLE]

for each $k\in\{1,2,\ldots,\bar{n}\}$ , which implies that

[TABLE]

which then implies from (310) that

[TABLE]
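The block index $\nu(k)$ used in the argument above has a simple closed form once $b_{\ell}$ is fixed. The sketch assumes $b_{\ell}=(\ell-1)L$, i.e., $b_{\ell}$ counts the symbols preceding block $\ell$; the definition in (2) is not reproduced in this text, and the function names are hypothetical.

```python
def block_start(ell: int, L: int) -> int:
    """Assumed definition b_ell = (ell - 1) * L."""
    return (ell - 1) * L


def block_index(k: int, L: int) -> int:
    """Return nu(k): the unique ell with b_ell + 1 <= k <= b_ell + L (k is 1-based)."""
    if k < 1 or L < 1:
        raise ValueError("k and L must be positive")
    return (k - 1) // L + 1


if __name__ == "__main__":
    L = 4
    print([block_index(k, L) for k in range(1, 13)])
```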

By construction, $\left\{\tilde{E}_{\ell}\right\}_{\ell=1}^{(\bar{m}-L+\bar{n})/L}$ is a sequence of i.i.d. random variables with

[TABLE]

and

[TABLE]

and $\left\{\tilde{X}_{\ell}\right\}_{\ell=1}^{\bar{n}/L}$ is a sequence of i.i.d. Gaussian random variables with

[TABLE]

and

[TABLE]

Combining (318), (321) and (322), we have

[TABLE]

and

[TABLE]

Since (323) holds, we can apply Lemma 7 to $\left\{\tilde{X}_{\ell}\right\}_{\ell=1}^{\bar{n}/L}$ and $\left\{\tilde{E}_{\ell}\right\}_{\ell=1}^{(\bar{m}-L+\bar{n})/L}$ if the following two statements hold for $t_{n}$ (which was defined in (66)):

[TABLE]

and

[TABLE]

To this end, we suppose $n$ and $m$ satisfy (67) and (68) respectively. Recalling the definition of $\beta_{0}$ in (64), we obtain from the definition of $t_{n}$ in (66) and (67) that

[TABLE]

which implies from (324) that

[TABLE]

In addition,

[TABLE]

Consequently, $\left\{\tilde{E}_{\ell}\right\}_{\ell=1}^{(\bar{m}-L+\bar{n})/L}$ and $\left\{\tilde{X}_{\ell}\right\}_{\ell=1}^{\bar{n}/L}$ are two sequences of i.i.d. random variables that satisfy (323), (328) and (330). Therefore, we have the following inequality due to Lemma 7 together with the definitions of $\alpha_{t}$ and $\beta_{t}$ in (63) and (65) respectively and the equalities (318), (319) and (322):

[TABLE]

In order to simplify the RHS of (331), we consider

[TABLE]

Following (331), we consider

[TABLE]

where (a) and (b) follow from the fact due to (66) and (308) that $t_{n}=\sqrt{\frac{\log(1/\varepsilon_{1})}{(\bar{n}/L)\beta_{0}}}$ . Combining (331) and (337), we have

[TABLE]

which implies from (317) that (69) holds.

Appendix G Simple derivations in the proof of Lemma 4

G-A Derivation of (93)

Consider the following chain of inequalities:

[TABLE]

where (a) follows from (91) and the triangle inequality for the $3$ -norm.

G-B Derivation of (104)

By Taylor’s theorem, we have

[TABLE]

for some real number

[TABLE]

Since

[TABLE]

and

[TABLE]

by (10) and (343) respectively, it follows that

[TABLE]

Consequently, (104) follows from (342) and (346).

Appendix H Simple derivations in the proof of Lemma 6

H-A Derivation of (160)

Consider the following chain of inequalities:

[TABLE]

where (a) follows from the triangle inequality for the $3$ -norm.

H-B Derivation of (167)

By Taylor’s theorem, we have

[TABLE]

for some real number

[TABLE]

Since

[TABLE]

by (10) and

[TABLE]

by (350), it follows that

[TABLE]

Appendix I Proof of Proposition 12

Fix any sufficiently large $n$ such that

[TABLE]

Then,

[TABLE]

In addition,

[TABLE]

Combining (358) and (362), we have

[TABLE]

It remains to prove (173). Using (363) and the definition of $L$ in (33), we have

[TABLE]

which together with the definition of $d$ in (35) implies (173).

Appendix J Proof of Proposition 13

Consider the following chain of inequalities where all the probability and expectation terms are evaluated with respect to $p_{E_{1}}p_{Z^{L}}$ :

[TABLE]

where (a) follows from Chebyshev’s inequality.

Appendix K Proof of Lemma 14

Since ${\mathrm{Var}}_{p_{E_{1}}}[E_{1}]=0$ by assumption, it follows that ${\mathbb{E}}_{p_{E_{1}}}[E_{1}^{2}]=\tilde{P}^{2}$ . Therefore, for any $\varepsilon>0$ and any sufficiently large $L$ that satisfies

[TABLE]

and

[TABLE]

we can use [15, Th. 1] to conclude that there exists an $\left(L-\left\lceil L^{2/3}\right\rceil+m,M,\varepsilon\right)$ -code such that

[TABLE]

where

[TABLE]

denotes the length of the initial saving period before any transmission occurs and $L-\left\lceil L^{2/3}\right\rceil$ denotes the length of the actual transmission period. Let $\varepsilon\triangleq\frac{1}{\sqrt{L}}$ and fix a sufficiently large $L$ that satisfies

[TABLE]

and

[TABLE]

Since (368) and (369) hold by (373) and (374), it follows from (371) and (372) that there exists an $\left(L-\left\lceil\sqrt{L}\right\rceil,M,\frac{1}{\sqrt{L}}\right)$ -code such that

[TABLE]

Acknowledgements

The authors would like to thank Associate Editor Michele Wigger and the three anonymous reviewers for the useful comments that greatly improved the presentation of this work.

Bibliography

[1] F. Zhang and V. K. N. Lau, “Closed-form delay-optimal power control for energy harvesting wireless system with finite energy storage,” IEEE Trans. Signal Process., vol. 62, no. 21, pp. 5706–5715, 2014.
[2] D. Tse and P. Viswanath, Fundamentals of Wireless Communication. Cambridge, U.K.: Cambridge University Press, 2005.
[3] O. Ozel and S. Ulukus, “Achieving AWGN capacity under stochastic energy harvesting,” IEEE Trans. Inf. Theory, vol. 58, no. 10, pp. 6471–6483, 2012.
[4] T. S. Han, Information-Spectrum Methods in Information Theory. Springer Berlin Heidelberg, 2003.
[5] M. Hayashi, “Information spectrum approach to second-order coding rate in channel coding,” IEEE Trans. Inf. Theory, vol. 55, no. 11, pp. 4947–4966, 2009.
[6] R. H. Etkin, D. N. C. Tse, and H. Wang, “Gaussian interference channel capacity to within one bit,” IEEE Trans. Inf. Theory, vol. 54, no. 12, pp. 5534–5562, Dec. 2008.
[7] A. Özgür, O. Lévêque, and D. Tse, “Operating regimes of large wireless networks,” Foundations and Trends® in Networking, vol. 5, no. 1, pp. 1–107, 2011. [Online]. Available: http://dx.doi.org/10.1561/1300000016
[8] R. Rajesh, V. Sharma, and P. Viswanath, “Capacity of Gaussian channels with energy harvesting and processing cost,” IEEE Trans. Inf. Theory, vol. 60, no. 5, pp. 2563–2575, 2014.
