Gaussian Multiple Access Channels with One-Bit Quantizer at the Receiver
Borzoo Rassouli, Morteza Varasteh, Deniz Gunduz

TL;DR
This paper investigates the capacity region of a Gaussian multiple access channel with a one-bit ADC at the receiver, revealing that optimal inputs are discrete and providing bounds on their complexity.
Contribution
It establishes that optimal input distributions are discrete and derives bounds on their number of mass points for channels with one-bit quantization.
Findings
Optimal input distributions are discrete.
Upper bounds on the number of mass points are derived.
Capacity region characterization under one-bit quantization.
Abstract
The capacity region of a two-transmitter Gaussian multiple access channel (MAC) under average input power constraints is studied, when the receiver employs a zero-threshold one-bit analog-to-digital converter (ADC). It is proved that the input distributions of the two transmitters that achieve the boundary points of the capacity region are discrete. Based on the position of a boundary point, upper bounds on the number of the mass points of the corresponding distributions are derived.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Gaussian Multiple Access Channels with One-Bit Quantizer at the Receiver
Borzoo Rassouli, Morteza Varasteh and Deniz Gündüz Borzoo Rassouli, Morteza Varasteh and Deniz Gündüz are with the Intelligent Systems and Networks group of Department of Electrical and Electronics, Imperial College London, United Kingdom. emails: {b.rassouli12, m.varasteh12, d.gunduz}@imperial.ac.uk.
Abstract
The capacity region of a two-transmitter Gaussian multiple access channel (MAC) under average input power constraints is studied, when the receiver employs a zero-threshold one-bit analog-to-digital converter (ADC). It is proved that the input distributions of the two transmitters that achieve the boundary points of the capacity region are discrete. Based on the position of a boundary point, upper bounds on the number of the mass points of the corresponding distributions are derived.
Index Terms:
Gaussian multiple access channel, one-bit quantizer, capacity region††This work has been presented partially in [1]..
I Introduction
The energy consumption of an analog-to-digital converter (ADC) (measured in Joules/sample) grows exponentially with its resolution (in bits/sample) [2], [3]. When the available power is limited, for example, for mobile devices with limited battery capacity, or for wireless receivers that operate on limited energy harvested from ambient sources [4], the receiver circuitry may be constrained to operate with low resolution ADCs. The presence of a low-resolution ADC, in particular a one-bit ADC at the receiver, alters the channel characteristics significantly. Such a constraint not only limits the fundamental bounds on the achievable rate, but it also changes the nature of the communication and modulation schemes approaching these bounds. For example, in a real additive white Gaussian noise (AWGN) channel under an average power constraint on the input, if the receiver is equipped with a -bin (i.e., -bit) ADC front end, it is shown in [5] that the capacity-achieving input distribution is discrete with at most mass points. This is in contrast with the optimality of the Gaussian input distribution when the receiver has infinite resolution.
Especially with the adoption of massive multiple-input multiple-output (MIMO) receivers and the millimeter wave (mmWave) technology enabling communication over large bandwidths, communication systems with limited-resolution receiver front ends are becoming of practical importance. Accordingly, there have been a growing research interest in understanding both the fundamental information theoretic limits and the design of practical communication protocols for systems with finite-resolution ADC front ends. In [6], the authors show that for a Rayleigh fading channel with a one-bit ADC and perfect channel state information at the receiver (CSIR), quadrature phase shift keying (QPSK) modulation is capacity-achieving. In case of no CSIR, [7] shows that (QPSK) modulation is optimal when the signal-to-noise (SNR) ratio is above a certain threshold, which depends on the coherence time of the channel, while for SNRs below this threshold, on-off QPSK achieves the capacity. For the point-to-point multiple-input multiple-output (MIMO) channel with a one-bit ADC front end at each receive antenna and perfect CSIR, [8] shows that QPSK is optimal at very low SNRs, while with perfect channel state information at the transmitter (CSIT), upper and lower bounds on the capacity are provided in [9].
To the best of our knowledge, the existing literature on communications with low-resolution ADCs focus exclusively on point-to-point systems. Our goal in this paper is to understand the impact of low-resolution ADCs on the capacity region of a multiple access channel (MAC). In particular, we consider a two-transmitter Gaussian MAC with a one-bit quantizer at the receiver. The inputs to the channel are subject to average power constraints. We show that any point on the boundary of the capacity region is achieved by discrete input distributions. Based on the slope of the tangent line to the capacity region at a boundary point, we propose upper bounds on the cardinality of the support of these distributions.
The paper is organized as follows. Section II introduces the system model. In Section III, the capacity region of a general two-transmitter memoryless MAC under input average power constraints is investigated. Through an example, it is shown that when there is input average power constraint, it is necessary to consider the capacity region with the auxiliary random variable in general. The main result of the paper is presented in Section III, and a detailed proof is given in Section IV. Finally, Section V concludes the paper.
Notations. Random variables are denoted by capital letters, while their realizations with lower case letters. denotes the cumulative distribution function (CDF) of random variable . The conditional probability mass function (pmf) will be written as . For integers , we have . For , denotes the binary entropy function. The unit-step function is denoted by .
II System model and preliminaries
We consider a two-transmitter memoryless Gaussian MAC (as shown in Figure 1) with a one-bit quantizer at the receiver front end. Transmitter encodes its message into a codeword and transmits it over the shared channel. The signal received by the decoder is given by
[TABLE]
where is an independent and identically distributed (i.i.d.) Gaussian noise process, also independent of the channel inputs and with . represents the one-bit ADC operation given by
[TABLE]
This channel can be modelled by the triplet , where () and (), respectively, are the alphabets of the inputs and the output. The conditional pmf of the channel output conditioned on the channel inputs and (i.e. ) is characterized by
[TABLE]
where .
Upon receiving the sequence , the decoder finds the estimates of the messages.
A code for this channel consists of (as in [10]):
- •
two message sets and ,
- •
two encoders, where encoder assigns a codeword to each message , and
- •
a decoder that assigns estimates or an error message to each received sequence .
We assume that the message pair is uniformly distributed over . The average probability of error is defined as
[TABLE]
Average power constraints are imposed on the channel inputs as
[TABLE]
where denotes the element of the codeword .
A rate pair is said to be achievable for this channel if there exists a sequence of codes (satisfying the average power constraints (3)) such that . The capacity region of this channel is the closure of the set of achievable rate pairs .
III Main results
Proposition 1. The capacity region of a two-transmitter memoryless MAC with average power constraints and is the set of non-negative rate pairs that satisfy
[TABLE]
for some , such that Also, it is sufficient to consider .
Proof.
The capacity region of the discrete memoryless (DM) MAC with input cost constraints has been addressed in Exercise 4.8 of [10]. If the input alphabets are not discrete, the capacity region is still the same because: 1) the converse remains the same if the inputs are from a continuous alphabet; 2) the region is achievable by coded time sharing and the discretization procedure (see Remark 3.8 in [10]). Therefore, it is sufficient to show the cardinality bound .
Let be the set of all product distributions (i.e., of the form ) on . Let be a vector-valued mapping defined element-wise as
[TABLE]
Let be the image of under the mapping (i.e., ). Given an arbitrary , we obtain the vector as
[TABLE]
Therefore, is in the convex hull of . By Carathéodory’s theorem [11], can be written as a convex combination of 6 () or fewer points in , which states that it is sufficient to consider . Since is a connected set111 is the product of two connected sets, therefore, it is connected. Each of the sets in this product is connected because of being a convex vector space. and the mapping is continuous222This is a direct result of the continuity of the channel transition probability., is a connected subset of . Therefore, connectedness of refines the cardinality of to .333This refinement of the cardinality is due to the connected version of Carathéodory’s theorem as mentioned in [11, p.267], which is originally due to [12, p.35-36]. ∎
Lemma 1. For the boundary points of that are not sum-rate optimal, it is sufficient to have .
Proof.
Any point on the boundary of the capacity region that does not maximize , is either of the form or for some that satisfies In other words, it is one of the corner points of the corresponding pentagon in (4). As in the proof of Proposition 1, define the mapping , where and are the coordinates of this boundary point conditioned on , and , are the same as and in (5), respectively. The sufficiency of in this case follows similarly to the proof of Proposition 1. ∎
When there is no input cost constraint, the capacity region of the MAC can be characterized either through the convex hull operation as in [10, Theorem 4.2], or with the introduction of an auxiliary random variable as in [10, Theorem 4.3]. The following remark states that when there are input cost constraints, the capacity characterization region requires an auxiliary random variable in general.
Remark 2. Let , such that . Let denote the set of non-negative rate pairs such that
[TABLE]
Let be the convex closure of , where the union is over all product distributions that satisfy the average power constraints.
Let be the set of non-negative rate pairs such that
[TABLE]
for some that satisfies .
It can be verified that . By comparing to the capacity region , we can conclude that . This follows from the fact that in the region , the average power constraint holds for every realization of the auxiliary random variable , which is a stronger condition than used in the capacity region. The following example shows that and can be strictly smaller than .
Consider the same Gaussian MAC with one-bit quantizer at the receiver (as depicted in Figure 1) with the following changes: i) , ii) Besides the average power constraints of , we also impose the constraint that the inputs should have a zero mean, i.e. The capacity region of this channel is the set of non-negative rate pairs such that (4) holds for some which satisfies Also, let be the rate region in Remark 2 with the additional constraints
In order to show that can be strictly smaller than the capacity region, we show that there exists a point in the capacity region which is not in . We have,
[TABLE]
where (6) is due to the fact that is a function of the pair , and the following Markov chain holds: . In (7), we use the inequality , since and are independent and zero mean. Also, the channel from to is characterized by the conditional distribution . Equality in (8) is due to [5], where the maximum is shown to be achieved by the CDF , where is the unit step function. Let , and . For this joint distribution on , we have , and , which results in (9).
In what follows, it is proved that the inequality in (7) is strict. In other words, the sum rate of cannot be obtained by any rate pair in , while it belongs to the capacity region. Let , where and are two zero-mean independent random variables on () satisfying the average power constraint . We show that the minimum Lévy distance444The Lévy distance between two distributions is defined as
between and all the distributions (induced by ) is bounded away from zero. Since and , the distribution of is with . The same applies to with parameter (). The distribution of induced by is given by
[TABLE]
where is similar to convolution operation. Let be the set of all distributions on obtained in this way. It can be easily verified that (see Figure 2) for any given , the Lévy distance between and is
[TABLE]
Subsequently,
[TABLE]
This shows that there is a neighborhood of whose intersection with is empty. Note that any neighborhood with radius less than has this property. Combined with the facts that the mutual information is continuous and is the unique solution555This is due to the strict convexity of , which is used in Jensen’s inequality in [5]., it proves that the inequality in (7) is strict. Therefore, () is smaller than the capacity region in general.
The main result of this paper is provided in the following theorem. It bounds the cardinality of the support set of the capacity achieving distributions.
Theorem 1. Let be an arbitrary point on the boundary of the capacity region of the memoryless MAC with a one-bit ADC front end666The results remain valid if the one-bit ADC has a non-zero threshold. (as shown in Figure 1). is achieved by a distribution in the form of . Also, let be the slope of the line tangent to the capacity region at this point. For any , the conditional input distributions and have at most and points of increase777A point is said to be a point of increase of a distribution if for any open set containing , we have , respectively, where
[TABLE]
Proof.
The proof is provided in Section IV. ∎
Proposition 1, Lemma 1 and Theorem 1 establish upper bounds on the number of mass points of the distributions that achieve a boundary point. The significance of this result is that once it is known that the optimal inputs are discrete with at most certain number of mass points, the capacity region along with the optimal distributions can be obtained via computer programs.
IV Proof of theorem 1
In order to show that the boundary points of the capacity region are achieved, it is sufficient to show that the capacity region is a closed set, i.e., it includes all of its limit points.
Let be a set with , and be defined as
[TABLE]
which is the set of all CDFs on the triplet , where is drawn from , and the Markove chain and the corresponding average power constraints hold.
In Appendix A, it is proved that is a compact set. Since a continuous mapping preserves compactness, the capacity region is compact. Since the capacity region is a subset of , it is closed and bounded888Note that a subset of is compact if and only if it is closed and bounded [13].. Therefore, any point on the boundary of the capacity region is achieved by a distribution denoted by .
Since the capacity region is a convex space, it can be characterized by its supporting hyperplanes. In other words, any point on the boundary of the capacity region, denoted by , can be written as
[TABLE]
for some .
Any rate pair must lie within a pentagon defined by (4) for some that satisfies the power constraints. Therefore, due to the structure of the pentagon, the problem of finding the boundary points is equivalent to the following maximization problem.
[TABLE]
where on the right hand side (RHS) of (13), the maximizations are over all that satisfy the power constraints. It is obvious that when , the two lines in (13) are the same, which results in the sum capacity.
For any product of distributions and the channel in (1), let be defined as
[TABLE]
With this definition, (13) can be rewritten as
[TABLE]
where the second maximization is over product distributions of the form , such that
[TABLE]
Proposition 2. For a given and any , is a concave, continuous and weakly differentiable function of . In the statement of this Proposition, and could be interchanged.
Proof.
The proof is provided in Appendix B. ∎
Proposition 3. Let be two arbitrary non-negative real numbers. For the following problem
[TABLE]
the optimal inputs and , which are not unique in general, have the following properties,
- (i)
The support sets of and are bounded subsets of . 2. (ii)
and are discrete distributions that have at most and points of increase, respectively, where
[TABLE]
Proof.
We start with the proof of the first claim. Assume that , and is given. Consider the following optimization problem:
[TABLE]
Note that , since for any , from (14),
[TABLE]
From Proposition 2, is a continuous, concave function of . Also, the set of all CDFs with bounded second moment (here, ) is convex and compact999The compactness follows from [14, Appendix I]. The only difference is in using Chebyshev’s inequality instead of Markov inequality.. Therefore, the supremum in (16) is achieved by a distribution . Since for any with , we have , the Lagrangian theorem and the Karush-Kuhn-Tucker conditions state that there exists a such that
[TABLE]
Furthermore, the supremum in (17) is achieved by , and
[TABLE]
Lemma 2. The Lagrangian multiplier is nonzero.
Proof.
Having a zero Lagrangian multiplier means the power constraint is inactive. In other words, if , (16) and (17) imply that
[TABLE]
We prove that (19) does not hold by showing that its left hand side (LHS) is strictly less than 1, while its RHS equals 1. The details are provided in Appendix C. ∎
() can be written as
[TABLE]
where we have defined
[TABLE]
and
[TABLE]
is nothing but the pmf of with the emphasis that it has been induced by and . Likewise, is the conditional pmf when is drawn according to . From (20), can be considered as the density of over when is given. can be interpreted in a similar way.
Note that (17) is an unconstrained optimization problem over the set of all CDFs. Since is linear and weakly differentiable in , the objective function in (17) is concave and weakly differentiable. Hence, a necessary condition for optimality of is
[TABLE]
Furthermore, (24) can be verified to be equivalent to
[TABLE]
The justifications of (24), (25) and (26) are provided in Appendix D.
In what follows, we prove that in order to satisfy (26), must have a bounded support by showing that the LHS of (26) goes to with . The following lemma is useful in the sequel for taking the limit processes inside the integrals.
Lemma 3. Let and be two independent random variables satisfying and , respectively (). Considering the conditional pmf in (1), the following inequalities hold.
[TABLE]
Proof.
The proof is provided in Appendix E. ∎
Note that
[TABLE]
where (30) is due to Lebesgue dominated convergence theorem [13] and (27), which permit the interchange of the limit and the integral; (31) is due to the following
[TABLE]
since goes to zero when and is bounded away from zero by (85) ; and (32) is obtained from (85) in Appendix E. Furthermore,
[TABLE]
where (33) is due to Lebesgue dominated convergence theorem along with (29) and (90) in Appendix E; (34) is from (28) and convexity of in when (see Appendix F).
Therefore, from (32) and (34),
[TABLE]
Using a similar approach, we can also obtain
[TABLE]
From (35), (36) and the fact that (see Lemma 2), the LHS of (25) goes to when . Since any point of increase of must satisfy (25) with equality, and , it is proved that has a bounded support, i.e., for some .101010Note that and are determined by the choice of .
Similarly, for a given , the optimization problem
[TABLE]
boils down to the following necessary condition
[TABLE]
for the optimality of . However, there are two main differences between (38) and (26). First is the difference between and . Second is the fact that we do not claim to be nonzero, since the approach used in Lemma 2 cannot be readily applied to . Nonetheless, the boundedness of the support of can be proved by inspecting the behaviour of the LHS of (38) when .
In what follows, i.e., from (IV) to (44), we prove that the support of is bounded by showing that (38) does not hold when is above a certain threshold. The first term on the LHS of (38) is . From (23) and (27), it can be easily verified that
[TABLE]
From (IV), if , the LHS of (38) goes to with , which proves that is bounded.
For the possible case of , in order to show that (38) does not hold when is above a certain threshold, we rely on the boundedness of . Note that, since has a bounded support, we denote its support, without loss of generality, by , where are some non-negative real numbers. Then, we prove that approaches its limit in (IV) from below. In other words, there is a real number such that when , and when . This establishes the boundedness of . In what follows, we only show the former, i.e., when . The latter, i.e., , follows similarly, and it is omitted for the sake of brevity.
By rewriting , we have
[TABLE]
It is obvious that the first term in the RHS of (40) approaches from below when , since . It is also obvious that the remaining terms go to zero when . Hence, it is sufficient to show that the second line of (40) approaches zero from below, which is proved by using the following lemma.
Lemma 4. Let be distributed on according to . We have
[TABLE]
Proof.
The proof is provided in Appendix G. ∎
From (41), we can write
[TABLE]
where (due to concavity of ), and when (due to (41)). Also, from the fact that , we have
[TABLE]
where and when . From (42) and (43), the second line of (40) becomes
[TABLE]
Since and as , there exists a real number such that when . Therefore, the second line of (40) approaches zero from below, which proves that the support of is bounded away from . As mentioned before, a similar argument holds when . This proves that has a bounded support.
Remark 3. We remark here that the order of showing the boundedness of the supports is important. First, for a given (not necessarily bounded), it is proved that is bounded. Then, for a given bounded , it is shown that is also bounded. The order is reversed when , and it follows the same steps as in the case of . Therefore, it is omitted.
We next prove the second claim in Proposition 3. We assume that , and a bounded is given. We already know that for a given bounded , has a bounded support denoted by . Therefore,
[TABLE]
where denotes the set of all probability distributions on the Borel sets of . Let denote the probability of the event , induced by and the given . Also, let denote the second moment of under . The set
[TABLE]
is the intersection of with two hyperplanes.111111Note that is convex and compact.. We can write
[TABLE]
Note that having , the objective function in (47) becomes
[TABLE]
Since the linear part is continuous and is compact121212The continuity of the linear part follows similarly to the continuity arguments in Appendix B. Note that this compactness is due to the closedness of the intersecting hyperplanes in , since a closed subset of a compact set is compact [13]. The hyperplanes are closed due to continuity of and (see (66))., the objective function in (47) attains its maximum at an extreme point of , which, by Dubins’ theorem, is a convex combination of at most three extreme points of . Since the extreme points of are the CDFs having only one point of increase in , we conclude that given any bounded , has at most three mass points.
Now, assume that an arbitrary is given with at most three mass points denoted by . It is already known that the support of is bounded, which is denoted by . Let denote the set of all probability distributions on the Borel sets of . The set
[TABLE]
is the intersection of with four hyperplanes131313Note that here, since we know , the optimal input attains its maximum power of .. In a similar way,
[TABLE]
and having , the objective function in (50) becomes
[TABLE]
Therefore, given any with at most three points of increase, has at most five mass points.
When , the second term on the RHS of (51) disappears, which means that could be replaced by
[TABLE]
where is the probability of the event , which is induced by and the given . Since the number of intersecting hyperplanes has been reduced to two, it is concluded that has at most three points of increase. ∎
Remark 4. Note that, the order of showing the discreteness of the support sets is also important. First, for a given bounded (not necessarily discrete), it is proved that is discrete with at most three mass points. Then, for a given discrete with at most three mass points, it is shown that is also discrete with at most five mass points when , and at most three mass points when . When , the order is reversed and it follows the same steps as in the case of . Therefore, it is omitted.
V Conclusion
We have studied the capacity region of a two-transmitter Gaussian MAC under average input power constraints and one-bit ADC front end at the receiver. We have shown that an auxiliary random variable is necessary for characterizing the capacity region in general. We have derived an upper bound on the cardinality of this auxiliary variable, and proved that the distributions that achieve the boundary points of the capacity region are finite and discrete.
Appendix A
Since , we assume without loss of generality, since what matters in the evaluation of the capacity region is the mass probability of the auxiliary random variable , not its actual values.
In order to show the compactness of , we adopt a general form of the approach in [14].
First, we show that is tight141414A set of probability distributions defined on , i.e. the set of CDFs , is said to be tight, if for every , there is a compact set such that [15]
\mbox{Pr}\big{\{}(X_{1},X_{2},\ldots,X_{k})\in\mathds{R}^{k}\backslash K_{\epsilon}\big{\}}<\epsilon,\ \forall F_{X_{1},X_{2},\ldots,X_{k}}\in\Theta.
. Choose , , such that . Then, from Chebyshev’s inequality,
[TABLE]
Let . It is obvious that is a closed and bounded subset of , and therefore, compact. With this choice of , we have
[TABLE]
where (53) is due to (52). Hence, is tight.
From Prokhorov’s theorem [15, p.318], a set of probability distributions is tight if and only if it is relatively sequentially compact151515A subset of topological space is relatively compact if its closure is compact.. This means that for every sequence of CDFs in , there exists a subsequence that is weakly convergent161616The weak convergence of to (also shown as ) is equivalent to
(54)
for all continuous and bounded functions on . Note that if and only if . to a CDF , which is not necessarily in . If we can show that this is also an element of , then the proof is complete, since we have shown that is sequentially compact, and therefore, compact171717Compactness and sequentially compactness are equivalent in metric spaces. Note that is a metric space with Lévy distance..
Assume a sequence of distributions in that converges weakly to . In order to show that this limiting distribution is also in , we need to show that both the average power constraints and the Markov chain () are preserved under . The preservation of the second moment follows similarly to the argument in [14, Appendix I]. In other words, since is continuous and bounded below, from [16, Theorem 4.4.4]
[TABLE]
Therefore, the second moments are preserved under the limiting distribution .
For the preservation of the Markov chain , we need the following proposition.
Proposition 4. Assume a sequence of distributions over the pair of random variables that converges weakly to . Also, assume that has a finite support, i.e., . Then, the sequence of conditional distributions (conditioned on ) converges weakly to the limiting conditional distribution (conditioned on ), i.e.,
[TABLE]
Proof.
The proof is by contradiction. If (56) is not true, then there exists , such that . This means, from the definition of weak convergence, that there exists a bounded continuous function of , denoted by , such that
[TABLE]
Let be a bounded continuous function that satisfies
[TABLE]
With this choice of , we have
[TABLE]
which violates the assumption of the weak convergence of to . Therefore, (56) holds. ∎
Since in converges weakly to and is finite, from Proposition 4, we have
[TABLE]
where it is obvious that the arguments are and . Since , we have . Also, since the convergence of the joint distribution implies the convergence of the marginals, we have [17], [18, Theorem 2.7],
[TABLE]
which states that under the limiting distribution , the Markov chain is preserved.181818Alternatively, this could be proved by the lower-semicontinuity of the mutual information as follows.
(62)
(63)
where denotes the mutual information under distribution . The last equality is from the conditional independence of and given under . Therefore, , which is equivalent to (61). This completes the proof of the compactness of .
Appendix B Proof of Proposition 2
B-A Concavity
When , we have
[TABLE]
For a given , is a concave function of , while and are linear in . Therefore, is a concave function of . For a given , and are concave functions of , while is linear in . Since , is a concave function of . The same reasoning applies to the case .
B-B Continuity
When , the continuity of the three terms on the RHS of (64) is investigated. Let be a sequence of distributions which is weakly convergent to . For a given , we have
[TABLE]
where (65) is due to the fact that the function can be dominated by 1, which is an absolutely integrable function over . Therefore, is continuous in , and combined with the weak convergence of , we can write
[TABLE]
This allows us to write
[TABLE]
which proves the continuity of in . is a bounded () continuous function of , since it is a continuous function of , and the latter is continuous in (see (66)). Therefore,
[TABLE]
which proves the continuity of in . In a similar way, it can be verified that is a bounded and continuous function of which guarantees the continuity of in , since
[TABLE]
Therefore, for a given , is a continuous function of . Exchanging the roles of and and also the case can be addressed similarly, and are omitted for the sake of brevity.
B-C Weak Differentiability
For a given , the weak derivative of at is given by
[TABLE]
if the limit exists. It can be verified that
[TABLE]
where has been defined in (23). In a similar way, for a given , the weak derivative of at is
[TABLE]
where has been defined in (22). The case can be addressed similarly.
Appendix C Proof of Lemma 2
We have
[TABLE]
where (70) is from the non-negativity of mutual information and the assumption that ; (71) is justified since the function is monotonically decreasing and the sign of the inputs does not affect the average power constraints, and can be assumed non-negative (or alternatively non-positive) without loss of optimality; in (72), we use the fact that , and for , ; (73) is based on the convexity and monotonicity of the function in , which is shown in Appendix F. Therefore, the LHS of (19) is strictly less than 1.
Since has a finite second moment (), from Chebyshev’s inequality, we have
[TABLE]
Fix and consider . By this choice of , we get
[TABLE]
[TABLE]
where (78) is due to (75) and the fact that is minimized over at (or, alternatively at ), and is maximized at . (78) shows that () can become arbitrarily close to 1 given that is large enough. Hence, its supremum over all distributions is 1. This means that (19) cannot hold, and .
Appendix D Justification of (24), (25) and (26)
Let be a vector space, and be a real-valued function defined on a convex domain . Suppose that maximizes on , and that is Gateaux differentiable (weakly differentiable) at . Then, from [19, Th.2, p.178],
[TABLE]
where is the weak derivative of at .
From (69), we have the weak derivative of at as
[TABLE]
Now, the derivation of (24) is immediate by inspecting that the weak derivative of the objective of (17) at is given by
[TABLE]
Letting (81) be lower than or equal to zero (as in (79)) results in (24).
The equivalence of (24) to (25) and (26) follows similarly to the proof of Corollary 1 in [20, p.210].
Appendix E Proof of Lemma 3
(27) is obtained as follows.
[TABLE]
where (82) is due to the fact that the binary entropy function is upper bounded by 1. (83) is justified as follows.
[TABLE]
where (85) is based on the convexity and monotonicity of the function , which is shown in appendix F.
(28) is obtained as follows.
[TABLE]
where (86) is due to convexity of in for .
(29) is obtained as follows.
[TABLE]
where (87) is from ; and (88) is from (86) and (85).
Note that, (88) is integrable with respect to due to the concavity of in for as shown in Appendix F. In other words,
[TABLE]
Appendix F Two convex functions
Let for . We have,
[TABLE]
and
[TABLE]
where . Note that
[TABLE]
where (92) and (93) are, respectively, due to and (). Therefore,
[TABLE]
which makes the second derivative in (91) positive, and proves the (strict) convexity of .
Let for . By simple differentiation, the Hessian matrix of is
[TABLE]
It can be verified that and . Therefore, both eigenvalues of are positive, which makes the matrix positive definite. Hence, is (strictly) convex in .
Appendix G Proof of lemma 4
Let .
It is obvious that
[TABLE]
Therefore, we can write
[TABLE]
for some Note that is a function of . Also, due to concavity of , we have
[TABLE]
From the fact that
[TABLE]
we can also write
[TABLE]
where (96) and (98) have been used in (99). (97) and (99) are depicted in Figure 3.
[TABLE]
Let
[TABLE]
This minimizer satisfies the following equality
[TABLE]
Therefore, we can write
[TABLE]
where (103) is from the definition in (101); (104) is from the expansion of (102), and is the derivative of the binary entropy function; (105) is due to the fact that is a decreasing function.
Applying L’hospital’s rule multiple times, we obtain
[TABLE]
[TABLE]
From (100), (105) and (106), (41) is proved. Note that the boundedness of is crucial in the proof. In other words, the fact that as is the very result of .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] B. Rassouli, M. Varasteh, and D. G nd z, “Capacity region of a one-bit quantized gaussian multiple access channel,” in IEEE Int’l Sym. Inf. Theory (ISIT) , Achen, Germany, Jun. 2017, pp. 2633–2637.
- 2[2] R. Walden, “Analog-to-digital converter survey and analysis,” IEEE Journal on Selected Areas in Comm. , vol. 17, no. 4, pp. 539–550, Apr. 1999.
- 3[3] B. Murmann, “ADC performance survey,” Co RR , vol. abs/1404.7736, 1997-2014. [Online]. Available: http://web.stanford.edu/ murmann/adcsurvey.html
- 4[4] D. Gunduz, K. Stamatiou, N. Michelusi, and M. Zorzi, “Designing intelligent energy harvesting communication systems,” IEEE Comm. Magazine , vol. 52, pp. 210–216, January 2014.
- 5[5] J. Singh, O. Dabeer, and U. Madhow, “On the limits of communication with low-precision analog-to-digital conversion at the receiver,” IEEE Trans. Communications , vol. 57, no. 12, pp. 3629–3639, December 2009.
- 6[6] S. Krone and G. Fettweis, “Fading channels with 1-bit output quantization: Optimal modulation, ergodic capacity and outage probability,” in Proc. IEEE Inf. Theory Workshop (ITW) , Aug. 2010, pp. 1–5.
- 7[7] A. Mezghani and J. Nossek, “Analysis of Rayleigh-fading channels with 1-bit quantized output,” IEEE Int. Sym. Inf. Theory , pp. 260–264, Jul. 2008.
- 8[8] ——, “On ultra-wideband MIMO systems with 1-bit quantized outputs: Performance analysis and input optimization,” IEEE Int. Sym. Inf. Theory , pp. 1286–1289, Jun. 2007.
