Precoding for the Sparsely Spread MC-CDMA Downlink with Discrete-Alphabet Inputs
Min Li, Chunshan Liu, and Stephen V. Hanly

TL;DR
This paper introduces a power-efficient precoding algorithm for sparse MC-CDMA downlink systems with discrete inputs, reducing complexity while maintaining performance through sparse signatures.
Contribution
It proposes a novel precoding method leveraging sparse signatures for downlink MC-CDMA, improving power efficiency and reducing computational complexity compared to traditional methods.
Findings
Significance of signature sparsity for reducing precoding complexity.
Power reduction gains over zero-forcing precoding.
Sparse MC-CDMA achieves similar throughput with lower complexity.
Abstract
Sparse signatures have been proposed for the CDMA uplink to reduce multi-user detection complexity, but they have not yet been fully exploited for its downlink counterpart. In this work, we propose a Multi-Carrier CDMA (MC-CDMA) downlink communication, where regular sparse signatures are deployed in the frequency domain. Taking the symbol detection point of view, we formulate a problem appropriate for the downlink with discrete alphabets as inputs. The solution to the problem provides a power-efficient precoding algorithm for the base station, subject to minimum symbol error probability (SEP) requirements at the mobile stations. In the algorithm, signature sparsity is shown to be crucial for reducing precoding complexity. Numerical results confirm system-load-dependent power reduction gain from the proposed precoding over the zero-forcing precoding and the regularized zero-forcing…
| 1) Initialize: dual variables , ; |
| 2) Repeat for iterations until convergence criterion is met: |
| 2.1) For : |
| PSN unit computes its primal variables using (V-A), |
| and broadcasts the updated values to its neighboring OSN units; |
| 2.2) For : |
| OSN unit computes its dual variables |
| using (V-A) and broadcasts the updated values to its neighboring |
| PSN units. |
| Schemes \ Operations | “” / iteration | “” / iteration | complexity ( iterations) | |
|---|---|---|---|---|
| 4-QAM | compute | |||
| compute | ||||
| 16-QAM | compute | |||
| compute | ||||
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Precoding for the Sparsely Spread MC-CDMA Downlink with Discrete-Alphabet Inputs
Min Li, , Chunshan Liu, , and Stephen V. Hanly Copyright (c) 2015 IEEE. Personal use of this material is permitted. However, permission to use this material for any other purposes must be obtained from the IEEE by sending a request to [email protected]. This work was presented in part in Proceedings of the IEEE International Conference on Communications (ICC), Sydney, Australia, June 2014. This research was supported in part by the Australian Research Council under grant DP130101760, and by the CSIRO Macquarie University Chair in Wireless Communications. This Chair has been established with funding provided by the Science and Industry Endowment Fund. The authors are with the Department of Engineering, Macquarie University, Macquarie Park, NSW 2113, Australia (e-mail: {min.li, chunshan.liu, stephen.hanly}@mq.edu.au).
Abstract
Sparse signatures have been proposed for the CDMA uplink to reduce multi-user detection complexity, but they have not yet been fully exploited for its downlink counterpart. In this work, we propose a Multi-Carrier CDMA (MC-CDMA) downlink communication, where regular sparse signatures are deployed in the frequency domain. Taking the symbol detection point of view, we formulate a problem appropriate for the downlink with discrete alphabets as inputs. The solution to the problem provides a power-efficient precoding algorithm for the base station, subject to minimum symbol error probability (SEP) requirements at the mobile stations. In the algorithm, signature sparsity is shown to be crucial for reducing precoding complexity. Numerical results confirm system-load-dependent power reduction gain from the proposed precoding over the zero-forcing precoding and the regularized zero-forcing precoding with optimized regularization parameter under the same SEP targets. For a fixed system load, it is also demonstrated that sparse MC-CDMA with a proper choice of sparsity level attains almost the same power efficiency and link throughput as that of dense MC-CDMA yet with reduced precoding complexity, thanks to the sparse signatures.
Index Terms:
CDMA, discrete alphabets, MC-CDMA, power efficiency, precoding, sparse signature, symbol error probability.
I Introduction
I-A Motivations and Contributions
Multi-Carrier Code Division Multiple Access (MC-CDMA) is a multi-access scheme based on the Orthogonal Frequency Division Multiplexing (OFDM) method. Since its invention, MC-CDMA has attracted broad interest, see, e.g., [1, 2, 3, 4] and the references therein. MC-CDMA naturally integrates CDMA’s flexible multiuser access with interference suppression capability and the advantages of multicarrier OFDM, including robustness against frequency-selective fading. Therefore, it has the potential to be one of the candidates to support massive access and provide reliable data communication and better coverage for future-generation wireless systems.
As in all CDMA systems, MC-CDMA may experience severe multi-access interference due to the loss of user orthogonality, which may occur, particularly, in frequency-selective channel environments. For such systems, optimal detection entails an exponential number of hypothesis testings about data symbols of all users and thus could be computationally demanding, time-consuming and even infeasible in a large system with conventional dense signature design. To circumvent the complexity issue, sparse signatures, whose fraction of non-zero entries is small, have been introduced and exploited for the CDMA uplink multi-user detection [5, 6, 7, 8, 9, 10]. In particular, the belief-propagation algorithm has been proposed for such a system, an algorithm that has a natural implementation using parallel computation units, and one that is fast and provably optimal for different ensembles of sparse signatures in the large system limits [5, 6, 7, 8]. Inspired by the belief-propagation algorithm, references [9] and [10] have developed reduced-complexity soft-in soft-out (SISO) and Turbo iterative multiuser detection algorithms for the sparse CDMA uplink and a sparse-signature OFDM uplink, respectively.
In this work, we formulate a different problem, appropriate for the MC-CDMA downlink counterpart, from the symbol detection point of view. The solution to the problem provides a power-efficient precoding algorithm for the base station (BS). The precoding is implemented at the BS, allowing the mobile stations (MSs) to use simple conventional single-user matched filters and standard single-user symbol detection. This hence simplifies the implementation of the receiver, as compared to the conventional MC-CDMA downlink transceiver design, where multi-user interference is normally mitigated by a frequency-domain equalization at the receiver [2]. Moreover, the proposed algorithm optimizes the signals transmitted on different subcarriers so that they can be constructively combined at each receiver, leading to a power-efficient precoding. The use of random sparse signatures was suggested for the MC-CDMA downlink in [11] to allow low-complexity iterative multiuser detection at each MS. But here we show that sparsity can be exploited to reduce the complexity of the precoding, as compared to MC-CDMA with dense signatures. In addition, using sparse signatures simplifies channel measurement, since each MS only needs to estimate channels for a small number of subcarriers it occupies.
The contributions of this paper are summarized as follows:
- •
We introduce the MC-CDMA downlink communication with regular sparse signatures, where each MS has access to an equal number of subcarriers and each subcarrier has (roughly) equal load. We also consider a bipartite graph representation for the system studied and use it to facilitate algorithm design and complexity analysis.
- •
Assume that data symbols intended for MSs are drawn from discrete alphabets. We take the symbol detection point of view and introduce the minimum Symbol Error Probability (SEP) as a Quality of Service (QoS) metric for each MS. We formulate the precoding problem as a transmit power optimization problem subject to minimum SEP requirements at MSs. We translate the SEP targets into constraint regions on noiseless received signal components at MSs and characterize them via a conservative approximation to make the problem tractable. Detailed formulation procedures are provided for systems with both standard 4/16-QAM constellations and Tomlinson-Harashima replica points.
- •
We develop a precoding algorithm that accommodates parallel computation units via the dual-decomposition theory. Aided by the graph representation of the system, the complexity of the algorithm is characterized in terms of the number of message passings between computation units and the number of additions and multiplications for precoding calculation. Signature sparsity is shown to play a vital role in reducing precoding complexity.
- •
We demonstrate that the proposed optimized precoding generally outperforms the conventional zero-forcing (ZF) and the optimized Regularized ZF (RZF) precodings in terms of power efficiency under the same SEP requirements. The exact gain depends on the system load and it is very significant for a fully loaded system. We also demonstrate that, for a fixed system load, sparse MC-CDMA, e.g., with a proper relatively small number of subcarriers allocated to each MS, attains almost the same power efficiency and link throughput as dense MC-CDMA under our proposed precoding scheme. This important observation, in conjunction with the fact that sparsity reduces precoding complexity, promotes the practicality of sparse MC-CDMA.
I-B Other Related Work
Precoding is a relatively mature concept in multi-antenna communication systems, enabling multiuser multiplexing in the spatial domain, see, e.g., [12, 13, 14, 15, 16, 17, 18, 19, 20]. It has to balance between the two conflicting interests of maximizing the useful signal power at the intended user and minimizing interference leakage towards non-intended users. The same concept can be applied to other systems such as direct-sequence CDMA or MC-CDMA, where multiplexing takes place in the time- or frequency-domain, and multi-access interference, if it arises, has to be dealt with [21, 22, 23, 24, 25].
Existing precoding techniques can be divided into two categories, linear precoding and non-linear precoding, where both require channel state information while the latter requires additional symbol-based processing. Within the first category, matched filtering, ZF and RZF [12] are three commonly known precodings that maintain different levels of balance between the two conflicting goals. Other power-aware linear precodings of general form have also been proposed in the literature subject to different QoS metrics, such as signal-to-noise-plus-interference ratio [20].
Compared with linear precoding, non-linear precoding may offer higher power efficiency and transmission rate, but the gain comes at the cost of incorporating more sophisticated signal processing [13, 15, 16, 17, 23, 24, 25]. In the multi-antenna broadcasting setup, capacity-achieving non-linear Dirty-Paper Coding (DPC) entails a successive pre-cancelation of known intra-user interference at the BS. The encoding of data relies on codewords of infinite length and involves a high-dimensional sphere-search algorithm, which renders DPC unattractive in practical systems. Tomlinson-Harashima precoding (THP) is a simplified version of DPC, where the codebook is comprised of periodic extension of standard constellations (replica points) in the two-dimensional space and a transmit modulo-operation is introduced in the interference pre-cancelation process in order to reduce transmit power. Built on ZF or RZF, reference [13] generalizes the single-user-based symbol extension idea of THP and introduces a joint perturbation of user symbol vector to further reduce transmit power.
The optimized precoding proposed in this work belongs to the second category. As in existing works, power consumption is one of the primary concerns in our optimized precoding. However, the optimization criterion, minimum SEP constraint, has not been considered before, except our own works [26, 27, 28] in the MIMO (or distributed MIMO) setup. This criterion appears to be natural when we consider a system with discrete alphabets as inputs. In addition, in our formulation, we fix the information-bearing alphabets at the BS but allow a certain relaxation of received signals at MSs through precoding, as long as they reside in detection-favorable regions and the SEP targets are met. This is distinguished from related works [16, 18, 19, 25], where non-linear relaxation of input alphabets has been adopted. In [16, 18], the relaxation is required to maintain the minimum signalling distance, while in [25, 19], the relaxation is to ensure the corresponding symbol-energy-to-noise ratio is above a certain threshold [25, 19]. However, in all these works, no explicit SEP targets are imposed at MSs.
Notation: Boldface uppercase and lowercase letters denote matrices and vectors, respectively, e.g., is a matrix and is a vector; is an identity matrix; for integers , , is the discrete interval between and , and , is the collection of th components of vector ; denotes the matrix transpose, while denotes the conjugate transpose; notation denotes the expectation operation on random variable , and denotes a floor function of real number ; and denote the real and imaginary part of a complex number, respectively; finally, is the standard norm of .
II System Model
II-A Signaling Model
We consider a downlink communication, where a single-antenna BS is simultaneously serving single-antenna MSs via MC-CDMA. Specifically, data symbols intended for MSs are all drawn from discrete alphabet sets, e.g., -QAM constellations, common in practical deployments. The downlink communication takes place over a set of orthogonal subcarriers where we assume and thus the load . In the conventional MC-CDMA, the data symbol intended for a MS is transmitted over all parallel subcarriers where each is encoded with a binary phase-offset [1]. Here, however, information associated with each MS is assumed to be spread onto only a small subset of the available subcarriers, which leads to the sparse MC-CDMA as originally studied by [5, 6, 7, 8] for the uplink.
Let be the signature for MS , where normalization factor corresponds to the total number of subcarriers allocated to MS . In the signature, components are i.i.d. drawn from a distribution with zero-mean and unit-variance if MS has access to subcarrier , and otherwise. The collection of all signatures corresponds to a sparse signature matrix , which is perfectly known at the BS.
The transceiver architecture for the downlink transmission is depicted in Fig. 1 and is elaborated as follows. Let be the transmitted data symbol vector, where component denotes the symbol intended for MS and is drawn from a discrete finite-alphabet set . The transmission takes place by first forming appropriate frequency-domain signals and then converting them into time-domain signals by the inverse fast Fourier transform (IFFT) at the BS. Specifically, symbol vector is passed through a precoder and mapped to coded vector . An IFFT is then applied over the coded vector in order to generate time-domain signal vector that is subsequently transmitted over the wireless channel.
Upon observing channel output, each MS first performs an FFT and produces the frequency domain signal as
[TABLE]
where notation “” denotes the Hadamard product; vector is the collection of frequency-domain channel gains from BS to MS ; vector is a circularly symmetric complex Gaussian noise with . Despreading is then performed at MS based on its own signature , followed by a simple single-user detection. The corresponding output signal after despreading is given by
[TABLE]
where is the th component of , and each equivalent channel noise is circularly symmetric complex Gaussian with zero mean and variance . Collecting all outputs at MSs, we obtain the equivalent system input-output relationship as
[TABLE]
It is straightforward to observe that in matrix , as long as , and thus row maintains the same level of sparsity as the corresponding signature . This also means the BS only needs to know the small number of s for which for the purpose of precoding.
II-B Graph Representation
Given the matrix from (19), we can alternatively construct a bipartite graph representation of the sparse MC-CDMA system. Assume that each symbol in the graph is represented by a precoded symbol node (PSN), and each output is represented by a output symbol node (OSN). PSNs will be drawn as circles and OSNs will be drawn as squares in the graph. PSN is connected with OSN only if and is the weight associated with the edge. Fig. 2 depicts an instance of the graph for , where each OSN is connected with two PSNs. We use to denote the collection of OSNs connected to and define the node degree of as the cardinality . Similarly, we use to denote the collection of PSNs connected to and define the node degree of as the cardinality . This graph representation introduced will facilitate the description of the precoding algorithm and the corresponding complexity analysis in Section V.
II-C Sparse Signature Ensemble
In the signature matrix, we assume that the non-zero elements are i.i.d. drawn from a uniform distribution on . It is observed that generating non-zero elements according to other distributions, e.g., Gaussian distribution, has little impact on the averaged system performance. Hence, we stick to the binary uniform distribution, which leads to a binary phase-offset for the symbol transmitted as in [6, 7].
Depending on the number of subcarriers allocated across MSs and the load per subcarrier, we have three common signature ensembles suggested for the uplink [7]: i) irregular ensemble, where Poisson-distributed number of subcarriers are allocated across MSs and the load per subcarrier is also Poisson-distributed; ii) semi-regular ensemble, where each MS is allocated a fixed positive integer number of subcarriers and the load per subcarrier is Poisson-distributed; and iii) regular ensemble, where the number of subcarriers allocated for each MS and the loading per subcarrier both take fixed positive integer values. In particular, [7] advocates the regular ensemble as it amongst others prevents the systematic inefficiency due to leaving some subcarriers unoccupied by any of MSs.
In this work, we follow [7] and deploy the regular-type ensemble to ensure that the system enjoys full utilization of resources and provides user fairness. Specifically, when the system is fully-loaded (), a perfectly regular signature matrix is randomly generated in the sense that each MS is allocated subcarriers, and each subcarrier is accessed by exactly MSs. When the system is under-loaded (), a nearly regular signature matrix is randomly generated such that each MS is allocated subcarriers, and each subcarrier is of almost equal load, namely, accessed by either or MSs.
III Optimized Precoding with SEP Targets
For notational convenience, we define as the noiseless received component at MS ( is the th row of matrix ) with real part and imaginary part ; similarly, denote the real and imaginary parts of data symbol and noise as and , and and , respectively. In addition, define as the fixed noise variance per signal dimension.
Assuming that all data symbols transmitted are selected from discrete alphabets, we take a symbol detection point of view and impose minimum Symbol Error Probabilities (SEPs) as the user QoS constraints. Specifically, let denote the decision region associated with data symbol intended for MS and denote the SEP target. Detection error happens when the output signal lies outside decision region . According to the SEP requirement, the probability of error events should be no greater than the target, i.e.,
[TABLE]
A question we then ask is: How do we design a precoder that efficiently maps a symbol vector into a precoded vector such that the SEP requirements at MSs can be met?
Following the conventional zero-forcing (ZF) approach, one could form according to
[TABLE]
which inverts the channel matrix and forces noiseless component to sit exactly at the constellation point . In order to meet a given SEP target, , data symbol has to be chosen from a discrete alphabet set whose minimum distance between any two neighboring points (denoted by ) is above a certain threshold.
For instance, consider a system with -QAM modulation whose standard constellation is represented by
[TABLE]
with . We need to scale the constellation points (increasing the minimum distance , but also the transmit power) in order to meet the SEP target, . Considering the 4-QAM constellation (see Fig. 3), the scaling factor, , to use for MS , must satisfy
[TABLE]
as implied by (20). Thus the minimum scaling factor under the conventional ZF approach for the 4-QAM system is given by
[TABLE]
where denotes the inverse of the standard -function [29]. When (see Fig. 4 for 16-QAM), the standard constellation should be scaled so that
[TABLE]
as implied by (20), considering the dominant scenario in which one of the inner most points is transmitted. Thus the minimum scaling factor under the conventional ZF approach is given by
[TABLE]
In general, however, we do not have to zero-force , and in fact, it is sufficient to ensure falls into a region that favours correct symbol detection. This relaxation introduces room to optimize the choice of , leading to the following transmit power minimization problem:
[TABLE]
IV Sparse MC-CDMA with Standard -QAM Constellations
In this section, we show how to translate the set of SEP targets in (30) into constraints on noiseless output components at MSs. In particular, we begin with the 4-QAM signaling case and then generalize to the 16-QAM signaling case. A similar approach can be applied to systems with higher-order QAM constellations.
IV-A Translate SEP Targets to Constraints on Noiseless Received Signal Components
IV-A1 4-QAM
Assume that each is drawn from the 4-QAM constellation set as shown in Fig. 3, where the green dashed lines partition the complex plane into four symmetric decision regions each occupying an open quarter plane. Any received signals falling outside the correct region lead to detection error. Thus, the SEP requirement (20) becomes
[TABLE]
where the tuple depends on the data transmitted and equals:
- (i)
for ; 2. (ii)
for ; 3. (iii)
for ; 4. (iv)
for .
Given symbol and a target , one can determine the precise constraint region on noiseless output at MS from inequality (IV-A1). In particular, the boundary of the region is determined by equality in (IV-A1). For example, when , three points on the boundary of are identified by considering combinations of :
- (i)
: , ; 2. (ii)
: , ; 3. (iii)
:
In principle, the curved-shape boundary can be determined by traversing all possible combinations of and . The constraint region then includes all the points on and within the boundary (see the red curve in Fig. 3). Note that point is exactly a scaled constellation point. Also note that this region generally implies non-linear constraints on input signal , which makes the optimization problem less tractable. Alternatively, we can find a polytype contained in , i.e., we conservatively approximate the region using the area bounded by line segments between a finite number of points on or within boundary. A simple approximation for is given by: and , which lead to linear constraints on input signal . This region with conservative approximation is denoted as , see the open shadow area in Fig. 3.
By the same approach, relaxed constraint regions associated with the other constellation points can be derived and characterized as:
- (i)
; 2. (ii)
; 3. (iii)
,
with definition . Note that the exact areas of these regions only depend on the SEP targets.
IV-A2 16-QAM
We now turn to the case where is drawn from the 16-QAM constellation set as shown in Fig. 4. The constraint region on can be characterized based on procedures similar to those for the 4-QAM case. But calculation requires some care, since the decision regions for inner points and outer points are of different shapes (see the regions partitioned by the green dashed lines in Fig. 4) and the exact areas of these regions depend on the scaling factor . For the sake of conciseness, the derivation of the constraint region is deferred to Appendix A. Taking symbols as examples, we plot the resulting regions in Fig. 4. Again, one can approximate these regions with a polytype for each, leading to linear constraints on input signal .
In general, the exact areas of depend on the scaling factor . It is clear that, with minimum as defined by (26), constraint region corresponds to a strict equality constraint on , i.e., , for the center points ; for the side points, e.g., , shrinks to a line: and , which implies doing a zero-forcing for the imaginary part while having a relaxed constraint on the real part; and for the corner points, e.g., , becomes: , where we recall that .
To meet a given SEP target for MS , one could certainly adopt a scaling factor larger than the minimum for the transmission. But such a choice may affect the power efficiency of the system. Fig. 5 plots two instances of constraint regions for when the constellation is scaled up with (blue curves) and (red curves). It can be seen that with a larger above , when a center constellation point is transmitted, the constraint region is relaxed from a single point to a circle-type region centered on the symbol. Potential benefits can be accrued from the resulting enlarged feasible region. However, when the corner points are transmitted, the corresponding constraint regions always shrink as the constellation is scaled up. In this case, power efficiency loss may be induced because of the reduced feasible optimization space. When one of the side points is transmitted, it is unclear how the performance reacts, as the constraint region with larger partially overlaps with that for a smaller . Nevertheless our experiments have indicated that when the transmitted data symbols are randomly and uniformly generated, scaling up the constellation with brings little further power saving. Hence we will use the minimum scaling, , for the standard 16-QAM in what follows.
IV-B Problem Reformulation with Conservative Approximation
With the conservative approximation, the SEP constraints in (30) can be translated into a set of linear inequality/equality constraints on vector . We now present the resulting optimization problem.
For ease of exposition, we stack the real and the imaginary parts of each into a real vector , i.e., . For the -QAM signaling, the real and imaginary parts of coded signal are associated with different inequalities. For the -QAM signaling, with , the real and the imaginary parts of are associated with either one equality or an inequality depending on the data symbols; when , at most two inequalities are introduced for either the real or the imaginary part. Therefore, the optimization problem of (30) can be generally reformulated as follows:
[TABLE]
where for the 4-QAM system, we have , , and , while for the 16-QAM system, we have , , and . The precise definitions of the matrices and the vectors involved depend on constraint regions , , as defined in Section IV-A1 and Section IV-A2 for the 4-QAM and 16-QAM signalling, respectively. Whether or not the inequality/equality constraints are active will depend on the transmitted data associated with the constellation. It is possible that some of the constraints may not be active, in which case the corresponding row entries of / and / are padded with zeros. In addition, matrices / enjoy the same sparsity as the system matrix . In particular, when indexing matrices /, the index pair corresponds to a particular MS-subcarrier pair, and thus whenever the particular subcarrier is not allocated to the MS.
As an illustrative example, we consider a 16-QAM system with subcarriers and MSs. Each MS is allocated subcarriers: The first MS is allocated subcarriers and , while the second MS is allocated subcarriers and . The effective channel matrix is assumed to be
[TABLE]
The data symbol intended for MS and is and , respectively. Both MSs require the same SEP target and thus employ the same scaling factor . The noiseless received components at MSs are calculated as
[TABLE]
Therefore, according to the constraint regions constructed (see Section IV-A2 and also Appendix A), we have:
[TABLE]
which, by simple algebra, can be further translated into a set of general linear constraints as in (35) with , ,
[TABLE]
As we can observe, matrix inherits sparsity from the effective channel matrix ; transmission of and invokes four and two inequality constraints, respectively, and there is no strict equality constraint in this example.
V Precoding Algorithm Design with Parallel Computation Units
Problem is strictly convex and can be solved via a number of standard algorithms, such as the interior-point algorithm [30]. Most of these algorithms are designed for centralized implementation and could be efficient enough for a small-scale problem. However, as the problem dimension increases, the computational complexity may be prohibitive. More importantly, the sparsity inherent to the problem may not be well exploited in standard solvers. Given these observations, we are interested in developing a precoding algorithm that leverages the sparsity to reduce complexity and is suitable for solving large-dimension problems using parallel computation units. These units may correspond to parallel processor cores (threads) at the BS computer [31] or parallel processors at the cloud to which the BS is connected [32].
V-A Algorithm Design
We now detail the precoding algorithm with a focus on the general problem as if all inequalities and equalities in (35) were activated. The key technique is the dual decomposition approach, see, e.g., [33]. Recalling the graph representation introduced (see Fig. 2), we can map all PSNs and OSNs to parallel computation units.
We start with forming the Lagrangian function:
[TABLE]
where and are Lagrangian multipliers (dual variables), among which each pair of primal variables is associated with PSN computation unit (), while each tuple of dual variables is associated with OSN computation unit (). The dual problem is then defined as
[TABLE]
with being the dual objective function.
Then one can solve the original problem by finding the optimal dual variables in an iterative manner. Specifically, at the th iteration, for fixed dual variables and , to attain the minimization of Lagrangian, one sets the first-order derivative of the Lagrangian to zero, which leads to
[TABLE]
or more explicitly,
[TABLE]
where and denote the collection of indices of dual variables and that have interactions with , respectively, according to the graph . Note that PSN unit is in charge of computing the pair , . The corresponding dual function is given by:
[TABLE]
The dual variables are then updated by the OSN units in a parallel manner according to
[TABLE]
where and denote the collection of indices of primal variables that have interactions with and , respectively; notation denotes the projection onto the nonnegative orthant, with and
[TABLE]
This dual-variable updating rule offers faster convergence speed than the conventional gradient updating [33] as shown in [34]. The algorithm described is summarized in Table I.
V-B Complexity Analysis
To quantify the complexity of the algorithm, we distinguish the communication overhead and computational complexity.
In the algorithm, to update its primal variables via (V-A), PSN unit only needs to gather dual variables from its neighboring OSNs; therefore, the number of messages passed to PSN unit depends on the number of active dual variables and is at most for the 16-QAM, and for the 4-QAM. To update dual variables , OSN unit only needs to collect primal variables from its neighboring PSNs; therefore, the number of messages passed to OSN unit is at most for both the 4/16-QAM. Thus, with iterations, the total number of message-passings across computation units is and for the 4-QAM and 16-QAM system, respectively, where we have used the fact that with being the number of non-zeroes in each signature.
Table II summarizes the computational complexity for the algorithm proposed. To update each of its primal variables via (V-A), PSN unit needs at most additions and multiplications for the 16-QAM, and additions and multiplications for the 4-QAM. On the other hand, to update each of its dual variables via (V-A), OSN unit needs at most additions and multiplications for both the 4/16-QAM. Overall, the algorithm involves additions and multiplications for the 4-QAM system, and involves additions and multiplications for the 16-QAM system, where is the number of iterations. It is clear that the more sparse the signatures are, the less overall communication overhead and computational complexity are required to generate the precoded symbols in proposed algorithm.
For comparison, we note that the conventional ZF precoding of (21) has a computational complexity of to compute precoding matrix and additional complexity of to generate each precoded symbol vector via . Consider a transmission frame that consists of 4-QAM symbols intended for each MS and assume channel remains unchanged during the frame. The ratio of the complexity of the proposed scheme to that of the ZF approach is thus quantified by . It is clear that the smaller , the smaller will be. In particular, for a fully loaded system with and sufficiently large . As an example with , and iterations, ratio , which indicates the proposed scheme has approximately times the complexity of ZF precoding. Despite the increase in complexity, the proposed scheme is able to provide enormous transmit power reduction over ZF precoding and is more robust against imperfect channel state estimation, as will be shown later in Section VII.
VI Sparse MC-CDMA with Replica Constellations
In the previous sections, we have mainly focused on the system with standard QAM constellations for which convex constraints on the precoded vector are constructed according to the SEP targets. In this section, we assume the system adopts replica constellations, where each is the periodic extension of a regular QAM constellation along the real and imaginary axes. We propose an optimized Tomlinson-Harashima Precoding (THP) under the SEP constraints by applying a similar approach we have used for the system with standard QAM constellations.
VI-A THP-Basics
We first briefly review some basic concepts related to THP, see, e.g., [35].
In general, the replica constellation point can be represented as:
[TABLE]
where corresponds to a regular point in the scaled -QAM constellation under scaling (), and corresponds to an arbitrary integer pair, see Fig. 6 for a visual illustration when . It is noted that decision regions associated with all replica points are identical closed squares with side length .
The THP is normally done in a successive manner in which interference created by previous users’ transmissions is pre-cancelled to facilitate the transmission for the current user at each stage. The encoding is accommodated by the replica constellation and modulo-operation at the transmitter. Specifically, let the channel matrix be represented as as a result of QR factorization, where is a unitary matrix and is an upper triangular matrix. Then is a lower triangular matrix. The successive precoding operates as
[TABLE]
where is the replica point carrying information for MS and is the modulo operation operated on complex number with respect to basis and is defined as:
[TABLE]
with . The transmit signal is then formed by multiplying with , i.e., . In this way, at the receiver side, no MS experiences inter-user interference because of the pre-cancellation operations done at the BS. It is remarked that since THP is performed in a successive manner, different user orderings may lead to different performance. To find the optimal ordering, one needs to do an exhaustive search over all possible combinations, which is generally infeasible as goes large. In this work we simply adopt the suboptimal V-BLAST (VB) ordering [36].
VI-B Optimized THP
Under the SEP constraints, we can formulate a precoding optimization problem similar to . The idea is that instead of choosing a minimum scaling factor for the constellation and performing a zero-forcing THP, one can relax the constraints on the noiseless output components and introduces room for optimizing the input signals as we scale up the constellation.
In particular, with the replica constellation scaled up, the resulting constraint regions become boxes centered at each replica point as constructed and approximated similarly for the inner points of the 16-QAM constellation, see Fig. 6. The optimization problem is then formulated as:
[TABLE]
where matrices and are formed according to the constraints
[TABLE]
where parameter determines the size of the constraint box and is chosen to satisfy:
[TABLE]
and the set of information-carrying replica points is determined from the ZF-THP encoding procedure. Therefore, the precoding algorithm proposed in Section V can be applied here to calculate the optimized THP precoded vector.
VII Simulation Results
We now present numerical results to demonstrate the effectiveness of the precoding schemes proposed for the sparse MC-CDMA system.
VII-A Simulation Setup
In the simulation, the noise variance is set to unity. The total number of subcarriers is fixed with and the number of MSs is allowed to vary. Different MSs experience different frequency-selective fading channels. Specifically, the channel frequency response between the BS and MS is generated according to
[TABLE]
where represents the discrete-time channel response consisting of taps; components are modeled as independent zero-mean Gaussian random variables, whose individual variance equals with normalization factor chosen such that . In the simulation, is adopted for the channel generation. It is assumed that there is no inter-symbol-interference and inter-carrier-interference in the system.
Unless stated otherwise, for any fixed system configuration, we simulate transmission slots, under each of which random data and random channel are independently generated for each MS. In addition, we produce random regular signature matrix realizations as defined in Section II-C and thus every transmission slots share the same signature matrix. The system transmitted power consumption presented shortly is averaged over all transmission slots. For the precoding algorithm proposed, the calculation terminates at iteration if the normalized improvement of dual objective , where .
VII-B System with Standard Constellation
Fix SEP target for all MSs and vary . In the sparse MC-CDMA, different levels of sparsity, e.g., and , are considered. The case with , referred to as the dense MC-CDMA, is also considered for the purpose of comparison. In addition, we have also compared with two conventional precoding schemes including ZF and the optimized RZF in form of:
[TABLE]
where with denotes the transmitted data symbol for MS and is drawn from a scaled version of standard constellation by , is a identity matrix, and are two non-negative parameters to be optimized subject to the SEP constraints in (35). Note that the optimized RZF encompasses the conventional regularized ZF precoder [12] with and also the minimum-mean-square-error (MMSE) precoder [20] with and
Fig. 7-(a) and Fig. 7-(b) plot the transmit power consumption at the BS versus under different setups and precoding schemes for systems with 4-QAM and 16-QAM, respectively. Two important observations are made as follows.
First, power reduction from the proposed precoding over ZF and RZF is clearly evident for all load and sparsity combinations considered. In particular, for any fixed , the reduction increases as grows. For instance, when and , for the 4-QAM, we have dB and dB reduction compared with ZF and RZF, respectively, while for the 16-QAM, we have dB reduction compared with both schemes, noting that the optimized RZF solutions are degraded and coincide with the ZF solutions in this case. As , the effective channel matrix is increasingly likely to be poorly conditioned. Hence, the inefficiency of conventional schemes (in particular ZF) becomes pronounced. However, the precoding proposed is not sensitive to the conditional number of and always attains the best performance.
Second, for the sparse MC-CDMA system, there is a trend that a denser signature (a larger value of ) leads to a smaller power consumption needed. For instance, the system with consumes slightly more power than the system with to achieve the same SEP target under both the 4-QAM and 16-QAM systems. However, to attain comparable power efficiency to the dense MC-CDMA, the signatures can still be relatively sparse ( in our examples), yielding considerable reduction in precoding complexity. This observation also indicates that the sparse MC-CDMA system with proper choice of would attain almost the same link throughput as that of the dense MC-CDMA system under the same transmit power budget.
Fig. 8 plots the power consumption versus different SEPs with and , which further confirms the superiority of the proposed scheme as compared to baselines ZF and RZF under different SEP targets for both 4-QAM and 16-QAM systems.
VII-C System with Replica Constellation
We now consider the system with replica constellation, where system parameters and . All MSs request the same minimum SEP target . Under this SEP requirement, a uniform scaling factor across all MSs is chosen such that the power consumption is minimized for the proposed optimized THP. To perform the precoding optimization, we use an algorithm similar to that for systems with standard constellations. Therefore, signature sparsity is leveraged to reduce precoding complexity, as it was before.
Fig. 9-(a) and Fig. 9-(b) plot the transmit power consumption versus under both the ZF- and optimized THP schemes for the system with 4-QAM and 16-QAM replica points, respectively. The performance of the proposed scheme under standard constellations is also included here for the purpose of comparison. It is seen that the optimized THP is able to provide significant power reduction over the proposed scheme under standard constellations. This further reduction, albeit appealing, does not come for free and has to be paid with more sophisticated encoding and decoding operations in THP schemes as described in Section VI. It is also observed that the optimized THP generally outperforms ZF-THP in power efficiency and the exact gain depends on the system load. In particular, the former provides roughly dB power reduction over the latter for 4-QAM replica and roughly dB reduction for 16-QAM replica in a full-load system. The ZF-THP is already very power-efficient, yet the proposed THP is seen here to provide further reduction in transmit power.
VII-D Bit Error Rate (BER) Results and Impact of Imperfect Channel Estimation
So far, we have demonstrated the power efficiency of the proposed precoding under different uncoded SEP targets. We now evaluate the impact of the proposed scheme on another practically important performance metric in terms of uncoded bit error rate (BER). The BER is calculated and averaged over realizations of transmissions. Fig. 10 (a) and (b) depict the average BER (at a typical MS) as a function of power consumption for a system with standard/replica 4-QAM and 16-QAM, respectively. Consistent with the previous observations, the proposed optimized precoding significantly outperforms ZF precoding in terms of power efficiency to attain the same BER target under standard QAM constellations. The optimized THP is more power-efficient than the ZF-THP, and both of them generally outperform the optimized precoding under standard constellations but at the cost of increase complexity as explained before.
Next, we evaluate the impact of channel estimation error (i.e., channel uncertainty) on the performance of the schemes considered. Let denote the estimated sparse channel matrix. Each nonzero entry of is a noisy version of the perfect of . To model the uncertainty, we assume is generated according to: , where represents the complex Gaussian estimation error with variance and are independently and identically distributed. The average normalized channel uncertainty is then defined as in dB. For all schemes evaluated, the SEP target is set to so that the corresponding BER is on the order of , if perfect channel state information is available. Fig. 11 (a) and (b) depict the real BER (at a typical MS) versus different levels of channel uncertainty for a sparse MC-CDMA system with standard/replica 4-QAM and 16-QAM signaling, respectively. It can be seen that as the channel uncertainty increases, the real BER of both the ZF approach and the proposed precoding scheme degrade. However, the proposed scheme always outperforms its ZF counterpart and exhibits much better robustness against imperfect channel estimation, particularly for a system with standard constellations. The proposed scheme is thus not only more power-efficient but also more robust against channel uncertainty.
VIII Conclusions
In this work, we have introduced a sparse MC-CDMA downlink model and proposed a power-efficient precoding method under minimum symbol error probability (SEP) requirements at the MSs. It has been shown that when the system load is high, the proposed precoder significantly reduces the transmission power over (regularized) zero-forcing based precoders under the same SEP targets. It has also been shown that using the proposed precoder, the sparse MC-CDMA system with a proper choice of sparsity level attains almost the same power efficiency and link throughput as that of the dense MC-CDMA, but with lower complexity. These features, along with the fact that channel measurements may be simplified with sparse signatures, add to the practical appeal of the sparse MC-CDMA and make it a valuable candidate for future-generation wireless communication systems.
Appendix A Critical Points on the Boundary of for the 16-QAM Signaling
For the 16-QAM signaling, the SEP target (20) indicates that
[TABLE]
where and relate to the decision regions and parameterize the upper/lower bounds for the integrals. Table III specifies these parameters for different constellation points.
Given symbol and a target , one can determine the precise constraint region on noiseless output at MS from inequality (A). In particular, the boundary of the region is determined by the equality in (A). In the following, we explain how to determine a set of critical points on the boundary of the corresponding regions for three representative constellation points .
For the corner constellation point , the critical boundary points can be determined according to three different combinations of :
- (i)
: , ; 2. (ii)
: , ; 3. (iii)
:
,
where denotes the inverse of the standard -function.
For the center constellation point , the critical boundary points can be found by examining the following combinations of :
- (i)
: , ; 2. (ii)
: , ; 3. (iii)
: , ,
where , parameters and are chosen to satisfy:
[TABLE]
For the side constellation point , the critical boundary points can be found similarly by examining the following combinations of :
- (i)
: , ; 2. (ii)
:
, ; 3. (iii)
: , ,
where parameter is chosen to satisfy:
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] N. Yee, J. Linnartz, and G. Fettweis, “Multi-carrier CDMA in indoor wireless radio networks,” in Proc. of IEEE Personal Indoor and Mobile Radio Commun. , 1993.
- 2[2] S. Hara and R. Prasad, “Overview of multicarrier CDMA,” IEEE Commun. Mag. , vol. 35, no. 12, pp. 126–133, 1997.
- 3[3] A. M. Tulino, L. Li, and S. Verdú, “Spectral efficiency of multicarrier CDMA,” IEEE Trans. Inf. Theory , vol. 51, no. 2, pp. 479–505, 2005.
- 4[4] K. Fazel and S. Kaiser, Multi-carrier and spread spectrum systems: from OFDM and MC-CDMA to LTE and Wi MAX . Wiley, 2008.
- 5[5] M. Yoshida and T. Tanaka, “Analysis of sparsely-spread CDMA via statistical mechanics,” in Proc. IEEE Int. Symp. Inf. Theory , 2006.
- 6[6] A. Montanari and D. Tse, “Analysis of belief propagation for non-linear problems: The example of CDMA (or: How to prove Tanaka’s formula),” in Proc. of Inf. Theory and Appl. Workshopp , 2006.
- 7[7] J. Raymond and D. Saad, “Sparsely spread CDMA–statistical mechanics-based analysis,” J. Phys. A: Math. Theor. , vol. 40, no. 41, pp. 12 315–12 333, 2007.
- 8[8] D. Guo and C. Wang, “Multiuser detection of sparsely spread CDMA,” IEEE J. Sel. Areas Commun. , vol. 26, no. 3, pp. 421–431, 2008.
