A Supervised-Learning Detector for Multihop Distributed Reception   Systems

Seonho Kim; Song-Nam Hong

arXiv:1812.03786·cs.IT·December 11, 2018

A Supervised-Learning Detector for Multihop Distributed Reception Systems

Seonho Kim, Song-Nam Hong

PDF

Open Access

TL;DR

This paper introduces a supervised-learning detector for multihop distributed uplink systems with one-bit ADCs, leveraging a Bernoulli-like model to directly use training data for detection, outperforming existing models with lower complexity.

Contribution

The paper proposes a novel Bernoulli-like supervised-learning detector that directly uses training data without estimating complex channel functions, reducing complexity and improving performance.

Findings

01

Outperforms Gaussian-based SL detectors in one-bit quantized systems.

02

Reduces detection complexity using fast kNN algorithm.

03

Achieves attractive performance with lower computational cost.

Abstract

We consider a multihop distributed uplink reception system in which $K$ users transmit independent messages to one data center of $N_{r} \geq K$ receive antennas, with the aid of multihop intermediate relays. In particular, each antenna of the data center is equipped with one-bit analog-to-digital converts (ADCs) for the sake of power-efficiency. In this system, it is extremely challenging to develop a low-complexity detector due to the non-linearity of an end-to-end channel transfer function (created by relays' operations and one-bit ADCs). Furthermore, there is no efficient way to estimate such complex function with a limited number of training data. Motivated by this, we propose a supervised-learning (SL) detector by introducing a novel Bernoulli-like model in which training data is directly used to design a detector rather than estimating a channel transfer function. It is shown…

Equations29

r = \mbox s i g n (Φ (\tilde{x}) + \tilde{z}) \in {- 1, 1}^{N},

r = \mbox s i g n (Φ (\tilde{x}) + \tilde{z}) \in {- 1, 1}^{N},

D = {\tilde{r}_{i}^{c} \in {- 1, 1}^{N} : c \in M, i = 1, ..., T} .

D = {\tilde{r}_{i}^{c} \in {- 1, 1}^{N} : c \in M, i = 1, ..., T} .

Ψ_{D} (r) = c \in M,

Ψ_{D} (r) = c \in M,

\hat{\boldmath μ}_{c}

\hat{\boldmath μ}_{c}

\hat{Σ}_{c}

Ψ_{D} (r) = c \in M argmin (r - \hat{\boldmath μ}_{c})^{T} \hat{Σ}_{c}^{- 1} (r - \hat{\boldmath μ}_{c}) .

Ψ_{D} (r) = c \in M argmin (r - \hat{\boldmath μ}_{c})^{T} \hat{Σ}_{c}^{- 1} (r - \hat{\boldmath μ}_{c}) .

P (r ∣ c, \boldmath θ_{c}) = i = 1 \prod N ϵ_{c, i}^{1_{{r_{i} \neq = μ_{c, i}}}} (1 - ϵ_{c, i})^{1_{{r_{i} = μ_{c, i}}}},

P (r ∣ c, \boldmath θ_{c}) = i = 1 \prod N ϵ_{c, i}^{1_{{r_{i} \neq = μ_{c, i}}}} (1 - ϵ_{c, i})^{1_{{r_{i} = μ_{c, i}}}},

(\hat{\boldmath μ}_{c}, \hat{\boldmath ϵ}_{c}) = (\boldmath μ_{c}, \boldmath ϵ_{c}) argmax t = 1 \prod T P (\tilde{r}_{t}^{c} ∣ \boldmath μ_{c}, \boldmath ϵ_{c}) .

(\hat{\boldmath μ}_{c}, \hat{\boldmath ϵ}_{c}) = (\boldmath μ_{c}, \boldmath ϵ_{c}) argmax t = 1 \prod T P (\tilde{r}_{t}^{c} ∣ \boldmath μ_{c}, \boldmath ϵ_{c}) .

(\hat{\boldmath μ}_{c}, \hat{\boldmath ϵ}_{c}) = (\boldmath μ_{c}, \boldmath ϵ_{c}) argmax i = 1 \prod N t = 1 \prod T ϵ_{c, i}^{1_{{\tilde{r}_{t, i}^{c} \neq = μ_{c, i}}}} (1 - ϵ_{c, i})^{1_{{\tilde{r}_{t, i}^{c} = μ_{c, i}}}} .

(\hat{\boldmath μ}_{c}, \hat{\boldmath ϵ}_{c}) = (\boldmath μ_{c}, \boldmath ϵ_{c}) argmax i = 1 \prod N t = 1 \prod T ϵ_{c, i}^{1_{{\tilde{r}_{t, i}^{c} \neq = μ_{c, i}}}} (1 - ϵ_{c, i})^{1_{{\tilde{r}_{t, i}^{c} = μ_{c, i}}}} .

\overset{μ}{^}_{c, i} = \mbox s i g n (t = 1 \sum T \tilde{r}_{t, i}^{c}) \mbox f or i = 1, ..., N,

\overset{μ}{^}_{c, i} = \mbox s i g n (t = 1 \sum T \tilde{r}_{t, i}^{c}) \mbox f or i = 1, ..., N,

N_{d} = t = 1 \sum T 1_{{\tilde{r}_{k, i}^{c} \neq = \overset{μ}{^}_{c, i}}} \mbox an d N_{s} = t = 1 \sum T 1_{{\tilde{r}_{k, i}^{c} = \overset{μ}{^}_{c, i}}} .

N_{d} = t = 1 \sum T 1_{{\tilde{r}_{k, i}^{c} \neq = \overset{μ}{^}_{c, i}}} \mbox an d N_{s} = t = 1 \sum T 1_{{\tilde{r}_{k, i}^{c} = \overset{μ}{^}_{c, i}}} .

\overset{ϵ}{^}_{c, j} = \frac{1}{T} t = 1 \sum T 1_{{\overset{μ}{^}_{c, j} \neq = \tilde{r}_{j, i}^{c}}} .

\overset{ϵ}{^}_{c, j} = \frac{1}{T} t = 1 \sum T 1_{{\overset{μ}{^}_{c, j} \neq = \tilde{r}_{j, i}^{c}}} .

Ψ_{D} (r) = c \in M argmin (r - \hat{\boldmath μ}_{c})^{T} \mbox diag [- lo g \overset{ϵ}{^}_{c, i}] (r - \hat{\boldmath μ}_{c}),

Ψ_{D} (r) = c \in M argmin (r - \hat{\boldmath μ}_{c})^{T} \mbox diag [- lo g \overset{ϵ}{^}_{c, i}] (r - \hat{\boldmath μ}_{c}),

Ψ_{D} (r) = c \in S (r) argmin (r - \hat{\boldmath μ}_{c})^{T} \mbox d ia g [- lo g \overset{ϵ}{^}_{c, i}] (r - \hat{\boldmath μ}_{c}) .

Ψ_{D} (r) = c \in S (r) argmin (r - \hat{\boldmath μ}_{c})^{T} \mbox d ia g [- lo g \overset{ϵ}{^}_{c, i}] (r - \hat{\boldmath μ}_{c}) .

O (N L_{max} (K lo g_{J} m + 1)),

O (N L_{max} (K lo g_{J} m + 1)),

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDistributed Sensor Networks and Detection Algorithms · Cooperative Communication and Network Coding · Advanced MIMO Systems Optimization

Full text

A Supervised-Learning Detector for Multihop Distributed Reception Systems

Seonho Kim and Song-Nam Hong Copyright (c) 2015 IEEE. Personal use of this material is permitted. However, permission to use this material for any other purposes must be obtained from the IEEE by sending a request to [email protected]. The authors are with the Department of Electrical Engineering, Ajou University, Suwon, Korea (e-mail: {kimsh1005, snhong}@ajou.ac.kr). This work was supported by Samsung Research Funding $\&$ Incubation Center of Samsung Electronics under Project Number SRFC-IT1702-00.

Abstract

We consider a multihop distributed uplink reception system in which $K$ users transmit independent messages to one data center of $N_{\rm r}\geq K$ receive antennas, with the aid of multihop intermediate relays. In particular, each antenna of the data center is equipped with one-bit analog-to-digital converts (ADCs) for the sake of power-efficiency. In this system, it is extremely challenging to develop a low-complexity detector due to the non-linearity of an end-to-end channel transfer function (created by relays’ operations and one-bit ADCs). Furthermore, there is no efficient way to estimate such complex function with a limited number of training data. Motivated by this, we propose a supervised-learning (SL) detector by introducing a novel Bernoulli-like model in which training data is directly used to design a detector rather than estimating a channel transfer function. It is shown that the proposed SL detector outperforms the existing SL detectors based on Gaussian model for one-bit quantized (binary observation) systems. Furthermore, we significantly reduce the complexity of the proposed SL detector using the fast kNN algorithm. Simulation results demonstrate that the proposed SL detector can yield an attractive performance with a significantly lower complexity.

Index Terms:

Multihop distributed reception system, data detection, classification, one-bit ADC.

I Introduction

A distributed uplink reception system is a special case of a multi-source single-destination multihop relay network where multiple sources send independent messages to one destination of a large number of antennas with the help of multihop intermediate relays. In this system, numerous information-theoretical approaches have been proposed in [1, 2, 3, 4], with the assumption that the destination perfectly knows all channel transfer functions (or at least end-to-end channel transfer function). A quantized-remap-and-forward (QMF) (extended in [2] where it is referred to as noisy network coding (NNC)) was presented in [1], which achieves the best-known performance. However, it is not practical as joint typical detector at the destination is prohibitive and the assumption of perfect channel state information is unrealistic. A more practical approach based on lattice code, named compute-and-forward (CoF), was presented in [5, 6, 7], which can significantly decrease the detection complexity, by converting the non-linear end-to-end channel transfer function into a linear one. However, its performance is not satisfactory for multihop relay networks with realistic channels (e.g., Rayleigh fading), due to a severe non-integer penalty [6]. Therefore, it is still an open problem to develop a practical detection and channel estimation methods for a multihop communication system.

In a distributed uplink reception system, the use of a large number of receive antennas at data center is necessary to support multiple sources simultaneously. Unfortunately, it can highly increase the hardware cost and the radio-frequency (RF) circuit consumption [8]. Especially, a high-resolution analog-to-digital converter (ADC) is most problematic as the power consumption of an ADC is scaled exponentially with the number of quantization bits and linearly with the baseband bandwidth [9, 10]. To overcome this, the use of low-resolution ADCs (e.g., 1 $\sim$ 3 bits) has received increasing attention for a large-scale multiple-input-multiple-output (MIMO) system [11, 12, 13, 14]. The one-bit ADC is particularly attractive as it does not need an automatic gain controller [15]. In this sense, we consider a multihop distributed uplink reception system in which each receive antenna of the data center is equipped with one-bit ADCs. For this system, an end-to-end channel transfer function between $K$ users and the data center is highly non-linear. Thus, it is extremely challenging to estimate such function with a limited number of one-bit quantized pilot signals. This motivates us to consider a data-driven supervised-learning (SL) detector in which the pilot signals (or training data) are exploited to directly learn a MIMO detector rather than estimating a complicated non-linear channel transfer function.

Very recently, SL detectors have been developed in [11] for MIMO systems with one-bit ADCs. It is remarkable that these methods are developed by assuming that data is generated from a Gaussian distribution. Although it is widely used, this model might not be suitable for binary data (e.g., one-bit quantized observations). In this paper, we propose a novel Bernoulli-like model which can be more suitable for binary random outputs. It is verified by showing that the proposed SL detector outperforms the existing SL detectors in [11]. Despite its superior performance, the complexity of the proposed SL detector (also, the existing SL detectors in [11]) is problematic as a search-space grows exponentially with the number of users $K$ . This is the major drawback to be used in practice. We address this problem by presenting a low-complexity SL (LSL) detector using the fast kNN algorithm. The fast kNN algorithm, which can find a closest point in Hamming space fastly using an efficient data structure, enables to efficiently remove unnecessary elements in the search-space according to a given current observation. Thus, the LSL detector can perform over the significantly reduced search-space. Simulation results demonstrate that the proposed LSL detector can yield an attractive performance with a practically manageable complexity.

This paper is organized as follows. In Section II, we describe a multihop distributed uplink reception system. In Section III, we briefly review the various existing SL detectors. In Section IV, we propose a novel SL detector based on a Bernoulli-like model. In Section V, we significantly reduce the complexity of the proposed SL detector by efficiently using the fast kNN algorithm. Section VI provides the simulation results to verify the superiority of the proposed SL detector. Finally, conclusion is provided in Section VII.

II System model

We consider a multihop distributed uplink reception system where $K$ sources transmit independent messages to one data center with the help of intermediate relays. In particular, the data center is equipped with $N_{\rm r}\geq K$ receive antennas with one-bit ADCs. Let $w_{k}\in\{{0,..,m-1}\}$ denote the source $k$ ’s message for $k\in\{1,...K\}$ , each of which contains the $\log{m}$ information bits. We also denote $m$ -ary constellation set by $S=\{s_{0},...,s_{m-1}\}$ with power constraint $\frac{1}{m}\sum_{i=0}^{m-1}|s_{i}|^{2}=P_{\rm t}$ . Let $\mbox{sign}(\cdot):\mbox{\bb R}\rightarrow\{-1,1\}$ represent the one-bit ADC quantizer function with $\mbox{sign}(u)=1$ if $u\geq 0$ and $\mbox{sign}(u)=-1$ , otherwise. Then, the transmitted symbol of the source $k$ , ${\tilde{x}}_{k}$ , is obtained by a modulation function $f:{W}\rightarrow{\cal S}$ as $\tilde{x}_{k}=f(w_{k})\in{\cal S}$ . Then, by converting a complex-valued scalar into an equivalent real-valued vector, the data center observes

[TABLE]

where $N=2N_{\rm r}$ and $\Phi(\cdot)$ represents a complex non-linear function (called end-to-end channel transfer function). Also, $\tilde{{\bf z}}=[\tilde{z}_{1},\ldots,\tilde{z}_{N}]\in\mathbb{R}^{N}$ denotes the noise vector whose elements are independent and identically distributed as circularly symmetric complex gaussian random variables with zero-mean and variance $\sigma_{z}^{2}$ , i.e., ${\tilde{z}}_{i}\sim{\cal C}{\cal N}(0,\sigma_{z}^{2})$ .

It is remarkable that $\Phi(\cdot)$ can capture all the intermediate relays’ operations and all the local wireless channels in the network. Although the proposed method in this paper can be applied to any relay’s operation and local channel model, we assume that in our simulations, each relay with a single antenna performs an amplify-and-forward (AF) and each local channel is assumed as Rayleigh fading. Also, for the simplicity, it is assumed that each relay has the same power constraint with the sources as $P_{\rm t}$ and all the additive noises at the receivers in the network are circularly symmetric complex gaussian random variables with zero-mean and variance $\sigma_{z}^{2}$ . We define the signal-to-noise ratios (SNRs) as ${\sf SNR}=P_{\rm t}/\sigma_{z}^{2}$ .

The proposed communication framework consists of training and data transmission phases (see Fig. 1). Note that during these phases, a wireless channel is assumed to be fixed.

•

Training phase: In this phase, $K$ sources transmits “known” sequences (i.e., pilot signals) so that the data center can learn a non-linear function $\Phi(\cdot)$ . With machine-learning perspective, the data center collects the data and the corresponding labels. Let ${\cal M}=\{0,\ldots,m-1\}^{K}$ denote the set of all possible messages of the $K$ sources. For each class $c\in{\cal M}$ , the $K$ sources transmit $T$ pilot signals $\tilde{{\bf x}}^{c}_{i}$ for $i=1,...,T$ . From (1), the data center can collect the labelled data set

[TABLE]

•

Data transmission phase: Given the ${\cal D}$ and a new observation ${\bf r}$ , the data center detects the class of ${\bf r}$ (i.e., users’ messages $\hat{{\bf w}}=({\hat{w}}_{1},...,\hat{w}_{K})$ ) as

[TABLE]

which is what we will propose in this paper.

III SL Detectors for Binary Data

In machine-learning perspective, the above detection problem (a.k.a., the supervised-learning problem) can be categorized into two approaches [16] as non-parametric and parametric learnings. A non-parametric learning does not require a priori knowledge on data set ${\cal D}$ (e.g., a distribution of data) such as $k$ -nearest neighbor (kNN), decision tree, and support vector machine (SVM). Whereas, in parametric learnings as logistic regression, naive bayes, and neural networks, data is assumed to be generated from a probabilistic model with some parameters (e.g., Gaussian model). Then, they are optimized from the given data set ${\cal D}$ . Therefore, it is very important to choose a proper probabilistic model based on a priori knowledge (or domain knowledge) on the data set ${\cal D}$ .

We briefly review the existing (parametric or non-parametric) SL detectors. It is noticeable that they can be immediately applied to a distributed reception system since the SL detector do not rely on system models.

•

Non-parametric learning: In [11], empirical maximum-likelihood detector (eMLD) and minimum mean distance detector (MMD) have been presented. The eMLD can be viewed as kNN classifier where the $k$ nearest data points from a new observation (or received signal) are identified and then, the majority voting is performed to find a class (e.g., users’ messages). Also, the MMD is the special case of eMLD with $k=1$ for the purpose of low-complexity.

•

Parametric learning: In this approach, it is most important to seek a proper probabilistic model for a given data set ${\cal D}$ . As in [11, 16], a Gaussian model is widely used where the data ${\bf r}\in{\cal D}$ is assumed to be generated from the probability distribution $P({\bf r}|c,\hbox{\boldmath$ \theta $}_{\rm{c}})={\cal N}\left(\hbox{\boldmath$ \mu $}_{c},\Sigma_{c}\right)$ . Here, $c\in{\cal M}$ denotes the class (or message) of the $K$ sources and $\hbox{\boldmath$ \theta $}_{c}$ represents the parameter vector for the class $c$ . Using the given ${\cal D}=\{\tilde{{\bf r}}_{t}^{c}:t=1,...,T\}$ , we can optimize $\hbox{\boldmath$ \theta $}_{c}=(\hat{\hbox{\boldmath$ \mu $}}_{c},\hat{\Sigma}_{c})$ via maximum likelihood (ML) estimation as

[TABLE]

where $\hat{\hbox{\boldmath$ \mu $}}_{c}$ and $\hat{\Sigma}_{c}$ represent the mean and the covariance of the training data associated with the class $c$ , respectively. When the training data is not sufficient, the covariance matrix tends to be rank-deficient and ill-conditioned. This problem can be resolved by shrinkage estimator [17]. Given the $\hat{\hbox{\boldmath$ \theta $}}_{c}=(\hat{\hbox{\boldmath$ \mu $}}_{c},\hat{\Sigma}_{c})$ , the optimal ML detector is derived as

[TABLE]

In particular, the distance measure in the above is referred to as Mahalanobis distance, and the inverse matrix of $\hat{\Sigma}_{c}$ in (5) is called a precision matrix. When $\Sigma_{c}={\bf I}$ for all $c$ , as a special case, the resulting detector is equivalent to the Minimum-Centered-Distance (MCD) detector proposed in [11].

It was shown in [11] that, among the above SL detectors, MCD and eMLD detectors show the best performances. Since the complexity of eMLD is higher than MCD, the latter was highly recommended. However, one can argue that Gaussian model in (6) might not be suitable to model the distribution of binary data ${\bf r}\in\{1,-1\}^{N}$ . This motivates us to propose a SL detector using a novel Bernoulli-like probabilistic model (see Section IV).

IV The Proposed (Parametric) SL Detector

We propose a novel SL detector based on a Bernoulli-like model, where data is assumed to be generated from the following probability distribution:

[TABLE]

where $\hbox{\boldmath$ \theta $}_{c}=(\hbox{\boldmath$ \mu $}_{c},\hbox{\boldmath$ \epsilon $}_{c}$ ) for $c\in{\cal M}$ , $\epsilon_{c,i}<0.5$ for all $i$ , and ${\bf 1}_{\{{\cal A}\}}$ represents an indicator function with ${\bf 1}_{\{{\cal A}\}}=1$ if ${\cal A}$ is true, and ${\bf 1}_{\{{\cal A}\}}=0$ , otherwise. Given the training data for the class $c$ (e.g., $\{\tilde{{\bf r}}_{t}^{c}:t=1,...,T\}$ ), the parameter vector $\hbox{\boldmath$ \theta $}_{c}$ is optimized using ML estimation as

[TABLE]

By plugging (7) into (8), the optimal parameters are obtained by taking the solutions of

[TABLE]

For any $\epsilon_{c,i}<0.5$ , we can see that the above objective function is maximized by taking

[TABLE]

independently from the choices of $\epsilon_{c,i}$ ’s. We let

[TABLE]

Then, we can find an optimal $\epsilon_{c,i}$ independently from the other $\epsilon_{c,j}$ ’s with $i\neq j$ by taking the solution of $\operatornamewithlimits{argmax}_{\epsilon_{c,i}}\epsilon_{c,i}^{N_{d}}(1-\epsilon_{c,i})^{N_{s}}$ . Taking $\frac{\partial(\epsilon_{c,i}^{N_{d}}(1-\epsilon_{c,i})^{N_{s}})}{\partial\epsilon_{c,i}}=0$ , the optimal $\epsilon_{c,i}$ is obtained as

[TABLE]

With the parameter vector $\hat{\hbox{\boldmath$ \theta $}}_{c}=(\hat{\hbox{\boldmath$ \mu $}}_{c},\hat{\hbox{\boldmath$ \epsilon $}}_{c})$ in (9) and (11), the optimal ML estimator (i.e., the proposed SL detector) is derived as

[TABLE]

where $\mbox{\bf diag}[d_{i}]$ denotes the diagonal matrix with the $i$ -th diagonal element $d_{i}$ and its dimension is easily obtained from the context.

V The Proposed LSL Detector

In the proposed SL detector in (12), the computational complexity is expensive as the size of search-space (e.g., $|{\cal M}|=m^{K}$ ) grows exponentially with $K$ . To address this problem, we present a low-complexity SL (LSL) detector which is performed over the reduced search-space. The major contribution of this section is to build the reduced search-space by efficiently removing unnecessary candidates from the ${\cal M}$ according to a current observation ${\bf r}$ .

The proposed method to yield the reduced search-space can be outlined as follows (see Fig 1):

Training phase: From the training data ${\cal D}=\{{\bf r}_{t}^{c}:t=1,...,T,c\in{\cal M}\}$ , the parameters for the proposed SL detector are obtained from (9) and (11) as $\hat{{\cal U}}=\{\hat{\hbox{\boldmath$ \mu $}}_{c}:c\in{\cal M}\}$ and $\hat{{\cal E}}=\{\hat{\hbox{\boldmath$ \epsilon $}_{c}}:c\in{\cal M}\}$ . Then, $\hat{{\cal U}}$ is decomposed using $k$ -medoids clustering in [18], yielding a hierarchical clustering tree (see Algorithm 1). This algorithm starts with all the elements in ${\cal D}$ and decomposes them into $J$ clusters, where $J$ is a parameter of the algorithm and called branching factor. The clusters are constructed by selecting $J$ elements randomly as cluster centroids and then by assigning other elements to one of the clusters with the closest centroid. The algorithm is repeated recursively until the number of elements in each cluster is below the maximum leaf size $J$ , where in this case, that node becomes a leaf node. In addition, Algorithm 1 is performed over $W$ times to construct the $W$ trees having possibly different decomposition structures, denoted by $\{{\cal T}_{1},\ldots,{\cal T}_{W}\}$ . The use of the multiple trees can improve the quality of the resulting reduced search space.

Data transmission phase: Given a current observation ${\bf r}$ , the search algorithm begins with traversing multiple trees in parallel. Note that $W$ multiple trees share a single priority queue ( ${\cal Q}$ ) where the nodes in the priority queue are arranged in the shortest Hamming distance order, with respect to the current observation ${\bf r}$ . Then, it can efficiently produce the reduced search-space ${\cal S}({\bf r})\subseteq{\cal M}$ which only contains the nearest $\hat{\hbox{\boldmath$ \mu $}}_{c}$ ’s to the ${\bf r}$ . The detailed procedures are given in Algorithm 2. Then, the proposed SL detector is performed as

[TABLE]

From now on, we will analyze the computational complexity of the proposed LSL detector which consists of construction and search complexities. Note that the construction complexity is taken only once during each coherence time and thus, it can be negligible when the coherence time is sufficiently large (i.e., a channel is slowly changed). Recall that $N$ denotes the observation dimensionality (e.g., $N=2N_{r}$ ) and $m^{K}$ denote the number of all possible messages of $K$ users.

Construction complexity: On the construction of $k$ -medoids hierarchical trees, the complexity of distance-computation is equal to $\mathcal{O}(m^{K}N)$ with respect to a node selected as cluster centroid. Given the branch factor $J$ , it constructs $J$ distinctive clusters at each tree level and thus, the corresponding complexity is equal to $\mathcal{O}(m^{K}NJ)$ . Assuming that $W$ multiple trees have balanced structures, the height of the tree will be ${K\log_{J}{m}}$ . Then, the overall complexity for the forest construction is equal to $\mathcal{O}({m^{K}}NJW{K\log_{J}{m}})$ .

Search complexity: First, it starts with traversing the forest $\{{\cal T}_{1},\ldots,{\cal T}_{W}\}$ simultaneously. Each node computes the distances with the $J$ child nodes to find the closest node at each level. This computation repeats until it reaches to a leaf node at level ${K\log_{J}{m}}$ for each tree ${\cal T}_{i}$ . The corresponding complexity is $\mathcal{O}(WJN{K\log_{J}{m}})$ . According to the Algorithm 2, it stops after examining $L_{max}$ elements. Except the first step, the remaining steps start with a node popped out from priority queue, which is likely to be located at a level between root and leaf. Assuming that every search returns $J$ elements in a reached leaf node and the starting point is the root, the corresponding complexity is equal to $\mathcal{O}(WJN{K\log_{J}{m}}+(L_{\rm max}-WJ)N{K\log_{J}{m}})$ . Note that this complexity is an upper bound and an actual complexity becomes lower both when leaf nodes of the trees return duplicate codes simultaneously and when trees would have skewed structures. Also, the search complexity for priority queue is negligible compared to the tree search complexity. In sum up, the overall complexity can be well-approximated as

[TABLE]

where $NL_{\rm max}$ accounts for the detection complexity in (13). It is noticeable that the complexity of the proposed low-complexity SL detector grows linear with $K$ while the other SL detectors grow exponentially with $K$ . The approximated complexity in (14) will be used in Section VI to compute the complexity of the proposed LSL detector.

VI Numerical results

We evaluate the average bit-error-rate (BER) performances of the proposed SL detector over the existing SL detectors in [11]. Also, it is shown that the proposed low-complexity SL detector achieves the original performance with much lower complexity. For the simulations, QPSK modulation and Rayleigh fading are assumed. When a training overhead is small (e.g., $T$ is small), an empirical error-probability (e.g., $\epsilon_{c,i}$ ) can be underestimated as zero although it is indeed not. Since this can cause severe error-floor problem, we assign a minimum value of $\hat{\epsilon}_{c,i}$ as $10^{-3}$ .

Fig. 2 shows the BER performances of the proposed SL detector and the existing one in [11]. Here, the number of training for each $c$ is set by 15 (e.g., $T=15$ ). We considered the two hop distributed reception network for the following two scenarios: i) 64 intermediate relays; ii) 128 intermediate relays. From Fig. 2, we can observe that the proposed SL detector outperforms the existing SL detector, which implies that the proposed Bernoulli-like model is more suitable to binary data than Gaussian model.

Fig. 3 shows the BER performances of the proposed low-complexity detector according to $L_{\rm max}$ in Algorithm 2. Also, we set $J=32$ in Algorithm 1. In this simulation, the benefit of low-complexity detector stands out, since it can achieve the optimal performance perfectly with only $6\%$ of original complexity. Thus, it is expected that the use of low-complexity technique is more beneficial for a large-scale distributed reception system (e.g., a large $K$ ).

VII Conclusion

We proposed a supervised-learning (SL) detector by introducing a novel Bernoulli-like model as data probability distribution. Differently from a widely used Gaussian model, it can exploit the structure of binary data (e.g., one-bit quantized observation). We further developed the low-complexity SL detector with the aid of the fast kNN algorithm. Simulation results demonstrated that the proposed low-complexity detector almost achieves the original performance with a significantly lower complexity. Therefore, the proposed detector would be a good practical candidate for multihop distributed reception systems. We would like to emphasize that the proposed SL detector can be straightforwardly applied to any multihop relay network with a single destination. On going work, we are investigating to generalize the Bernoulli-like model by capturing the correlation of elements in an observed data. Also, it is an interesting future work to extend the proposed SL detector for the multihop communication systems with low-resolution ADCs (e.g., 2 $\sim$ 3-bit ADCs).

Bibliography18

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] S. Avestimehr, S. Diggavi, and D. Tse, “Wireless network information flow: a deterministic approach,” IEEE Trans. Inf. Theory, vol. 57, no. 4, pp. 1872-1905, Apr. 2011.
2[2] S. Lim, Y. H. Kim, A. E. Gamal, and S. Chung, “Noisy network coding,” IEEE Trans. Inf. Theory, vol. 57, no. 5, pp. 3132-3152, May 2011.
3[3] S.-H. Park, O. Simeone, O. Sahin, and S. Shamai, “Robust and efficient distributed compression for cloud radio access networks,” IEEE Trans. Veh. Tech., vol. 62, no. 2, pp. 692-703, Feb. 2013.
4[4] S.-H. Park, O. Simeone, O. Sahin, and S. Shamai, “Robust layered transmission and compression for distributed uplink reception in cloud radio access networks,” IEEE Trans. Veh. Tech., vol. 63, no. 1, pp. 204-216, Jan. 2014.
5[5] B. Bobak and M. Gastpar, “Compute-and-forward: harnessing interference through structured codes,” IEEE Trans. Inf. Theory, vol. 57, no. 10, pp. 6463-6486, Oct. 2011.
6[6] S.-N. Hong and G. Caire, “Compute-and-forward strategies for cooperative distributed antenna systems,” IEEE Trans. Inf. Theory, vol. 59, no. 9, pp. 5227–5243, Sep. 2013.
7[7] S.-N. Hong, Y.-S. Jeon and N. Lee, “Distributed uplink reception in cloud radio access networks: a linear coding approach,” IEEE Trans. Veh. Tech., vol. 67, no. 2, pp. 1470-1481, Feb. 2018.
8[8] Y. Hong and T. L. Marzetta, “Total energy efficiency of cellular large scale antenna system multiple access mobile networks,” in Proc. IEEE Conf. Green Commun. (Green Com), pp. 29-31, Piscataway, NJ, Oct. 2013.