Deep Learning for Signal Demodulation in Physical Layer Wireless   Communications: Prototype Platform, Open Dataset, and Analytics

Hongmei Wang; Zhenzhen Wu; Shuai Ma; Songtao Lu; Han Zhang; Guoru; Ding; and Shiyin Li

arXiv:1903.04297·eess.SP·March 12, 2019

Deep Learning for Signal Demodulation in Physical Layer Wireless Communications: Prototype Platform, Open Dataset, and Analytics

Hongmei Wang, Zhenzhen Wu, Shuai Ma, Songtao Lu, Han Zhang, Guoru, Ding, and Shiyin Li

PDF

TL;DR

This paper introduces a new open dataset and prototype platform for wireless signal demodulation, proposing two deep learning-based demodulators that outperform traditional methods in real-world experiments.

Contribution

The paper presents the first open dataset of real wireless signals and develops two novel DL-based demodulators combining DBN-SVM and AdaBoost techniques.

Findings

01

DBN-SVM demodulator outperforms traditional classifiers

02

AdaBoost demodulator achieves higher accuracy with KNN weak classifiers

03

Proposed methods surpass existing single-classifier approaches

Abstract

In this paper, we investigate deep learning (DL)-enabled signal demodulation methods and establish the first open dataset of real modulated signals for wireless communication systems. Specifically, we propose a flexible communication prototype platform for measuring real modulation dataset. Then, based on the measured dataset, two DL-based demodulators, called deep belief network (DBN)-support vector machine (SVM) demodulator and adaptive boosting (AdaBoost) based demodulator, are proposed. The proposed DBN-SVM based demodulator exploits the advantages of both DBN and SVM, i.e., the advantage of DBN as a feature extractor and SVM as a feature classifier. In DBN-SVM based demodulator, the received signals are normalized before being fed to the DBN network. Furthermore, an AdaBoost based demodulator is developed, which employs the $k$ -Nearest Neighbor (KNN) as a weak classifier to form a…

Tables1

Table 1. TABLE I: Experimental equipment and parameters

Experiment setup	Type and parameters
EXG RF vector signal generator	Keysight N5172B
MXA vector signal analyzer	Keysight N9020B
Antenna Gain	24 $dBi$

Equations48

x (t) = V_{m} cos (2 π f_{c} t + θ_{m}), m = 1, ..., M, 1 \leq t \leq T,

x (t) = V_{m} cos (2 π f_{c} t + θ_{m}), m = 1, ..., M, 1 \leq t \leq T,

y (t) = g (t) x (t) + n_{r} (t),

y (t) = g (t) x (t) + n_{r} (t),

\overset{y}{^}_{i} = \frac{y _{i} - y _{m i n}}{y _{m a x} - y _{m i n}}, 1 \leq i \leq N L,

\overset{y}{^}_{i} = \frac{y _{i} - y _{m i n}}{y _{m a x} - y _{m i n}}, 1 \leq i \leq N L,

E (v_{k}, h_{k}) = - h_{k}^{T} W_{k} v_{k} - a_{k}^{T} v_{k} - b_{k}^{T} h_{k},

E (v_{k}, h_{k}) = - h_{k}^{T} W_{k} v_{k} - a_{k}^{T} v_{k} - b_{k}^{T} h_{k},

p (v_{k}) = \frac{1}{Z _{k}} h_{k} \sum e^{E (v_{k}, h_{k})},

p (v_{k}) = \frac{1}{Z _{k}} h_{k} \sum e^{E (v_{k}, h_{k})},

W_{k}, a_{k}, b_{k} max v_{k} \sum lo g p (v_{k}) .

W_{k}, a_{k}, b_{k} max v_{k} \sum lo g p (v_{k}) .

\frac{\partial lo g p ( v _{k} )}{\partial w _{k, β}^{(α)}}

\frac{\partial lo g p ( v _{k} )}{\partial w _{k, β}^{(α)}}

- v_{k} \sum p (v_{k})

\frac{\partial lo g p ( v _{k} )}{\partial a _{k, α}}

\frac{\partial lo g p ( v _{k} )}{\partial b _{k, β}}

- v_{k} \sum p (v_{k})

p (h_{k, β} = 1∣ v_{k}) = sigmoid (b_{k, β} + v_{k}^{T} w_{k, β}),

p (h_{k, β} = 1∣ v_{k}) = sigmoid (b_{k, β} + v_{k}^{T} w_{k, β}),

p (v_{k, α} = 1 ∣ h_{k}) = sigmoid a_{k, α} + β = 1 \sum N_{k} h_{k, β} w_{k, β}^{(α)},

w_{k + 1, β}^{(α)}

w_{k + 1, β}^{(α)}

a_{k + 1, α}

b_{k + 1, β}

G_{q} (\overset{ˉ}{y}_{l_{1}}, \overset{ˉ}{y}_{l_{2}}) = exp (- \frac{y ˉ _{l_{1}} - y ˉ _{l_{2}} ^{2}}{2 σ _{q}^{2}}),

G_{q} (\overset{ˉ}{y}_{l_{1}}, \overset{ˉ}{y}_{l_{2}}) = exp (- \frac{y ˉ _{l_{1}} - y ˉ _{l_{2}} ^{2}}{2 σ _{q}^{2}}),

c_{q} min

c_{q} min

s . t .

0 \leq c_{q, l_{1}} \leq K, l_{1} \in L_{1},

f_{q} (\overset{ˉ}{y}_{l_{1}}) = γ (i = 1 \sum L_{1} c_{q, i}^{*} z_{i} exp (- \frac{y ˉ _{i} - y ˉ _{l_{1}} ^{2}}{2 σ _{q}^{2}}) + b_{q}^{*}),

f_{q} (\overset{ˉ}{y}_{l_{1}}) = γ (i = 1 \sum L_{1} c_{q, i}^{*} z_{i} exp (- \frac{y ˉ _{i} - y ˉ _{l_{1}} ^{2}}{2 σ _{q}^{2}}) + b_{q}^{*}),

\displaystyle\gamma\left(x\right)\buildrel\Delta\over{=}\left\{{\begin{array}[]{*{20}{l}}{1,\quad\rm{if}\quad x\geq 0}\\ {0,\quad\rm{if}\quad x<0}\end{array}}\right..

\displaystyle\gamma\left(x\right)\buildrel\Delta\over{=}\left\{{\begin{array}[]{*{20}{l}}{1,\quad\rm{if}\quad x\geq 0}\\ {0,\quad\rm{if}\quad x<0}\end{array}}\right..

\overset{z}{^} = m \in M ar g max {u_{m}} .

\overset{z}{^} = m \in M ar g max {u_{m}} .

l^{*} = i \in L_{1} ar g min \tilde{y}_{d_{i}} - \hat{y}_{l}_{2}, l \in L_{1}, d \in D .

l^{*} = i \in L_{1} ar g min \tilde{y}_{d_{i}} - \hat{y}_{l}_{2}, l \in L_{1}, d \in D .

χ_{d} = l = 1 \sum L_{1} w_{d} (l) I (f_{d} (\hat{y}_{l}), z_{l}), d \in D,

χ_{d} = l = 1 \sum L_{1} w_{d} (l) I (f_{d} (\hat{y}_{l}), z_{l}), d \in D,

\displaystyle I\left({x,y}\right)=\left\{{\begin{array}[]{*{20}{l}}{1,\quad{\text{if}}\quad x\neq y}\\ {0,\quad{\text{if}}\quad x=y}.\end{array}}\right.

\displaystyle I\left({x,y}\right)=\left\{{\begin{array}[]{*{20}{l}}{1,\quad{\text{if}}\quad x\neq y}\\ {0,\quad{\text{if}}\quad x=y}.\end{array}}\right.

w_{d + 1} (l) = \frac{w _{d} ( l ) exp ( - α _{d} I ( f _{d} ( y ^ _{l} ) , z _{l} ) )}{Q _{d}},

w_{d + 1} (l) = \frac{w _{d} ( l ) exp ( - α _{d} I ( f _{d} ( y ^ _{l} ) , z _{l} ) )}{Q _{d}},

l \in L_{1}, d \in D,

F (\hat{y}_{l}) = \overset{z_{l}}{^} = z_{l} \in Φ ar g max d = 1 \sum D α_{d} (1 - I (f_{d} (\hat{y}_{l}), z_{l})),

F (\hat{y}_{l}) = \overset{z_{l}}{^} = z_{l} \in Φ ar g max d = 1 \sum D α_{d} (1 - I (f_{d} (\hat{y}_{l}), z_{l})),

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSupport Vector Machine

Full text

Deep Learning for Signal Demodulation in Physical Layer Wireless Communications: Prototype Platform, Open Dataset, and Analytics

Hongmei Wang, Zhenzhen Wu, Shuai Ma, Songtao Lu, Han Zhang, Guoru Ding, and Shiyin Li Manuscript received January 22, 2019. Corresponding author: Shuai Ma.)H. Wang, Z. Wu, S. Ma, and S. Li are with Information and Control Engineering, China University of Mining and Technology, Xuzhou, China 221116 (Email:[email protected], [email protected], [email protected], [email protected]).S. Ma is also with the State Key Laboratory of Integrated Services Networks, Xidian University, Xi’an 710071, China.S. Lu is with the Department of Electrical and Computer Engineering, University of Minnesota, Minneapolis, MN 55455, USA (e-mail: [email protected]).H. Zhang is with the Department of Electrical and Computer Engineering, University of California, Davis, CA 95616, USA (e-mail: [email protected]).G. Ding is with the College of Communications Engineering, Army Engineering University, Nanjing 210007, China (e-mail: [email protected])The work of S. Ma and S. Li were supported by the Fundamental Research Funds for the Central Universities under Grant 2017QNA32,by the National Natural Science Foundation of China under Grant 61701501, Grant 61771474; by the Natural Science Foundation of Jiangsu Province under Grant BK20170287; by China Postdoctoral Science Foundation under Grant 2016M600452; by the State Key Laboratory of Integrated Services Networks (Xidian University) under grant ISN19-07; and by the Key Laboratory of Cognitive Radio and Information Processing , Ministry of Education (Guilin University of Electronic Technology) under grant CRKL180204, by Key Laboratory of Ocean Observation-Imaging Testbed of Zhejiang Province. The work of H. Wang was supported by the National Natural Science Foundation of China under Grant 61601464.

Abstract

In this paper, we investigate deep learning (DL)-enabled signal demodulation methods and establish the first open dataset of real modulated signals for wireless communication systems. Specifically, we propose a flexible communication prototype platform for measuring real modulation dataset. Then, based on the measured dataset, two DL-based demodulators, called deep belief network (DBN)-support vector machine (SVM) demodulator and adaptive boosting (AdaBoost) based demodulator, are proposed. The proposed DBN-SVM based demodulator exploits the advantages of both DBN and SVM, i.e., the advantage of DBN as a feature extractor and SVM as a feature classifier. In DBN-SVM based demodulator, the received signals are normalized before being fed to the DBN network. Furthermore, an AdaBoost based demodulator is developed, which employs the $k$ -Nearest Neighbor (KNN) as a weak classifier to form a strong combined classifier. Finally, experimental results indicate that the proposed DBN-SVM based demodulator and AdaBoost based demodulator are superior to the single classification method using DBN, SVM, and maximum likelihood (MLD) based demodulator.

Index Terms:

Machine learning, DBN-SVM based demodulator, AdaBoost based demodulator.

I Introduction

Conventional wireless communication systems are generally designed in accordance with the rigorous mathematical theories and accurate system models [1]. However, because of increasing wireless service requirements, such as the use of smartphones, virtual reality, and internet of things (IoT), it is challenging to characterize future complex wireless communication networks accurately by using tractable mathematical models or system models [2]. Recently, deep learning (DL) [3], as an effective method to handle complex problems, has attracted increasing attention from both academia and industry. DL has been applied in image recognition [4, 5], computer vision [6], natural language processing [7] and spectrum prediction [8], etc. In addition, some literatures have focused on using DL to optimize performance of wireless communication systems [9, 10, 11]. In [9], an unsupervised learning-based fast beamforming method is proposed to maximize the weighted sum rate under the total power constraint. In [10], a deep recurrent neural network based algorithm is proposed to tackle energy efficient resource allocation problem for heterogeneous IoT. In [11], a three dimensional message-passing algorithm based on deep learning scheme is proposed to minimize the weighted sum of the secondary interference power for cognitive radio networks. Recent works [12, 13] have interpreted an end-to-end wireless communication system as an auto-encoder. This is promising for applications of DL to wireless communications.

Demodulation is one of the fundamental modules for wireless communications systems for high-speed transmission with a low bit error rate. Theoretically, optimum demodulators of conventional wireless communication systems are designed for additive white Gaussian noise (AWGN) channels [1]. Moreover, both channel state information (CSI) and channel noise distribution are usually required. Most previous studies [14, 15, 16, 17, 18, 19] have assumed that each receiver can accurately estimate the fading coefficients. However, practical wireless communication channels may suffer from multi-path fading, impulse noise, spurious or continuous jamming, and numerous other complex impairments, which deteriorate demodulation performance significantly. Because of the limited length of the training sequence, the estimate CSI will have limited accuracy [20]. Especially, for fast-fading scenarios, it is difficult to accurately estimate CSI because the fading coefficients change rapidly during the data transmission period. Designing optimum demodulators for different channel models is challenging because the channel model may not be known at the receiver end.

Given the above issues, DL-based model-free demodulators have attracted a considerable amount of attention, where the requirements for a priori knowledge can be widely relaxed or even removed [21]. Because the information of the modulated signals is represented by the amplitude and phase, feature extraction is of critical importance for signal demodulation.

DL-based demodulators have been investigated in conventional radio frequency (RF) systems. In [22], a deep convolutional network demodulator (DCND) is proposed to demodulate mixed modulated signals, which can further reduce the bit error rate compared with the coherent demodulation method. In [23], the authors show that the proposed demodulator based on deep belief network (DBN) is feasible for an AWGN channel with a certain channel impulse response and a Rayleigh non-frequency-selective flat fading channel. In [24], a DL-based detector is proposed for signal demodulation in short-range multi-channels without a signal equalizer. In [25], the authors show that deep convolutional neural networks (DCNN) for frequency-shift keying (FSK) demodulation can substantially reduce error bit probabilities over an AWGN Rayleigh-fading channel. To the best of our knowledge, most of existing DL-based demodulation schemes are based on simulated data rather than real measured data.

This paper presents a data-driven framework for DL-based demodulators. Specifically, two data-driven demodulation methods based on DBN-support vector machine (SVM) and adaptive boosting (AdaBoost) [26] are developed for end-to-end wireless communication systems. These methods learn and extract features from the received modulation signals without any prior knowledge of the channel model. Moreover, the performance of the two data-driven demodulators are evaluated on different modulation schemes through real measured data. The main contributions of this paper are as follows:

•

A flexible end-to-end wireless communication prototype platform is developed for application in real physical environments, which can generate real signals. The prototype is used to establish measured modulation datasets from real communication systems in actual physical environments in eight modulation schemes, i.e., binary phase shift keying (BPSK) and multiple quadrature amplitude modulation ( $M$ -QAM) modulation, where $M={2^{\phi}}$ and $\phi=\left\{{2,3,4,5,6,7,8}\right\}$ . The received SNR of the eight modulated signals are measured from $3$ dB to $25$ dB. An open online real modulated dataset is established, available at https://pan.baidu.com/s/1biDooH6E81Toxa2u4D3p2g or https://drive.google.com/open?id=1jXO9OMZOyVMOYv QSn3WVmlfQoQbonKuo , where the transmission distance of the eight modulated signals is measured in an indoor environment. To the best of our knowledge, this is the first open dataset of real modulated signals for wireless communication systems.

•

Then, based on the measured data, two DL-based demodulators are proposed, namely, DBN-SVM based demodulator and AdaBoost based demodulator. The proposed DBN-SVM based demodulator, which has a novel demodulation architecture, exploits the advantages of both DBN and SVM, i.e., the advantage of DBN as a feature extractor and SVM as a feature classifier. To accelerate the convergence rate, the received signals are first normalized before being fed to the DBN network so that the features of the received signals can be extracted, the SVM is utilized to classify these features.

•

An AdaBoost based demodulator, which utilizes multiple KNNs as a weak classifier to form a strong combined classifier, is developed. The proposed AdaBoost based demodulator increases the weights for the error demodulated symbols and decreases the corresponding weights for correctly demodulated symbols during the iterations.

•

Finally, the demodulation performance of the two proposed data-driven demodulators are investigated. Specifically, the demodulation accuracies of the two DL-based demodulators decrease over the respective transmission and modulation orders for a fixed transmission distance. The experimental results also show that the demodulation accuracy of the DBN-SVM based demodulator is higher than those of DBN-based and SVM-based demodulators. Moreover, the demodulation accuracy of the AdaBoost based demodulator is higher than that of the DBN-SVM based demodulator at the lower SNR regions, and the accuracies of the two demodulators are similar at high SNRs. For the high SNR scenario, a high-order modulation is generally preferred.

The remainder of this paper is organized as follows. Section II describes the system model. Section III explores the structures of the DBN-SVM and AdaBoost, including detailed descriptions of the data stream and how to make classification decisions. In section IV, the data analysis results are provided and analyzed. Finally, the conclusions from the study are drawn in Section V.

Notations: Boldfaced lowercase and uppercase letters represent vectors and matrices, respectively. The transpose of a matrix is denoted as ${\left(\cdot\right)^{\rm{T}}}$ . $\mathcal{L}\buildrel\Delta\over{=}\{1,2,...,L\}$ , $\mathcal{L}_{1}\buildrel\Delta\over{=}\{1,...,L_{1}\}$ , $\mathcal{M}_{k}\buildrel\Delta\over{=}\left\{{1,2,...,{M_{k}}}\right\}$ , $\mathcal{N}_{k}\buildrel\Delta\over{=}\left\{{1,2,...,{N_{k}}}\right\}$ , $\mathcal{D}\buildrel\Delta\over{=}\{1,...,D\}$ , $\mathcal{Q}\buildrel\Delta\over{=}\{0,1,\ldots,\bar{M}\}$ , and $\mathcal{M}\buildrel\Delta\over{=}\left\{{0,1,\ldots,M-1}\right\}$ .

II System Model

An end-to-end wireless communication system111The term end-to-end wireless system model implies that signal features are learned from a single deep neural network, without the complex multi-stage expert machine learning processing[12, 13, 27, 28, 29]. is considered, which includes a single antenna transmitter and a single antenna receiver, as illustrated in Fig. 1. By adopting the BPSK or $M$ -QAM digital modulation schemes, the transmitted signal $x\left(t\right)$ is given as

[TABLE]

where ${V_{m}}$ , ${{\theta_{m}}}$ and $T$ denote the amplitude, phase, and period of the signal $x\left(t\right)$ , respectively; ${f_{c}}$ is the carrier frequency.

Let $g\left(t\right)$ denote the multipath channel between the transmitter and the receiver, which may suffer nonlinear distortion, interference, and frequency selective fading. At the receiver, the received signal $y\left(t\right)$ is given by

[TABLE]

where ${n_{r}}\left(t\right)$ denotes the received noise.

Then, the received analog signal $y\left(t\right)$ is converted to the digital signal via the vector signal analyzer. Let ${\bf{y}}\buildrel\Delta\over{=}{\left[{{y_{1}},{y_{2}},...,{y_{NL}}}\right]^{T}}$ denote the total sampled digital signal vector, where ${y_{n}}=y\left({\frac{{n-1}}{N}T}\right)$ is the $n$ th sample, $N$ is the number of samples of one period, and $L$ denotes the number of signal periods.

Before the demodulation process, the received signal ${\bf{y}}$ is normalized to $[0,1]$ , which can accelerate the DL network processing speed[30]. Senerally, the normalized data ${\bf{\hat{y}}}\buildrel\Delta\over{=}{\left[{{{\hat{y}}_{1}},{{\hat{y}}_{2}},...,{{\hat{y}}_{NL}}}\right]^{T}}$ is given by

[TABLE]

where ${y_{\min}}=\mathop{\min}\limits_{1\leq i\leq N{L}}{y_{i}}$ , and ${y_{\max}}=\mathop{\max}\limits_{1\leq i\leq N{L}}{y_{i}}$ .

Because the information of the BPSK and $M$ -QAM are represented by amplitudes and phases, DL is used to extract information features from the received signals. Specifically, with the sampled signal vector ${\mathbf{y}}$ , two DL-based demodulators are proposed: DBN-SVM based demodulator and AdaBoost based demodulator. The DL-based demodulators consist of two phases: training phase and testing phase. During the training phase, the parameters of the DL-based demodulators are optimized with the training dataset. Then, in the testing phase, the demodulators demodulate the received signal and recover the transmitted information.

Let ${{z_{l}}}$ denote the label signal of the $l$ th period, where $l\in\mathcal{L}$ and $\Phi$ is the label set, i.e., $\Phi=\left\{{{z_{1}},{z_{2}},\ldots,{z_{L}}}\right\}$ , which is determined by the modulation scheme. Let ${\rm{{\cal T}_{1}}}=\left\{{\left({{{{\bf{\hat{y}}}}_{1}},{z_{1}}}\right),\left({{{{\bf{\hat{y}}}}_{2}},{z_{2}}}\right),\ldots,\left({{{{\bf{\hat{y}}}}_{{L_{1}}}},{z_{{L_{1}}}}}\right)}\right\}$ denote the labeled training signal set, where ${{{\bf{\hat{y}}}}_{l}}={\left[{{{\hat{y}}_{1+\left({l-1}\right)N}},{{\hat{y}}_{2+\left({l-1}\right)N}},...,{{\hat{y}}_{lN}}}\right]^{T}}$ denotes the normalized signal of the $l$ th period, and $L_{1}$ denotes the total number of training signal periods $\left({{L_{1}}<L}\right)$ .

III DBN-SVM based Demodulator

As an unsupervised features extraction method, the DBN can efficiently extract high-level and hierarchical features from the measured signal, while the SVM minimizes the structure risk and shows good learning and generalization performance with a small amount of samples. Inspired by those advantages of the two approaches, a combination of DBN and SVM for demodulation is proposed. The DBN-SVM demodulator is shown in Fig. 2, the DBN is used as a feature generator and the SVM is used as a classifier.

III-1 DBN

The proposed DBN includes three stacked restricted Boltzmann machines (RBM)[31], i.e., RBM1, RBM2, and RBM3, as shown in Fig. 2. Specifically, RBMk is an undirected, bipartite graphical model, and it composes a visible layer ${{\bf{v}}_{k}}=[v_{k,1},v_{k,2},...,v_{k,{M_{k}}}]^{T}$ and a hidden layer ${{\bf{h}}_{k}}=[h_{k,1},h_{k,2},...,h_{k,{N_{k}}}]^{T}$ , where $v_{k,\alpha}$ and $h_{k,\beta}$ are the $\alpha$ th neuron of ${{\bf{v}}_{k}}$ and the $\beta$ th neuron of ${{\bf{h}}_{k}}$ , respectively, $\alpha\in\mathcal{M}_{k}$ , $\beta\in\mathcal{N}_{k}$ , $k\in\left\{{1,2,3}\right\}$ . The visible layer ${{\bf{v}}_{k}}$ and hidden layer ${\bf{h}}_{k}$ are fully connected via a symmetric undirected weighted matrix ${{\bf{W}}_{k}}=\left[{{{\bf{w}}_{k,1}},{{\bf{w}}_{k,2}},\ldots,{{\bf{w}}_{k,{N_{k}}}}}\right]^{T}$ , where ${{\bf{w}}_{k,\beta}}={[{w_{k,\beta}^{\left(1\right)},w_{k,\beta}^{\left(2\right)},\ldots,w_{{k},\beta}^{\left(M_{k}\right)}}]^{T}}$ is a weight vector between ${{\bf{v}}_{k}}$ and ${h_{k,\beta}}$ . For the three RBM, there is no intralayer connections between either the visible layer or the hidden layer.

For RBMk, the energy $E\left({{{\bf{v}}_{k}},{{\bf{h}}_{k}}}\right)$ is defined by combining the configuration of both ${{\bf{v}}_{k}}$ and ${{\bf{h}}_{k}}$ as follows

[TABLE]

where ${{\bf{a}}_{k}}={\left[{a_{k,1},a_{k,2},...,a_{k,{{M_{K}}}}}\right]^{T}}$ is an offset vector of ${{\bf{v}}_{k}}$ , and ${{\bf{b}}_{k}}={\left[{b_{k,1},b_{k,2},...,b_{k,{{N_{K}}}}}\right]^{T}}$ is an offset vector of ${{\bf{h}}_{k}}$ .

Based on $E\left({{{\bf{v}}_{k}},{{\bf{h}}_{k}}}\right)$ , the probability of ${{\bf{v}}_{k}}$ is given by

[TABLE]

where ${Z_{k}}=\sum\limits_{{{\bf{v}}_{k}},{{\bf{h}}_{k}}}{{e^{E\left({{{\bf{v}}_{k}},{{\bf{h}}_{k}}}\right)}}}$ is the normalization factor.

During the training phrase, the goal of the RBMk is to maximize the log-likelihood function as follows

[TABLE]

To solve equation (6), the gradient descent method is used to iteratively calculate the variables ${\bf{W}}_{k}$ , ${\bf{a}}_{k}$ , and ${\bf{b}}_{k}$ , where the corresponding partial derivative with respect to $\mathbf{W}_{k}$ , $\mathbf{a}_{k}$ , and $\mathbf{b}_{k}$ can be written as

[TABLE]

According to [32], the conditional probability $p\left({h_{k,\beta}=1|{{\bf{v}}_{k}}}\right)$ and $p\left(v_{k,\alpha}=1\left|{{{\bf{h}}_{k}}}\right.\right)$ are respectively given by

[TABLE]

where ${\rm{sigmoid}}\left(x\right)\buildrel\Delta\over{=}\frac{1}{{1+{e^{-x}}}}$ , $\alpha\in\mathcal{M}_{k}$ , $\beta\in\mathcal{N}_{k}$ , $h_{k,\beta}$ , and $v_{k,\alpha}\in\left[{0,1}\right]$ .

Then, the variables ${\bf{W}}_{k}$ , ${\bf{a}}_{k}$ , and ${\bf{b}}_{k}$ are updated by the following equations[33]

[TABLE]

where $\eta>0$ is the learning rate.

By employing the gradient descent method, RBM1 is trained first, where ${{\bf{v}}_{1}}={\bf{\hat{y}}}_{l}$ and $l\in\mathcal{L}_{1}$ . Then, let ${{\bf{v}}_{2}}={{\bf{h}}_{1}}$ , and RBM2 is trained. Similarly, after training RBM2, let ${{\bf{v}}_{3}}={{\bf{h}}_{2}}$ , and RBM3 is trained. Moreover, when RBM3 is trained, the parameters of DBN can be obtained, i.e., ${\left\{{{{\bf{W}}_{k}},{{\bf{a}}_{k}},{{\bf{b}}_{k}}}\right\}_{k\in\{1,2,3\}}}$ . Then, the parameters ${\left\{{{{\bf{W}}_{k}},{{\bf{a}}_{k}},{{\bf{b}}_{k}}}\right\}_{k\in\{1,2,3\}}}$ are further fine-tuned by the supervised back propagation (BP) algorithm [34].

After DBN is trained, it outputs the extracted feature ${{{\bf{\bar{y}}}}_{l_{1}}}={{\bf{h}}_{3}}$ , where $l_{1}\in\mathcal{L}_{1}$ . Let ${\bf{\bar{Y}}}={\left[{{{{\bf{\bar{y}}}}_{1}},{{{\bf{\bar{y}}}}_{2}},\ldots,{{{\bf{\bar{y}}}}_{{L_{1}}}}}\right]^{T}}$ denote the output feature set.

III-2 OVO-SVM

With the extracted feature set ${\bf{\bar{Y}}}$ , the one-versus-one (OVO)-SVM is adopted for further classification, which achieves multiclassification by solving the two-classification subproblems [35, 36]. As shown in Fig. 2, OVO-SVM exploits ${\bar{M}}$ nonlinear two-class SVMs, i.e., SVM0,…,SVM ${}_{\bar{M}}$ , to classify $M$ categories for $M$ -QAM modulation, where $\bar{M}\buildrel\Delta\over{=}\frac{{M\left({M-1}\right)}}{2}-1$ .

To map pedestrian features to a high dimensional space, a Gaussian kernel is introduced, which can be expressed as

[TABLE]

where $\sigma_{q}>0$ is the bandwidth of the Gaussian kernel and $q\in\mathcal{Q}$ .

According to the nonlinear SVM theory [37], the nonlinear two-class SVMq problem can be formulated as

[TABLE]

where ${{\bf{c}}_{q}}={\left[{{c_{q,1}},{c_{q,2}},\ldots,{c_{q,{L_{1}}}}}\right]^{T}}$ and $q\in\mathcal{Q}$ .

By solving linear programming (11), the optimal solution ${\bf{c}}_{q}^{*}={\left[{c_{q,1}^{*},c_{q,2}^{*},\ldots,c_{q,{L_{1}}}^{*}}\right]^{T}}$ is obtained. Then, the nonlinear two-class SVMq decision function ${f_{q}}\left({{{{\bf{\bar{y}}}}_{l_{1}}}}\right)$ , with $l_{1}\in\mathcal{L}_{1}$ , $q\in\mathcal{Q}$ , is given as

[TABLE]

where $b_{q}^{*}\buildrel\Delta\over{=}{z_{l_{1}}}-\sum\limits_{i=1}^{{L_{1}}}{c_{q,{l_{1}}}^{*}}{z_{i}}\exp\left({-\frac{{{{\left\|{{{{\bf{\bar{y}}}}_{i}}-{{{\bf{\bar{y}}}}_{l_{1}}}}\right\|}^{2}}}}{{2\sigma_{q}^{2}}}}\right)$ is a biased variable[38], and

[TABLE]

Let $\tau_{q,\bar{m}}$ and $\tau_{q,m}$ denote the output of the SVMq, and $\tau_{q,\bar{m}}$ and $\tau_{q,m}$ are the inputs of the Adder ${}_{\bar{m}}$ and the Adderm, respectively, where ${\tau_{q,\bar{m}}},{\tau_{q,m}}\in\left\{{0,1}\right\}$ , ${\tau_{q,\bar{m}}}+{\tau_{q,m}}=1$ , ${\bar{m}}$ , $m\in\mathcal{M}$ , and ${\bar{m}}\neq m$ . Then, for Adderm, the number of votes is updated by ${u_{m}}={u_{m}}+{\tau_{q,m}}$ , where the initial value of ${u_{m}}$ is [math], and $m\in\mathcal{M}$ .

Then, with the number of votes $\left\{{{u_{m}}}\right\}_{m=0}^{M-1}$ , the output label $\hat{z}$ is obtained as follows

[TABLE]

Finally, $\hat{z}$ is mapped to the demodulation result $\hat{s}$ .

After the entire network is trained, the parameters ${\bf{W}}_{k}$ , ${\bf{a}}_{k}$ , and ${\bf{b}}_{k}$ of the DBN, and ${\bf{c}}_{q},b_{q}^{*},\sigma_{q}$ of the OVO-SVM are optimized, where $k\in\{1,2,3\}$ and $q\in\mathcal{Q}$ . Then, the test signal ${\rm{{\cal T}_{2}}}=\left\{{\left({{{{\bf{\hat{y}}}}_{L_{1}+1}},{z_{L_{1}+1}}}\right),\left({{{{\bf{\hat{y}}}}_{L_{1}+2}},{z_{L_{1}+2}}}\right),\ldots,\left({{{{\bf{\hat{y}}}}_{{L}}},{z_{{L}}}}\right)}\right\}$ is converted to the feature vector ${{\bf{\bar{y}}}}$ , where ${L_{2}}$ is the number of test signal periods. The details of the DBN-SVM based demodulator are listed in Algorithm $1$ .

IV AdaBoost Based Demodulator

AdaBoost is a general method used to improve machine learning algorithms [39], which integrates multiple independent weakly classifiers into a stronger classifier. In this section, we exploit the $k$ -Nearest Neighbor (KNN) classifiers as the weak classifier for constructing the AdaBoost.

As shown in Fig. 3, the proposed AdaBoost consists $D$ KNN classifiers. The labeled training signal set is denoted by ${\rm{{\cal T}}}=\left\{{\left({{{{\bf{\hat{y}}}}_{1}},{z_{1}}}\right),\left({{{{\bf{\hat{y}}}}_{2}},{z_{2}}}\right),\ldots,\left({{{{\bf{\hat{y}}}}_{{L_{1}}}},{z_{{L_{1}}}}}\right)}\right\}$ .

Let ${{\bf{w}}_{d}}={\left[{{w_{d}}\left(1\right),{w_{d}}\left(2\right),\ldots,{w_{d}}\left(L_{1}\right)}\right]^{T}}$ denote the weight vector of $d$ th KNN, where $0\leq{w_{d}}\left(l\right)\leq 1$ , $l\in\mathcal{L}_{1}$ , and $\sum\limits_{l=1}^{{L_{1}}}{{w_{d}}\left(l\right)}=1$ , $d\in\mathcal{D}$ . For the $1$ st KNN, ${w_{1}}\left(l\right)=\frac{1}{L_{1}}$ , $l\in\mathcal{L}_{1}$ . Based on the weight vector ${\bf{w}}_{d}$ , the $d$ th KNN re-samples the training set ${\rm{{\cal T}}}$ and generates a new training set ${{{\rm{{\cal T}}}_{d}}}=\left\{{\left({{{{\bf{\tilde{y}}}}_{d_{1}}},{z_{d_{1}}}}\right),\left({{{{\bf{\tilde{y}}}}_{d_{2}}},{z_{d_{2}}}}\right),\ldots,\left({{{{\bf{\tilde{y}}}}_{{d_{L1}}}},{z_{{d_{L1}}}}}\right)}\right\}$ , $d_{l}\in\mathcal{L}_{1}$ .

Then, a vector in ${{{\rm{{\cal T}}}_{d}}}$ is searched with the minimum distance from ${{{{\bf{\hat{y}}}}_{l}}}$ , i.e.,

[TABLE]

Because the label of ${{\bf{\tilde{y}}}_{{d_{l}^{*}}}}$ is ${z_{d_{l}^{*}}}$ , ${f_{d}}\left({{{\bf{\hat{y}}}_{l}}}\right)={{z_{d_{l}^{*}}}}$ , which implies that the classification result of $d$ th KNN for ${{{{\bf{\hat{y}}}}_{l}}}$ is ${z_{d_{l}^{*}}}$ .

Let ${\chi_{d}}$ denote the weight sum of misclassified samples of $d$ th KNN as follows

[TABLE]

where $I\left({x,y}\right)$ is the indicator function, i.e.,

[TABLE]

Then, for $(d+1)$ th KNN, weight ${{\bf{w}}_{(d+1)}}={\left[{{w_{(d+1)}}\left(1\right),\ldots,{w_{(d+1)}}\left(L_{1}\right)}\right]^{T}}$ is updated as

[TABLE]

where ${\alpha_{d}}=\frac{1}{2}\ln\left({\frac{{1-{\chi_{d}}}}{{{\chi_{d}}}}}\right)$ , and ${Q_{d}}=\sum\limits_{{l}=1}^{{L_{1}}}{{w_{d}}\left({l}\right)\exp\left({-{\alpha_{d}}I\left({{f_{d}}\left({{{{\bf{\hat{y}}}}_{l}}}\right),{z_{l}}}\right)}\right)}$ is the normalization factor. If ${{{\bf{\hat{y}}}}_{l}}$ is classified correctly, i.e., $I\left({{f_{d}}\left({{{{\bf{\hat{y}}}}_{l}}}\right),{z_{l}}}\right)=0$ , ${w_{d+1}}\left({l}\right)=\frac{{{w_{d}}\left({l}\right)}}{{{Q_{d}}}}$ . Otherwise, ${w_{d+1}}\left({l}\right)=\frac{{{w_{d}}\left({l}\right)\exp\left({-{\alpha_{d}}}\right)}}{{{Q_{d}}}}$ .

After training $D$ KNNs, AdaBoost classifies ${{{{\bf{\hat{y}}}}_{l}}}$ as follows

[TABLE]

where ${{\alpha_{d}}}$ is the coefficient of $\left({1-I\left({{f_{d}}\left({{{{\bf{\hat{y}}}}_{l}}}\right),z_{l}}\right)}\right)$ and $I\left({{f_{d}}\left({{{{\bf{\hat{y}}}}_{l}}}\right),z_{l}}\right)$ can be regarded as the voting value, i.e., if $I\left({{f_{d}}\left({{{{\bf{\hat{y}}}}_{l}}}\right),z_{l}}\right)=0$ , ${{f_{d}}\left({{{{\bf{\hat{y}}}}}}\right)}$ classifies signal ${{{{\bf{\hat{y}}}}_{l}}}$ into class $z_{l}$ , otherwise, ${{{{\bf{\hat{y}}}}_{l}}}$ does not belong to class $z_{l}$ . The class with the maximum sum of weighted voting value, ${{\alpha_{d}}\left({1-I\left({{f_{d}}\left({{{{\bf{\hat{y}}}}_{l}}}\right),z_{l}}\right)}\right)}$ , for all classifiers, is identified as the classification result ${\hat{z}_{{l}}}$ of the Adaboost classifier, and then ${\hat{z}_{{l}}}$ is mapped to demodulation result ${\hat{s}_{{l}}}$ . The details of the KNN-based AdaBoost demodulator are listed in Algorithm $2$ .

V Experimental Results and Discussions

In this section, the performance of the proposed DBN-SVM based demodulator and AdaBoost based demodulator is investigated. Also the performance of the the DBN based, SVM based, and maximum likelihood (MLD) based demodulation methods are presented for comparison.

V-A The end-to-end wireless communication system prototype

As shown in Fig. 4, an end-to-end wireless communication system prototype is first established to collect the dataset, which consists of a source, a RF vector signal generator, a transmitter antenna, a receiver antenna, and a vector signal analyzer. The parameters of the devices of the proposed end-to-end wireless communication system prototype are listed in Table I.

The volume environment is a $15\times 5\times 3$ $\left({{{\rm{m}}^{3}}}\right)$ office, where $15$ , $5$ , and $3$ denote the length, width, and height, respectively. Note that the distance between the transmitter and the receiver is approximately $10$ meters. The power of the background noise is $78$ dBm.

The carrier frequency ${f_{c}}$ and the sampling rate ${f_{s}}$ are $2.4$ GHz and $100$ MHz/s, respectively. For each $M$ -QAM modulation scheme, the number of sample points $N$ has four cases, i.e., $N={10,20,40}$ , and $80$ .

To reduce the generalization error, the collected data set contains $10000$ transmit signal periods, in which $8000$ periods are used for training and $2000$ periods are used for testing.

V-B Experimental Results

DBN-SVM based and AdaBoost based demodulators are trained on these training sets. The DBN-SVM based demodulator training ends after $110$ epochs, after which the training loss almost does not decline, and the AdaBoost based demodulator training ends when the iteration error is less than $10^{-3}$ . In the experiment, signal sets with different SNRs, ranging from $3$ to $25$ dB, are chosen as the validation sets; the DBN based, SVM based, and MLD based demodulation methods are used for comparison.

In Fig. 5 and Fig. 6, the demodulation performance versus SNR of the proposed demodulator and the three baseline schemes are compared by the demodulation of $4$ -QAM and $16$ -QAM, respectively. The demodulation accuracy of the models increases as SNR increases. In particular, Fig. 5 indicates that the demodulation accuracy of all methods are close to $100\%$ when SNR $\geq 15$ dB, and the proposed AdaBoost based demodulator is significantly superior to the other models when SNR $\leq 13$ dB. Besides, the proposed DBN-SVM based demodulator has better performance than the DBN-based and SVM-based demodulation methods. In Fig. 6, compared with Fig. 5, we focus on the same performance index at $16$ -QAM. It shows the designed AdaBoost based demodulator is close to $100\%$ when SNR $\geq 15$ dB. However, other methods cannot approach $100\%$ as SNR increases. Furthermore, among these demodulation methods, the AdaBoost based demodulator obviously outperform the other four methods. It can be observed that the demodulation accuracy achieved by DBN-SVM based demodulator exceeds ones by the DBN-based, SVM-based demodulation methods. Although the overall trend of MLD classification accuracy increases as SNR increases, it has a obvious fluctuation. The reason is that the practical wireless channels include complicaful interferences, but the robustness of MLD is poor.

In Fig. 7, the accuracy performance for different sampling points at $16$ -QAM is simulated. It can be observed that the demodulation accuracy increases with the number of sample points. Furthermore, the demodulation accuracy can approximately achieve $100\%$ with $N=40$ or $N=80$ when SNR $\geq 15$ dB. However, with an increase in the number of sample points, the computational complexity also increases.

Fig. 8 shows the demodulation accuracy achieved by the AdaBoost based demodulator versus the number of training signal periods, where the number of sampling points is $40$ and SNR = $12$ dB. The result shows that the demodulation accuracy initially increases with an increase in the number of training signal periods, and then, it reaches saturation when the number of training signal periods is $5000$ . It can be observed that, compared with $32$ -QAM, $16$ -QAM can achieve higher accuracy. Meanwhile, $16$ -QAM can provide stable performance with relatively fewer training signal periods. Different modulation models have different requirements with different number of training signals periods. In general, higher orders require longer training signals periods.

Fig. 9 presents the demodulation accuracies of BPSK and $M$ -QAM modulation schemes. In this experiment, the AdaBoost based demodulation algorithm was employed, where the number of sampling points is $N=40$ . The demodulation accuracy for all modulation schemes increases with SNR. Meanwhile, the accuracy achieved by the BPSK-modulation scheme is better than the other seven schemes for the same SNR. Furthermore, Fig. 9 also indicates that the demodulation accuracy reduces with an increase of the modulation order.

In Fig. 10, the same modulation schemes, demodulation algorithm, and sampling points are used as in Fig. 9, where the effective capacity of different modulation methods versus SNR are reported. The effective capacity by BPSK, $4$ -QAM, and $8$ -QAM almost remain unchanged with an increase in SNR. It is found that the modulation order has a considerable positive impact on the performance of the transmission capacity. The performance gap between the low order and the high order modulation is clearer when SNR $\leq 15$ dB. However, the demodulation accuracy of high order modulation is low, so there is a trade-off between the demodulation accuracy and the effective capacity.

VI Conclusion

In this paper, a flexible end-to-end wireless communications prototype platform was proposed for real physical environments. Then, the first open measured modulation data dataset with eight modulation schemes, i.e., BPSK, $4$ -QAM, $8$ -QAM, $16$ -QAM, $32$ -QAM, $64$ -QAM, $128$ -QAM, and $256$ -QAM, were established and accessed online. Furthermore, two DL-based demodulators, i.e., DBN-SVM based demodulator and AdaBoost based demodulator, were proposed. Based on the real dataset, the demodulation performance of the proposed demodulators were tested. Finally, experimental results indicated that the proposed demodulators outperform the DBN based, SVM based, and MLD based demodulators at various scenarios.

Bibliography39

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] J. G. Proakis, Digital communications , Mc Graw-Hill, New York, 2001.
2[2] M. Shafi, A. F. Molisch, P. J. Smith, T. Haustein, P. Zhu, P. D. Silva, F. Tufvesson, A. Benjebbour, and G. Wunder, “5G : A Tutorial Overview of Standards, Trials, Challenges, Deployment, and Practice,” IEEE J. Select. Areas Commun. , vol. 35, no. 6, pp. 1201–1221, Jun. 2017.
3[3] Y. Lecun, Y. Bengio, and G. Hinton, “Deep learning.,” Nature , vol. 521, no. 7553, pp. 436, May 2015.
4[4] C. Liu, Y. Cao, Y. Luo, G. Chen, V. Vokkarane, and Y. Ma, Deep Food: Deep Learning-Based Food Image Recognition for Computer-Aided Dietary Assessment , Springer, 2016.
5[5] T. Zhou, S. Yang, L. Wang, J. Yao, and G. Gui, “Improved cross-label suppression dictionary learning for face recognition,” IEEE Access , vol. 6, pp. 48716–48725, Aug. 2018.
6[6] D. Geronimo, J. Serrat, A. M. Lopez, and R. Baldrich, “Traffic sign recognition for computer vision project-based learning,” IEEE Trans. Educ. , vol. 56, no. 3, pp. 364–371, Aug. 2013.
7[7] T. Tan, Y. Qian, and K. Yu, “Cluster adaptive training for deep neural network based acoustic model,” IEEE Trans. Audio Speech Lang. Proces. , vol. 24, no. 3, pp. 459–468, Mar. 2015.
8[8] Y. Ling, C. Jin, G. Ding, Y. Tu, and J. Sun, “Spectrum prediction based on Taguchi method in deep learning with long short-term memory,” IEEE Access , vol. 6, pp. 45923–45933, Dec. 2018.