Deep Learning for Signal Demodulation in Physical Layer Wireless Communications: Prototype Platform, Open Dataset, and Analytics
Hongmei Wang, Zhenzhen Wu, Shuai Ma, Songtao Lu, Han Zhang, Guoru, Ding, and Shiyin Li

TL;DR
This paper introduces a new open dataset and prototype platform for wireless signal demodulation, proposing two deep learning-based demodulators that outperform traditional methods in real-world experiments.
Contribution
The paper presents the first open dataset of real wireless signals and develops two novel DL-based demodulators combining DBN-SVM and AdaBoost techniques.
Findings
DBN-SVM demodulator outperforms traditional classifiers
AdaBoost demodulator achieves higher accuracy with KNN weak classifiers
Proposed methods surpass existing single-classifier approaches
Abstract
In this paper, we investigate deep learning (DL)-enabled signal demodulation methods and establish the first open dataset of real modulated signals for wireless communication systems. Specifically, we propose a flexible communication prototype platform for measuring real modulation dataset. Then, based on the measured dataset, two DL-based demodulators, called deep belief network (DBN)-support vector machine (SVM) demodulator and adaptive boosting (AdaBoost) based demodulator, are proposed. The proposed DBN-SVM based demodulator exploits the advantages of both DBN and SVM, i.e., the advantage of DBN as a feature extractor and SVM as a feature classifier. In DBN-SVM based demodulator, the received signals are normalized before being fed to the DBN network. Furthermore, an AdaBoost based demodulator is developed, which employs the -Nearest Neighbor (KNN) as a weak classifier to form a…
| Experiment setup | Type and parameters |
|---|---|
| EXG RF vector signal generator | Keysight N5172B |
| MXA vector signal analyzer | Keysight N9020B |
| Antenna Gain | 24 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSupport Vector Machine
Deep Learning for Signal Demodulation in Physical Layer Wireless Communications: Prototype Platform, Open Dataset, and Analytics
Hongmei Wang, Zhenzhen Wu, Shuai Ma, Songtao Lu, Han Zhang, Guoru Ding, and Shiyin Li Manuscript received January 22, 2019. Corresponding author: Shuai Ma.)H. Wang, Z. Wu, S. Ma, and S. Li are with Information and Control Engineering, China University of Mining and Technology, Xuzhou, China 221116 (Email:[email protected], [email protected], [email protected], [email protected]).S. Ma is also with the State Key Laboratory of Integrated Services Networks, Xidian University, Xi’an 710071, China.S. Lu is with the Department of Electrical and Computer Engineering, University of Minnesota, Minneapolis, MN 55455, USA (e-mail: [email protected]).H. Zhang is with the Department of Electrical and Computer Engineering, University of California, Davis, CA 95616, USA (e-mail: [email protected]).G. Ding is with the College of Communications Engineering, Army Engineering University, Nanjing 210007, China (e-mail: [email protected])The work of S. Ma and S. Li were supported by the Fundamental Research Funds for the Central Universities under Grant 2017QNA32,by the National Natural Science Foundation of China under Grant 61701501, Grant 61771474; by the Natural Science Foundation of Jiangsu Province under Grant BK20170287; by China Postdoctoral Science Foundation under Grant 2016M600452; by the State Key Laboratory of Integrated Services Networks (Xidian University) under grant ISN19-07; and by the Key Laboratory of Cognitive Radio and Information Processing , Ministry of Education (Guilin University of Electronic Technology) under grant CRKL180204, by Key Laboratory of Ocean Observation-Imaging Testbed of Zhejiang Province. The work of H. Wang was supported by the National Natural Science Foundation of China under Grant 61601464.
Abstract
In this paper, we investigate deep learning (DL)-enabled signal demodulation methods and establish the first open dataset of real modulated signals for wireless communication systems. Specifically, we propose a flexible communication prototype platform for measuring real modulation dataset. Then, based on the measured dataset, two DL-based demodulators, called deep belief network (DBN)-support vector machine (SVM) demodulator and adaptive boosting (AdaBoost) based demodulator, are proposed. The proposed DBN-SVM based demodulator exploits the advantages of both DBN and SVM, i.e., the advantage of DBN as a feature extractor and SVM as a feature classifier. In DBN-SVM based demodulator, the received signals are normalized before being fed to the DBN network. Furthermore, an AdaBoost based demodulator is developed, which employs the -Nearest Neighbor (KNN) as a weak classifier to form a strong combined classifier. Finally, experimental results indicate that the proposed DBN-SVM based demodulator and AdaBoost based demodulator are superior to the single classification method using DBN, SVM, and maximum likelihood (MLD) based demodulator.
Index Terms:
Machine learning, DBN-SVM based demodulator, AdaBoost based demodulator.
I Introduction
Conventional wireless communication systems are generally designed in accordance with the rigorous mathematical theories and accurate system models [1]. However, because of increasing wireless service requirements, such as the use of smartphones, virtual reality, and internet of things (IoT), it is challenging to characterize future complex wireless communication networks accurately by using tractable mathematical models or system models [2]. Recently, deep learning (DL) [3], as an effective method to handle complex problems, has attracted increasing attention from both academia and industry. DL has been applied in image recognition [4, 5], computer vision [6], natural language processing [7] and spectrum prediction [8], etc. In addition, some literatures have focused on using DL to optimize performance of wireless communication systems [9, 10, 11]. In [9], an unsupervised learning-based fast beamforming method is proposed to maximize the weighted sum rate under the total power constraint. In [10], a deep recurrent neural network based algorithm is proposed to tackle energy efficient resource allocation problem for heterogeneous IoT. In [11], a three dimensional message-passing algorithm based on deep learning scheme is proposed to minimize the weighted sum of the secondary interference power for cognitive radio networks. Recent works [12, 13] have interpreted an end-to-end wireless communication system as an auto-encoder. This is promising for applications of DL to wireless communications.
Demodulation is one of the fundamental modules for wireless communications systems for high-speed transmission with a low bit error rate. Theoretically, optimum demodulators of conventional wireless communication systems are designed for additive white Gaussian noise (AWGN) channels [1]. Moreover, both channel state information (CSI) and channel noise distribution are usually required. Most previous studies [14, 15, 16, 17, 18, 19] have assumed that each receiver can accurately estimate the fading coefficients. However, practical wireless communication channels may suffer from multi-path fading, impulse noise, spurious or continuous jamming, and numerous other complex impairments, which deteriorate demodulation performance significantly. Because of the limited length of the training sequence, the estimate CSI will have limited accuracy [20]. Especially, for fast-fading scenarios, it is difficult to accurately estimate CSI because the fading coefficients change rapidly during the data transmission period. Designing optimum demodulators for different channel models is challenging because the channel model may not be known at the receiver end.
Given the above issues, DL-based model-free demodulators have attracted a considerable amount of attention, where the requirements for a priori knowledge can be widely relaxed or even removed [21]. Because the information of the modulated signals is represented by the amplitude and phase, feature extraction is of critical importance for signal demodulation.
DL-based demodulators have been investigated in conventional radio frequency (RF) systems. In [22], a deep convolutional network demodulator (DCND) is proposed to demodulate mixed modulated signals, which can further reduce the bit error rate compared with the coherent demodulation method. In [23], the authors show that the proposed demodulator based on deep belief network (DBN) is feasible for an AWGN channel with a certain channel impulse response and a Rayleigh non-frequency-selective flat fading channel. In [24], a DL-based detector is proposed for signal demodulation in short-range multi-channels without a signal equalizer. In [25], the authors show that deep convolutional neural networks (DCNN) for frequency-shift keying (FSK) demodulation can substantially reduce error bit probabilities over an AWGN Rayleigh-fading channel. To the best of our knowledge, most of existing DL-based demodulation schemes are based on simulated data rather than real measured data.
This paper presents a data-driven framework for DL-based demodulators. Specifically, two data-driven demodulation methods based on DBN-support vector machine (SVM) and adaptive boosting (AdaBoost) [26] are developed for end-to-end wireless communication systems. These methods learn and extract features from the received modulation signals without any prior knowledge of the channel model. Moreover, the performance of the two data-driven demodulators are evaluated on different modulation schemes through real measured data. The main contributions of this paper are as follows:
- •
A flexible end-to-end wireless communication prototype platform is developed for application in real physical environments, which can generate real signals. The prototype is used to establish measured modulation datasets from real communication systems in actual physical environments in eight modulation schemes, i.e., binary phase shift keying (BPSK) and multiple quadrature amplitude modulation (-QAM) modulation, where and . The received SNR of the eight modulated signals are measured from dB to dB. An open online real modulated dataset is established, available at https://pan.baidu.com/s/1biDooH6E81Toxa2u4D3p2g or https://drive.google.com/open?id=1jXO9OMZOyVMOYv QSn3WVmlfQoQbonKuo , where the transmission distance of the eight modulated signals is measured in an indoor environment. To the best of our knowledge, this is the first open dataset of real modulated signals for wireless communication systems.
- •
Then, based on the measured data, two DL-based demodulators are proposed, namely, DBN-SVM based demodulator and AdaBoost based demodulator. The proposed DBN-SVM based demodulator, which has a novel demodulation architecture, exploits the advantages of both DBN and SVM, i.e., the advantage of DBN as a feature extractor and SVM as a feature classifier. To accelerate the convergence rate, the received signals are first normalized before being fed to the DBN network so that the features of the received signals can be extracted, the SVM is utilized to classify these features.
- •
An AdaBoost based demodulator, which utilizes multiple KNNs as a weak classifier to form a strong combined classifier, is developed. The proposed AdaBoost based demodulator increases the weights for the error demodulated symbols and decreases the corresponding weights for correctly demodulated symbols during the iterations.
- •
Finally, the demodulation performance of the two proposed data-driven demodulators are investigated. Specifically, the demodulation accuracies of the two DL-based demodulators decrease over the respective transmission and modulation orders for a fixed transmission distance. The experimental results also show that the demodulation accuracy of the DBN-SVM based demodulator is higher than those of DBN-based and SVM-based demodulators. Moreover, the demodulation accuracy of the AdaBoost based demodulator is higher than that of the DBN-SVM based demodulator at the lower SNR regions, and the accuracies of the two demodulators are similar at high SNRs. For the high SNR scenario, a high-order modulation is generally preferred.
The remainder of this paper is organized as follows. Section II describes the system model. Section III explores the structures of the DBN-SVM and AdaBoost, including detailed descriptions of the data stream and how to make classification decisions. In section IV, the data analysis results are provided and analyzed. Finally, the conclusions from the study are drawn in Section V.
Notations: Boldfaced lowercase and uppercase letters represent vectors and matrices, respectively. The transpose of a matrix is denoted as . , , , , , , and .
II System Model
An end-to-end wireless communication system111The term end-to-end wireless system model implies that signal features are learned from a single deep neural network, without the complex multi-stage expert machine learning processing[12, 13, 27, 28, 29]. is considered, which includes a single antenna transmitter and a single antenna receiver, as illustrated in Fig. 1. By adopting the BPSK or -QAM digital modulation schemes, the transmitted signal is given as
[TABLE]
where , and denote the amplitude, phase, and period of the signal , respectively; is the carrier frequency.
Let denote the multipath channel between the transmitter and the receiver, which may suffer nonlinear distortion, interference, and frequency selective fading. At the receiver, the received signal is given by
[TABLE]
where denotes the received noise.
Then, the received analog signal is converted to the digital signal via the vector signal analyzer. Let denote the total sampled digital signal vector, where is the th sample, is the number of samples of one period, and denotes the number of signal periods.
Before the demodulation process, the received signal is normalized to , which can accelerate the DL network processing speed[30]. Senerally, the normalized data is given by
[TABLE]
where , and .
Because the information of the BPSK and -QAM are represented by amplitudes and phases, DL is used to extract information features from the received signals. Specifically, with the sampled signal vector , two DL-based demodulators are proposed: DBN-SVM based demodulator and AdaBoost based demodulator. The DL-based demodulators consist of two phases: training phase and testing phase. During the training phase, the parameters of the DL-based demodulators are optimized with the training dataset. Then, in the testing phase, the demodulators demodulate the received signal and recover the transmitted information.
Let denote the label signal of the th period, where and is the label set, i.e., , which is determined by the modulation scheme. Let denote the labeled training signal set, where denotes the normalized signal of the th period, and denotes the total number of training signal periods .
III DBN-SVM based Demodulator
As an unsupervised features extraction method, the DBN can efficiently extract high-level and hierarchical features from the measured signal, while the SVM minimizes the structure risk and shows good learning and generalization performance with a small amount of samples. Inspired by those advantages of the two approaches, a combination of DBN and SVM for demodulation is proposed. The DBN-SVM demodulator is shown in Fig. 2, the DBN is used as a feature generator and the SVM is used as a classifier.
III-1 DBN
The proposed DBN includes three stacked restricted Boltzmann machines (RBM)[31], i.e., RBM1, RBM2, and RBM3, as shown in Fig. 2. Specifically, RBMk is an undirected, bipartite graphical model, and it composes a visible layer and a hidden layer , where and are the th neuron of and the th neuron of , respectively, , , . The visible layer and hidden layer are fully connected via a symmetric undirected weighted matrix , where is a weight vector between and . For the three RBM, there is no intralayer connections between either the visible layer or the hidden layer.
For RBMk, the energy is defined by combining the configuration of both and as follows
[TABLE]
where is an offset vector of , and is an offset vector of .
Based on , the probability of is given by
[TABLE]
where is the normalization factor.
During the training phrase, the goal of the RBMk is to maximize the log-likelihood function as follows
[TABLE]
To solve equation (6), the gradient descent method is used to iteratively calculate the variables , , and , where the corresponding partial derivative with respect to , , and can be written as
[TABLE]
According to [32], the conditional probability and are respectively given by
[TABLE]
where , , , , and .
Then, the variables , , and are updated by the following equations[33]
[TABLE]
where is the learning rate.
By employing the gradient descent method, RBM1 is trained first, where and . Then, let , and RBM2 is trained. Similarly, after training RBM2, let , and RBM3 is trained. Moreover, when RBM3 is trained, the parameters of DBN can be obtained, i.e., . Then, the parameters are further fine-tuned by the supervised back propagation (BP) algorithm [34].
After DBN is trained, it outputs the extracted feature , where . Let denote the output feature set.
III-2 OVO-SVM
With the extracted feature set , the one-versus-one (OVO)-SVM is adopted for further classification, which achieves multiclassification by solving the two-classification subproblems [35, 36]. As shown in Fig. 2, OVO-SVM exploits nonlinear two-class SVMs, i.e., SVM0,…,SVM, to classify categories for -QAM modulation, where .
To map pedestrian features to a high dimensional space, a Gaussian kernel is introduced, which can be expressed as
[TABLE]
where is the bandwidth of the Gaussian kernel and .
According to the nonlinear SVM theory [37], the nonlinear two-class SVMq problem can be formulated as
[TABLE]
where and .
By solving linear programming (11), the optimal solution is obtained. Then, the nonlinear two-class SVMq decision function , with , , is given as
[TABLE]
where is a biased variable[38], and
[TABLE]
Let and denote the output of the SVMq, and and are the inputs of the Adder and the Adderm, respectively, where , , , , and . Then, for Adderm, the number of votes is updated by , where the initial value of is [math], and .
Then, with the number of votes , the output label is obtained as follows
[TABLE]
Finally, is mapped to the demodulation result .
After the entire network is trained, the parameters , , and of the DBN, and of the OVO-SVM are optimized, where and . Then, the test signal is converted to the feature vector , where is the number of test signal periods. The details of the DBN-SVM based demodulator are listed in Algorithm .
IV AdaBoost Based Demodulator
AdaBoost is a general method used to improve machine learning algorithms [39], which integrates multiple independent weakly classifiers into a stronger classifier. In this section, we exploit the -Nearest Neighbor (KNN) classifiers as the weak classifier for constructing the AdaBoost.
As shown in Fig. 3, the proposed AdaBoost consists KNN classifiers. The labeled training signal set is denoted by .
Let denote the weight vector of th KNN, where , , and , . For the st KNN, , . Based on the weight vector , the th KNN re-samples the training set and generates a new training set , .
Then, a vector in is searched with the minimum distance from , i.e.,
[TABLE]
Because the label of is , , which implies that the classification result of th KNN for is .
Let denote the weight sum of misclassified samples of th KNN as follows
[TABLE]
where is the indicator function, i.e.,
[TABLE]
Then, for th KNN, weight is updated as
[TABLE]
where , and is the normalization factor. If is classified correctly, i.e., , . Otherwise, .
After training KNNs, AdaBoost classifies as follows
[TABLE]
where is the coefficient of and can be regarded as the voting value, i.e., if , classifies signal into class , otherwise, does not belong to class . The class with the maximum sum of weighted voting value, , for all classifiers, is identified as the classification result of the Adaboost classifier, and then is mapped to demodulation result . The details of the KNN-based AdaBoost demodulator are listed in Algorithm .
V Experimental Results and Discussions
In this section, the performance of the proposed DBN-SVM based demodulator and AdaBoost based demodulator is investigated. Also the performance of the the DBN based, SVM based, and maximum likelihood (MLD) based demodulation methods are presented for comparison.
V-A The end-to-end wireless communication system prototype
As shown in Fig. 4, an end-to-end wireless communication system prototype is first established to collect the dataset, which consists of a source, a RF vector signal generator, a transmitter antenna, a receiver antenna, and a vector signal analyzer. The parameters of the devices of the proposed end-to-end wireless communication system prototype are listed in Table I.
The volume environment is a office, where , , and denote the length, width, and height, respectively. Note that the distance between the transmitter and the receiver is approximately meters. The power of the background noise is dBm.
The carrier frequency and the sampling rate are GHz and MHz/s, respectively. For each -QAM modulation scheme, the number of sample points has four cases, i.e., , and .
To reduce the generalization error, the collected data set contains transmit signal periods, in which periods are used for training and periods are used for testing.
V-B Experimental Results
DBN-SVM based and AdaBoost based demodulators are trained on these training sets. The DBN-SVM based demodulator training ends after epochs, after which the training loss almost does not decline, and the AdaBoost based demodulator training ends when the iteration error is less than . In the experiment, signal sets with different SNRs, ranging from to dB, are chosen as the validation sets; the DBN based, SVM based, and MLD based demodulation methods are used for comparison.
In Fig. 5 and Fig. 6, the demodulation performance versus SNR of the proposed demodulator and the three baseline schemes are compared by the demodulation of -QAM and -QAM, respectively. The demodulation accuracy of the models increases as SNR increases. In particular, Fig. 5 indicates that the demodulation accuracy of all methods are close to when SNR dB, and the proposed AdaBoost based demodulator is significantly superior to the other models when SNR dB. Besides, the proposed DBN-SVM based demodulator has better performance than the DBN-based and SVM-based demodulation methods. In Fig. 6, compared with Fig. 5, we focus on the same performance index at -QAM. It shows the designed AdaBoost based demodulator is close to when SNR dB. However, other methods cannot approach as SNR increases. Furthermore, among these demodulation methods, the AdaBoost based demodulator obviously outperform the other four methods. It can be observed that the demodulation accuracy achieved by DBN-SVM based demodulator exceeds ones by the DBN-based, SVM-based demodulation methods. Although the overall trend of MLD classification accuracy increases as SNR increases, it has a obvious fluctuation. The reason is that the practical wireless channels include complicaful interferences, but the robustness of MLD is poor.
In Fig. 7, the accuracy performance for different sampling points at -QAM is simulated. It can be observed that the demodulation accuracy increases with the number of sample points. Furthermore, the demodulation accuracy can approximately achieve with or when SNR dB. However, with an increase in the number of sample points, the computational complexity also increases.
Fig. 8 shows the demodulation accuracy achieved by the AdaBoost based demodulator versus the number of training signal periods, where the number of sampling points is and SNR = dB. The result shows that the demodulation accuracy initially increases with an increase in the number of training signal periods, and then, it reaches saturation when the number of training signal periods is . It can be observed that, compared with -QAM, -QAM can achieve higher accuracy. Meanwhile, -QAM can provide stable performance with relatively fewer training signal periods. Different modulation models have different requirements with different number of training signals periods. In general, higher orders require longer training signals periods.
Fig. 9 presents the demodulation accuracies of BPSK and -QAM modulation schemes. In this experiment, the AdaBoost based demodulation algorithm was employed, where the number of sampling points is . The demodulation accuracy for all modulation schemes increases with SNR. Meanwhile, the accuracy achieved by the BPSK-modulation scheme is better than the other seven schemes for the same SNR. Furthermore, Fig. 9 also indicates that the demodulation accuracy reduces with an increase of the modulation order.
In Fig. 10, the same modulation schemes, demodulation algorithm, and sampling points are used as in Fig. 9, where the effective capacity of different modulation methods versus SNR are reported. The effective capacity by BPSK, -QAM, and -QAM almost remain unchanged with an increase in SNR. It is found that the modulation order has a considerable positive impact on the performance of the transmission capacity. The performance gap between the low order and the high order modulation is clearer when SNR dB. However, the demodulation accuracy of high order modulation is low, so there is a trade-off between the demodulation accuracy and the effective capacity.
VI Conclusion
In this paper, a flexible end-to-end wireless communications prototype platform was proposed for real physical environments. Then, the first open measured modulation data dataset with eight modulation schemes, i.e., BPSK, -QAM, -QAM, -QAM, -QAM, -QAM, -QAM, and -QAM, were established and accessed online. Furthermore, two DL-based demodulators, i.e., DBN-SVM based demodulator and AdaBoost based demodulator, were proposed. Based on the real dataset, the demodulation performance of the proposed demodulators were tested. Finally, experimental results indicated that the proposed demodulators outperform the DBN based, SVM based, and MLD based demodulators at various scenarios.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] J. G. Proakis, Digital communications , Mc Graw-Hill, New York, 2001.
- 2[2] M. Shafi, A. F. Molisch, P. J. Smith, T. Haustein, P. Zhu, P. D. Silva, F. Tufvesson, A. Benjebbour, and G. Wunder, “5G : A Tutorial Overview of Standards, Trials, Challenges, Deployment, and Practice,” IEEE J. Select. Areas Commun. , vol. 35, no. 6, pp. 1201–1221, Jun. 2017.
- 3[3] Y. Lecun, Y. Bengio, and G. Hinton, “Deep learning.,” Nature , vol. 521, no. 7553, pp. 436, May 2015.
- 4[4] C. Liu, Y. Cao, Y. Luo, G. Chen, V. Vokkarane, and Y. Ma, Deep Food: Deep Learning-Based Food Image Recognition for Computer-Aided Dietary Assessment , Springer, 2016.
- 5[5] T. Zhou, S. Yang, L. Wang, J. Yao, and G. Gui, “Improved cross-label suppression dictionary learning for face recognition,” IEEE Access , vol. 6, pp. 48716–48725, Aug. 2018.
- 6[6] D. Geronimo, J. Serrat, A. M. Lopez, and R. Baldrich, “Traffic sign recognition for computer vision project-based learning,” IEEE Trans. Educ. , vol. 56, no. 3, pp. 364–371, Aug. 2013.
- 7[7] T. Tan, Y. Qian, and K. Yu, “Cluster adaptive training for deep neural network based acoustic model,” IEEE Trans. Audio Speech Lang. Proces. , vol. 24, no. 3, pp. 459–468, Mar. 2015.
- 8[8] Y. Ling, C. Jin, G. Ding, Y. Tu, and J. Sun, “Spectrum prediction based on Taguchi method in deep learning with long short-term memory,” IEEE Access , vol. 6, pp. 45923–45933, Dec. 2018.
