Supervised Machine Learning for Signals Having RRC Shaped Pulses
Mohammad Bari, Hussain Taher, Syed Saad Sherazi, Milos Doroslovacki

TL;DR
This paper compares supervised machine learning methods like SVMs, neural networks, and logistic regression for modulation recognition of signals with RRC-shaped pulses under extreme noisy conditions, using simple statistical features.
Contribution
It introduces a robust feature set based on sample mean and variance for distinguishing FSK and QAM-PSK signals in challenging environments.
Findings
Support vector machines outperform other classifiers in noisy conditions.
Simple statistical features effectively differentiate modulation types.
Robust classification achieved despite joint impairments and noise.
Abstract
Classification performances of the supervised machine learning techniques such as support vector machines, neural networks and logistic regression are compared for modulation recognition purposes. The simple and robust features are used to distinguish continuous-phase FSK from QAM-PSK signals. Signals having root-raised-cosine shaped pulses are simulated in extreme noisy conditions having joint impurities of block fading, lack of symbol and sampling synchronization, carrier offset, and additive white Gaussian noise. The features are based on sample mean and sample variance of the imaginary part of the product of two consecutive complex signal values.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLogistic Regression
Supervised Machine Learning for Signals Having RRC Shaped Pulses
Mohammad Bari
Hussain Taher
Syed Saad Sherazi
Miloš Doroslovački
Abstract
Classification performances of the supervised machine learning techniques such as support vector machines, neural networks and logistic regression are compared for modulation recognition purposes. The simple and robust features are used to distinguish continuous-phase FSK from QAM-PSK signals. Signals having root-raised-cosine shaped pulses are simulated in extreme noisy conditions having joint impurities of block fading, lack of symbol and sampling synchronization, carrier offset, and additive white Gaussian noise. The features are based on sample mean and sample variance of the imaginary part of the product of two consecutive complex signal values.
Index Terms:
Machine learning, block fading, support vector machines, logistic regression, neural networks.
I Introduction
Signal separation (SS) is used in applications such as interference identification, electronic warfare, enforcement of civilian spectrum compliance, radar, intelligent modems, cognitive and software defined radios. Most of the published work involving the above mentioned communication systems assume that the signals and their parameters are known. For instance , the works in [1] and [2] do not focus on the SS part. Understandably the focus is on the application itself. This paper studies the SS in noisy conditions that serve as a prerequisite for several applications including but not limited to those mentioned above.
The features in this work, discussed for several scenarios in [3]-[7], separate the continuous phase frequency shift keying (CPFSK) modulation from linear modulations of phase shift keying (PSK) and quadrature amplitude modulation (QAM). Root raised cosine (RRC) shaped pulses are used to generate the signals. Modulations are simulated to have the joint presence of block fading, lack of symbol and sampling synchronization, carrier offset, additive white Gaussian noise (AWGN) and unknown pulse shape (unknown roll-off). None of the mentioned works discuss the effects of block fading on the performance of SS. Also, support vector machines (SVM) are the only pattern recognition-based approach being employed in the above mentioned works. In this paper, block fading is introduced to make the conditions more challenging. Furthermore, this work compares the classification performances of the SVM, logistic regression (LR) and neural networks (NN) for the same set of features. The order of modulation of the CPFSK signals (binary, 4-ary and 8-ary CPFSK) is identified in [8]. In [9], frequency modulated signals are separated from the linearly modulated signals.
II Signal Model
We model the complex baseband continuous-time received signal as [4]
[TABLE]
where is the transmitted-signal, is the initial phase uniformly distributed over , is the carrier offset, models the fading. is a Rayleigh random variable. is uniformly distributed over and is independent of . is zero-mean complex noise and is the time delay. Furthermore, it is assumed that the bandwidth of the receiver’s filter is in general larger than the bandwidth of the transmitted signal and the power spectral density of does not vary within the filter’s bandwidth. In this work the center of the signal’s spectrum is translated to be around a certain desired normalized frequency, say [math] or .
For linear modulations the transmitted-signal is
[TABLE]
where are the amplitude and phase of the transmitted symbol, is the pulse shape function and is the symbol period. In the case of FSK modulation
[TABLE]
where defines instantaneous frequency pulse shape and for BFSK. Note that (3) models continuous phase FSK, which is of higher practical interest than non-continuous phase FSK used in [10].
Sampling period, , is assumed to be . Symbol period is given by
[TABLE]
where is the oversampling and is uniformly distributed in . Finally the complex baseband discrete-time received signal is
[TABLE]
where , , and is complex circular AWGN having zero mean and unit variance. Note that where is the integer part and is the fractional part of the time delay measured in sampling periods as time units.
III The Three Features
Let
[TABLE]
where * is the complex conjugate operator. We assume that , i.e., fading is skipped for the sake of simplicity. for noiseless QAM-PSK signals is
[TABLE]
where Similarly, applying on noiseless FSK signals yields
[TABLE]
Let us consider the means of imaginary part of in (7) and (8) for linear and BFSK modulations, respectively. For equiprobable constellation points of each modulation, mean of imaginary part of for 16-QAM, BPSK, 4-PSK and 8-PSK signals is given by
[TABLE]
where 16-QAM has constellation points , BPSK’s phases , 4-PSK’s phases , and 8-PSK’s phases . Constellation points for 16-QAM, BPSK, 4-PSK and 8-PSK are chosen such that the average power is unity. Mean of imaginary part of in (8) for BFSK signal is
[TABLE]
where
[TABLE]
Next, let us consider the variances of imaginary part of in (7) for QAM and PSK modulations, and in (8) for BFSK modulation. The variance for BPSK is
[TABLE]
For QAM, 4-PSK and 8-PSK modulations, whose signal constellations are invariant to rotation, the variance is
[TABLE]
The variance of in (8) for BFSK modulation is
[TABLE]
IV Supervised Learning Techniques
In this work, classification performances of the SVM, LR and NN are compared for the features introduced in [3]. The features are
sample mean of for the signal obtained by frequency downconversion to , 2. 2.
sample variance of for the signal obtained by frequency downconversion to 0, and 3. 3.
sample variance of for the signal obtained by frequency downconversion to .
The features based on for the signal obtained by frequency downconversion to can be seen as based on for the signal obtained by frequency downconversion to 0.
IV-A SVM
SVM are initially introduced in [11]. The current soft margin form of SVM is presented in [12]. As in any pattern recognition-based approach, SVM create a classification model for the features of the known set of training realizations. The classification model can be linear as well as non-linear. This work uses the linear kernel because the number of features is small and a simple linear decision boundary suffices to separate the signals. For higher dimensional feature space, a more complicated decision boundary, a non-linear one, should be used to avoid the underfitting. It is desirable to have the separation between the features of two classes as wide as possible. The features of the new realizations are then predicted to belong to the either class depending upon which side of the decision boundary they lie. A detailed tutorial can be found in [13].
IV-B LR
For training examples, LR is the problem of maximizing the following log-likelihood
[TABLE]
where is the training feature vector example. In this work, is a 3-dimensional feature vector , is the target value. and
[TABLE]
where is dimension of the feature vector .
IV-C NN
In this work, the input layer has three nodes, that is, the dimension of the feature vector is 3. There is only one hidden layer having 10 nodes. Since this work discusses the binary classification problem, the output layer has 1 node only.
V Simulations and Discussion
The simulation experiments are presented illustrating the classification performance of the SVM, LR and NN for the signals having RRC shaped pulses. The performance is measured by the accuracy, which is total number of correct classifications divided by total number of signals.
For CPFSK signals, the modulation index is defined as
[TABLE]
where is the maximum frequency deviation. Using (4) and (17) we get
[TABLE]
where . For BFSK where is the amplitude of the instantaneous frequency pulse in (LABEL:e3a). In this work, FSK signals have carrier frequencies , where is uniformly distributed in . The value of is inversely proportional to the channel’s spectral efficiency. Therefore CPFSK signals having are more useful than those where . Also, SS for is the more challenging scenario than that of Because of the presence of block fading, the SS performance for different values of are very similar. Therefore, only the results for are presented in this work.
There are 10000 signals for each modulation (BFSK, 4-FSK, 8-FSK, BPSK, 4-PSK, 8-PSK and 16-QAM). Therefore the total number of modulated signals is 70000. There are 600 symbols in one realization and the oversampling factor . Carrier offset, , is uniformly distributed in , , where Roll-off of the RRC pulses is in . Both and roll-off of the RRC pulses are fixed for a realization and they vary independently from realization to realization following uniform distributions. and are both uniformly distributed in [0,1), which means that and time delay are non-integer multiples of sampling period. This results in asynchronicity between sampling instants and symbol period. and remain unchanged for a particular realization. is uniformly distributed in . For block fading, is a constant for a realization and varies independently for each realization. has unit mean square value, that is, and is uniformly distributed in .
The signals are simulated to have the joint presence of block fading, unknown roll-off, lack of symbol and sampling synchronization, carrier offset and AWGN.
Figure 1 represents the accuracies of the SVM, LR and NN. For each classifier, the number of training realizations is 7000 (1000 realizations for each modulation) for a particular value of SNR. For this large number of training realizations, NN performs the best, especially for smaller values of SNR. LR performs marginally better than the SVM.
Figures 2, 3 and 4 represent the performances of the SVM, LR and NN, respectively. The performances are shown for different number of training realizations. The performance of the particular classifier do not seem to improve when the number of training realizations is increased from 1000 to 2000 for each modulation. Compared to NN, both SVM and LR perform consistently better with the increasing SNRs for the small number of training (50). The accuracies of the SVM and LR for 50 realizations are comparable to those of 1000 and 2000 training realizations. NN, on the other hand, performs poorly for 50 realizations as compared to its performances for 1000 and 2000 training realizations.
Figure 5 represents the accuracies for a range of training realizations for a fixed value of SNR = 5 dB. It can be seen that the NN performs the best for a larger number of training realizations and LR performs marginally better than the SVM.
VI Conclusions
The features are classified by SVM, LR and NN, trained for different number of realizations. Each classifier performs similarly for training of 1000 and 2000 realizations. For a low number of 50 training realizations, NN performs poorly. Both SVM and LR perform significantly better than the NN. For a larger number of 1000 or 2000 training realizations, NN performs slightly better than the others for SNRs less than 10 dB. The performances become identical after SNR of 10 dB. If more training realizations are available then NN should be used because of its superior performance for lower values of SNR. If fewer training realizations are available then because of the slightly better performance, especially for lower values of SNR, LR should be preferred over SVM.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Ewaisha and C. Tepedelenlioǧlu, “Throughput maximization in multi-channel cognitive radio systems with delay constraints,” in 2013 Asilomar Conference on Signals, Systems and Computers , Nov 2013, pp. 1463–1467.
- 2[2] ——, “Dynamic scheduling for delay guarantees for heterogeneous cognitive radio users,” in 2015 49th Asilomar Conference on Signals, Systems and Computers , Nov 2015, pp. 169–173.
- 3[3] M. Bari and M. Doroslovački, “Simple features for separating CPFSK from QAM and PSK modulations,” IEEE Signal Processing Letters , vol. 22, no. 5, pp. 613–617, May 2015.
- 4[4] ——, “Separation of signals consisting of amplitude and instantaneous frequency RRC pulses using SNR uniform training,” in Proc. 49th Annual Asilomar Conference on Signals, Systems, and Computers , Pacific Grove, CA, USA, Nov. 8-11 2015.
- 5[5] ——, “Distinguishing BFSK from QAM and PSK by sampling once per symbol,” in Proc. 48th Annual Asilomar Conference on Signals, Systems, and Computers , Pacific Grove, CA, USA, Nov. 2-5 2014.
- 6[6] ——, “Distinguishing CPFSK from QAM and PSK modulations,” Circuits, Systems, and Signal Processing , pp. 1–21, July 2015.
- 7[7] ——, “Quickness of the instantaneous frequency based classifier distinguishing BFSK from QAM and PSK modulations,” in Proc. 47th Annual Asilomar Conference on Signals, Systems, and Computers , Pacific Grove, CA, USA, Nov. 3-6 2013, pp. 836–840.
- 8[8] ——, “Order recognition of continuous-phase FSK,” in Proc. 49th Annual Asilomar Conference on Signals, Systems, and Computers , Pacific Grove, CA, USA, Nov. 8-11 2015.
