Impulsive Noise Detection in OFDM-based Systems: A Deep Learning Perspective
Reza Barazideh, Solmaz Niknam, and Balasubramaniam Natarajan

TL;DR
This paper introduces a deep learning-based two-stage method for impulsive noise detection and mitigation in OFDM systems, significantly improving bit error rate performance over traditional threshold-based techniques.
Contribution
It presents a novel deep neural network approach for impulsive noise detection in OFDM systems, demonstrating robustness and superior performance compared to classic methods.
Findings
DNN-based approach outperforms traditional blanking and clipping methods.
The method is robust to model mismatch and bursty impulsive environments.
Simulation results confirm improved bit error rate performance.
Abstract
Efficient removal of impulsive noise (IN) from received signal is essential in many communication applications. In this paper, we propose a two stage IN mitigation approach for orthogonal frequency-division multiplexing (OFDM)-based communication systems. In the first stage, a deep neural network (DNN) is used to detect the instances of impulsivity. Then, the detected IN is blanked in the suppression stage to alleviate the harmful effects of outliers. Simulation results demonstrate the superior bit error rate (BER) performance of this approach relative to classic approaches such as blanking and clipping that use threshold to detect the IN. We demonstrate the robustness of the DNN-based approach under (i) mismatch between IN models considered for training and testing, and (ii) bursty impulsive environment when the receiver is empowered with interleaving techniques.
| Parameters | Values |
|---|---|
| Bandwidth () | 6 kHz |
| No. of Subcarriers () | 1024 |
| Symbol Duration () | 170.7 ms |
| Modulation Scheme | QPSK |
| Channel Length () | 10 |
| Convolution Code Rate () | 1/2 |
| Code Constraint Length | 7 |
| Generator Polynomial | [171,133] |
| Learning Rate () | 0.01 |
| Regularization Hyper Parameter () | 0.1 |
| No. of Samples () | 5 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPower Line Communications and Noise · Electromagnetic Compatibility and Noise Suppression · PAPR reduction in OFDM
Impulsive Noise Detection in OFDM-based Systems: A Deep Learning Perspective
Reza Barazideh*†, Solmaz Niknam†, and Balasubramaniam Natarajan†*
† Department of Electrical and Computer Engineering
Kansas State University, Manhattan, KS, USA.
Email:{rezabarazideh, slmzniknam, bala}@ksu.edu
Abstract
Efficient removal of impulsive noise (IN) from received signal is essential in many communication applications. In this paper, we propose a two stage IN mitigation approach for orthogonal frequency-division multiplexing (OFDM)-based communication systems. In the first stage, a deep neural network (DNN) is used to detect the instances of impulsivity. Then, the detected IN is blanked in the suppression stage to alleviate the harmful effects of outliers. Simulation results demonstrate the superior bit error rate (BER) performance of this approach relative to classic approaches such as blanking and clipping that use threshold to detect the IN. We demonstrate the robustness of the DNN-based approach under (i) mismatch between IN models considered for training and testing, and (ii) bursty impulsive environment when the receiver is empowered with interleaving techniques.
Index Terms:
Impulsive noise (IN), machine learning, deep neural network (DNN), orthogonal frequency-division multiplexing (OFDM).
I Introduction
Impulsive noise (IN) can significantly degrade the performance of any communication system. Although orthogonal frequency-division multiplexing (OFDM) is inherently more resistant to IN than single carrier modulation, system performance can still degrade if IN power exceeds a certain threshold and its effect gets spread over all subcarriers [1]. Many techniques have been explored in prior efforts to mitigate the effect of IN. For example, the high amplitude and the short duration of IN are considered as the main features for threshold based IN mitigation approaches. Memoryless nonlinear approaches such as clipping [2], blanking [3], and multiple-threshold blanking/clipping [4] are the most common methods in this category. In [5], a threshold optimization based on Neyman-Pearson criterion is proposed and an analytical equation for the quasi-optimal blanking and clipping thresholds is provided in [6]. Authors in [7, 8, 9, 10] take advantage of analog domain processing where the impulsive noise is still broadband and distinguishable. Note that, determining thresholds in analog domain techniques is not trivial. The performance of threshold based nonlinear approaches is highly sensitive to the selected thresholds and as shown in [11], the performance of all these methods degrades dramatically in severe impulsive environment.
Machine learning methods such as deep learning are becoming popular in growing number of applications in signal and image processing [12, 13], and resource allocation in wireless networks [14, 15]. If appropriate network structures and processing strategies are employed, deep neural network (DNN) may be used as powerful tools for efficient detection of impulse noise because of their ability to learn from examples and capability to account for uncertainty that is common in the most communication applications. Additionally, in classical outlier detection approaches, determining the optimum threshold is the main challenge as this threshold will vary in response to channel conditions and model mismatches. Lastly, the high peak-to average-power-ratio (PAPR) of OFDM signals can also degrade the performance of the classical methods. As always, there is a compromise between detection and false alarm probability in the traditional threshold based methods.
To overcome the aforementioned drawbacks, we propose a machine learning based IN suppression strategy for an OFDM-based communication system. The proposed IN mitigation approach comprises of two stages: (i) IN detection and (ii) IN suppression. In the first stage, a DNN is used to detect the IN corrupted signal instances. Then, the detected IN can be either blanked or clipped in the suppression stage to alleviate the harmful effects of outliers. The proposed DNN-based IN detection approach can be used in conjunction with any IN mitigation strategy as the operation of the detector is completely independent of the noise removal operator. The proposed DNN consists of multiple layers (input, hidden, output) with nodes in a fully connected structure that maps input data into appropriate outputs. Each node in the hidden layers has a nonlinear activation function which helps to distinguish data that are not linearly separable. Here, the DNN uses the current sample value, median deviations filter output [16], and Rank-Ordered Absolute Differences (ROAD) statistic [17] as the inputs to determine if the current sample is corrupted by IN or not. Bit error rate (BER) performance in an OFDM-based communication system is used to evaluate and compare the capability of the proposed DNN-based IN mitigation approach with other conventional approaches such as blanking and clipping. The robustness of the proposed approach is highlighted by testing the performance with IN model different from the model used for training. In addition, we evaluate the robustness of our method in bursty IN when the receiver is accompanied by time domain interleaving techniques. Simulation results show that the DNN-based approach offers up to 2 dB gains relative to blanking and clipping at BER .
The remainder of this paper is organized as follows. Section II describes the system and noise models. Section III presents the structure of the proposed DNN and its input features. The proposed algorithm for IN mitigation is detailed in Section IV. The performance of the IN detector is analyzed in Section V and finally conclusions are drawn in Section VI.
II System Model
Consider the OFDM system shown in Fig. 1. At the transmitter, information bits are channel coded and then the encoded bits are interleaved. Subsequently, the interleaved data is modulated and then passed through an inverse discrete Fourier transform (IDFT) module to generate OFDM symbols over orthogonal subcarriers. In general, an OFDM symbol can be constructed with non-data subcarriers and data subcariers. The non-data subcarriers are either pilots for channel estimation and synchronization, or nulled for spectral shaping and ICI reduction. Let the nonoverlapping sets of data, pilot, and null subcarriers be defined as , , and , respectively. Therefore, after digital-to-analog conversion the transmitted signal envelope in the time domain can be expressed as
[TABLE]
where represents the set of active subcarriers; is the modulated symbol on the subcarrier; and is the OFDM symbol duration.
The channel can be modeled as a linear time-varying system described by the channel impulse response
[TABLE]
where is the length of the channel impulse response; and are the amplitude and the delay of the multipath component, respectively. Therefore, the received signal after down conversion, analog-to-digital conversion, guard interval removing, and synchronization can be expressed as
[TABLE]
where ; is the mixture of additive white Gaussian noise (AWGN) and IN .
Here it is assumed that the noise samples are uncorrelated and their distribution can be expressed in terms of multi-component mixture-Gaussian model [11]. Corresponding to this model, the probability density function (PDF) of the noise samples is obtained as
[TABLE]
where is the PDF of the complex Gaussian variable with zero-mean and variance , and and are the model parameters such that . The noise model (4) can support two commonly used IN models. The first IN model is a two component mixture-Gaussian noise model or Bernoulli Gaussian (BG) noise model [1] with model parameters corresponding to
[TABLE]
Here is the probability of the incoming impulse noise, is the variance of AWGN component, and presents the variance of the IN. The expression in (4) can also be used to characterize a Middleton Class A (MCA) IN model [18] with the following parameters
[TABLE]
where is the noise variance of , is the impulsiveness index designed as the product of the mean number of impulses per time unit and the mean length of an impulse (in time units), and denotes the background-to-IN power ratio [18]. The noise model in (4) is used to train the proposed DNN. In order to investigate the system performance when there is a model mismatch between training and testing, we also consider Symmetric Alpha Stable (SS) IN which can be expressed as [19]
[TABLE]
where denotes the stability parameter that sets the degree of the impulsiveness of the distribution; is the location parameter; is called the skewness parameter and is a measure of asymmetry ( for SS distribution); represents the scale parameter which is a measure of the width of the distribution.
III Deep Neural Network Design
In order to deal with IN, a DNN is exploited to find the instances of impulsivity. DNN is a black-box approach that can be used to model any nonlinear system if properly trained. In this section, the structure of DNN is introduced and then the input features are presented.
III-A DNN Structure
As shown in Fig. 2, the considered neural network consists of two hidden layers with and hidden neurons in each layer, respectively. Typically, there is no analytical method to choose the number of layers and neurons, and hence they are determined experimentally on a trial and error basis. Here, represents the input vector consisting of three features (as discussed in the next subsection) and denotes the output of the DNN. There is only one node in the output layer, which generate a binary sequence of zeros and ones. Note that the soft outputs of DNN will be rounded off to a [math] or . An output indicates that the received sample is corrupted by IN and output [math] implies that the received sample is uncorrupted. According to Fig. 2, the relation between layers can be expressed as
[TABLE]
where , , and are the parameter matrix, bias vector, and activation function of layer that will be applied to the output of the previous layer. The activation function is a nonlinear function in general, but can also be designed to retain linearity in the transformation process. In this paper, the Rectified linear unit (ReLU) function is used for the hidden layers and a Sigmoid function is used in the output layer. The ReLU and Sigmoid functions are expressed as
[TABLE]
[TABLE]
Loss or cost function is a function that returns the loss or penalty associated with a predicted value when the true value is over the entire training set. This loss function value decreases when the difference between the predicted value and the correct value decreases. The loss function that is used in this work corresponds to
[TABLE]
where is the number of training samples; represents the number of neurons in layer ; and denotes the regularization hyper parameter that is used to prevent over-fitting in the training phase. The DNN aims to determine the weights W and the bias vector b that minimize the loss function, i.e.,
[TABLE]
The proposed DNN is trained using the back-propagation algorithm along with Adam optimization algorithm [20]. The Adam optimization is an extension to stochastic gradient descent and has recently seen broader adoption in deep learning applications. Adam computes adaptive learning rates for each parameter at time instant . According to the Adam algorithm, the update rule for each parameter in layer is given by
[TABLE]
Here, is learning rate hyper parameter and
[TABLE]
[TABLE]
where the proposed default values are , , , and the initial value for and are randomly chosen.
III-B DNN Input Features
Feature extraction is one of the most important aspects of machine learning because it turns raw data into information that is suitable for inferencing. Feature extraction eliminates the redundancy present in many types of measured data, facilitating generalization which is critical to avoiding over-fitting during the learning phase. According to Fig. 2, the input layer has three nodes which are (i) the current sample value, (ii) Rank-Ordered Absolute Differences (ROAD) statistic, and (iii) median deviations filter output. In the following we briefly introduce the ROAD and median deviation features.
III-B1 ROAD Value
The ROAD value is an efficient statistic for distinguishing between corrupted and uncorrupted samples as its value is high for noisy samples and low for uncorrupted samples [17]. In general, ROAD factor is widely used in image processing for two dimensional (2D) signals. Here, we compute the ROAD factor for a one dimensional received signal as follows:
i. The absolute difference between the centre sample and the remaining samples of a vector is calculated and denoted by which consists of elements:
[TABLE]
ii. Sort values in increasing order:
[TABLE]
iii. The ROAD factor is calculated by summing up the first values of :
[TABLE]
III-B2 Median Deviations Filter
The median-deviations filter to obtain can be expressed as
[TABLE]
where the median filter used in (19) is a standard median filter which operates on a moving window of samples.
IV Impulsive Noise Mitigation
After the proposed DNN determines if a received sample is contaminated with IN or not, a simple memoryless nonlinear preprocessor such as blanking can be used to alleviate the effect of IN. Therefore, the output of blanking nonlinearity can be expressed as
[TABLE]
where is the output of the DNN. It is worth mentioning that one can use other nonlinear preprocessors proposed in the literature to suppress the impact of IN. This extension is straightforward and is not the main focus of this paper. After IN mitigation a discrete Fourier transform (DFT) module is used to transform the time domain signal to the frequency domain. The DFT module is followed by frequency domain equalization that depends on channel estimation which can be performed based on pilot subcarriers. Viterbi soft decoding is used to decode the demodulated signal and then detection is performed based on the modulation scheme used.
V Simulation results
In this section, an OFDM-based communication system with QPSK modulation in the presence of channel fading, channel coding, and IN is studied. The BER performance is used to compare the proposed DNN-based IN mitigation with other conventional approaches such as blanking and clipping. Since the distribution of the received OFDM signal in case of no IN can be considered as Gaussian, the threshold value for blanking and clipping in all scenarios is obtained based on the approach provided in [5].
We set and as the number of neurons in the first and the second hidden layers, respectively. With three input features and according to Fig. 2, is matrix and is bias vector that connects the input layer to the first hidden layer. After applying the activation function , the matrix with size and the bias vector with size will connect the first hidden layer to the second hidden layer. Finally, is matrix and is a bias that connects the second hidden layer to the output layer. Since the standard gradient descent from random initialization performs poorly with DNN, the initial values for all parameters is chosen based on Xavier initializer [21]. Here, the considered DNN is trained based on the signal model in (3) and noise model in (4). Specifically, the training set consists of 1000 OFDM symbols with a range of and SIR that span the operating regions of interested. The samples with different and SIR values in the training data set is randomly shuffled to remove any trend that may exist.
For a quick reference, the simulation parameters for the considered coded OFDM system in fading channel are listed in Table I. A total of 1024 subcarriers are used with 672 carrying data, 256 pilot, and 96 null subcarriers. Channel estimation is done based on pilot subcarriers which are equally spaced between 1024 subcarriers. A 10-path fading channel is considered with path arrival times following a Poisson distribution with mean 1 . The path amplitudes are Rayleigh distributed with exponentially decreasing average power.
The BER performance of the proposed DNN-based IN mitigation approach under two different test settings (i) BG noise with SIR = 0 dB, and (ii) MCA with = 0.2 and =10 are shown in Fig. 3(a) and Fig. 3(b), respectively. As expected the BER performance will degrade with increase in the frequency of IN occurrence. Fig. 4 compares the BER performance of the DNN with blanking (BLN) and clipping (CLP) for different IN models in various levels of impulsivity. From Fig. 4, it is evident that DNN outperforms both blanking and clipping in all scenarios of both BG and MCA noise models with gains close to 2 dB at BER of . Fig. 4(b) shows that at high SINR (signal to impulsive plus thermal noise ratio), blanking and clipping are very vulnerable as the level of peakedness decreases and it is difficult to find a proper threshold to distinguish between desired and contaminated signals. On the other hand, a well trained DNN can handle the IN detection process even when the signal and IN peakedness is low. Although, the performance loss of DNN with increase in the frequency of IN occurrence is noticeable, it still outperforms other approaches in all scenarios.
Fig. 5 illustrates the robustness of the proposed DNN approach under IN model mismatch. Although the proposed DNN is trained based on the noise model in (4), the DNN-based method is the most robust technique relative to blanking and clipping in SS noise model. The performance degradation in blanking and clipping comes from the fact that the threshold calculation is performed based on Gaussian mixture assumption for the received signal which does not hold in this scenario. Fig. 6 also investigates the BER performance of the considered DNN-based method in bursty IN environment when a time domain interleaver is included in the receiver. In Fig. 6, the parameter Num denotes the number of consecutive contaminated samples by IN. As shown in Fig. 6, the DNN is able to find the IN instances while the level of burstiness can be alleviated by time domain interleaving. From Fig. 6 it is obvious that the best performance is achieved when the duration of IN is short.
VI Conclusions
In this work, a deep neural network (DNN) is proposed to determine if a received sample is contaminated with impulsive noise (IN) or not in an OFDM-based communication system. The Rank-Ordered Absolute Differences (ROAD) along with median deviations filter is used as input features for the DNN. In the next stage, a nonlinear preprocessor such as blanking is used to suppress the effect of IN in corrupted samples. Simulation results show that the DNN-based approach offers significant improvement in the BER performance in the presence of strong impulsive component. Moreover, the DNN-based IN mitigation outperforms other conventional threshold-based outlier mitigation methods such as blanking and clipping with providing lower BER in IN environments. We also show that DNN-based approach is robust to IN model mismatches and can effectively deal with bursty IN when the receiver includes time domain interleaving. To extend this work one can exploit reinforcement learning to accomplish the impulsive noise mitigation.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Ghosh, “Analysis of the effect of impulse noise on multicarrier and single carrier QAM systems,” IEEE Trans. Commun. , vol. 44, no. 2, pp. 145–147, Feb. 1996.
- 2[2] D.-F. Tseng et al. , “Robust clipping for OFDM transmissions over memoryless impulsive noise channels,” IEEE Commun. Lett. , vol. 16, no. 7, pp. 1110–1113, Jul. 2012.
- 3[3] C.-H. Yih, “Iterative interference cancellation for OFDM signals with blanking nonlinearity in impulsive noise channels,” IEEE Signal Process. Lett. , vol. 19, no. 3, pp. 147–150, Mar. 2012.
- 4[4] N. Rozic, P. Banelli, D. Begusic, and J. Radic, “Multiple-threshold estimators for impulsive noise suppression in multicarrier communications,” IEEE Trans. Signal Process. , vol. 66, no. 6, pp. 1619–1633, Mar. 2018.
- 5[5] G. Ndo, P. Siohan, and M. H. Hamon, “Adaptive noise mitigation in impulsive environment: Application to power-line communications,” IEEE Trans. Power Del. , vol. 25, no. 2, pp. 647–656, Apr. 2010.
- 6[6] H. Oh and H. Nam, “Design and performance analysis of nonlinearity preprocessors in an impulsive noise environment,” IEEE Trans. Veh. Technol. , vol. 66, no. 1, pp. 364–376, 1 2017.
- 7[7] R. Barazideh, B. Natarajan, A. V. Nikitin, and R. L. Davidchack, “Performance of analog nonlinear filtering for impulsive noise mitigation in OFDM-based PLC systems,” in IEEE Latin-American Conf. Commun. , Nov. 2017, pp. 1–6.
- 8[8] R. Barazideh, A. V. Nikitin, and B. Natarajan, “Practical implementation of adaptive analog nonlinear filtering for impulsive noise mitigation,” in IEEE Int. Conf. on Commun. (ICC) , May 2018, pp. 1–7.
