On the Use of Power Amplifier Nonlinearity Quotient to Improve Radio   Frequency Fingerprint Identification in Time-Varying Channels

Lu Yang; Seyit Camtepe; Yansong Gao; Vicky Liu; Dhammika Jayalath

arXiv:2302.13724·eess.SP·November 27, 2023·PIMRC

On the Use of Power Amplifier Nonlinearity Quotient to Improve Radio Frequency Fingerprint Identification in Time-Varying Channels

Lu Yang, Seyit Camtepe, Yansong Gao, Vicky Liu, Dhammika Jayalath

PDF

Open Access

TL;DR

This paper introduces a novel PA nonlinearity quotient and transfer learning approach to enhance radio frequency fingerprint identification robustness against environmental variations, significantly improving device detection and classification accuracy in IoT scenarios.

Contribution

The study formalizes the PA nonlinearity quotient as environment-independent and applies transfer learning to improve RFFI accuracy and reduce resource requirements.

Findings

01

PA nonlinearity quotient is environment-independent

02

Transfer learning improves classification accuracy

03

Accuracy increased by over 33% in indoor and outdoor tests

Abstract

Radio frequency fingerprint identification (RFFI) is a lightweight device authentication technique particularly desirable for power-constrained devices, e.g., the Internet of things (IoT) devices. Similar to biometric fingerprinting, RFFI exploits the intrinsic and unique hardware impairments resulting from manufacturing, such as power amplifier (PA) nonlinearity, to develop methods for device detection and classification. Due to the nature of wireless transmission, received signals are volatile when communication environments change. The resulting radio frequency fingerprints (RFFs) are distorted, leading to low device detection and classification accuracy. We propose a PA nonlinearity quotient and transfer learning classifier to design the environment-robust RFFI method. Firstly, we formalized and demonstrated that the PA nonlinearity quotient is independent of environmental changes.…

Tables3

Table 1. TABLE I: DUT Configurations

Carrier

Frequency

Bandwidth

(

B

)

Transmission

Power (h/l)

Spreading

Factor (

S ​ F

)

Coding

Rate

\bigstrut

915 MHz

62.5 kHz

17/10 dBm

10

4/5 \bigstrut

Table 2. TABLE II: Layers, Parameters, and Activation of the Proposed Classifier

Layer	Dimension	Parameters	Activation
Input	$256 \times 256$	—	—
Convolution, BN	$8 \times (3 \times 3)$	80, 16	ReLU
MaxPooling	$2 \times 2$	—	—
Convolution, BN	$16 \times (3 \times 3)$	1168, 32	ReLU
MaxPooling	$2 \times 2$	—	—
Convolution, BN	$32 \times (3 \times 3)$	4640, 64	ReLU
FullyConnected	20	2304020	SoftMax

Table 3. TABLE III: Device Classification Comparison With Notable Works

Work

Experimental

Environment

No. of

Devices

Training Samples

(Per Device)

Classification

Accuracy

Ours

Indoor

Outdoor

20

200

99.4%

98.2%

[23]

Indoor

30

100

98.4%

[41]

Indoor

54

698

84.6%

[42]

Indoor

7

800

99.0%

Equations17

s (t) = h (τ, t) * f [x (t)] + n (t),

s (t) = h (τ, t) * f [x (t)] + n (t),

S_{p} = S_{p}^{1, 1} S_{p}^{2, 1} ⋮ S_{p}^{W, 1} S_{p}^{1, 2} S_{p}^{2, 2} ⋮ S_{p}^{W, 2} \dots \dots ⋱ \dots S_{p}^{1, M} S_{p}^{2, M} ⋮ S_{p}^{W, M},

S_{p} = S_{p}^{1, 1} S_{p}^{2, 1} ⋮ S_{p}^{W, 1} S_{p}^{1, 2} S_{p}^{2, 2} ⋮ S_{p}^{W, 2} \dots \dots ⋱ \dots S_{p}^{1, M} S_{p}^{2, M} ⋮ S_{p}^{W, M},

S_{p}^{w, m} = n = 0 \sum W - 1 s_{p} [n] g [n - m R] e^{- j 2 π \frac{w}{W} n}

S_{p}^{w, m} = n = 0 \sum W - 1 s_{p} [n] g [n - m R] e^{- j 2 π \frac{w}{W} n}

for w = 1, 2, ..., W and m = 1, 2, ..., M,

M = \frac{K \cdot \frac{2 ^{S F}}{B} \cdot f _{S} - W}{R} + 1,

M = \frac{K \cdot \frac{2 ^{S F}}{B} \cdot f _{S} - W}{R} + 1,

S_{h} = H^{1, 1} F_{h} (X^{1, 1}) H^{2, 1} F_{h} (X^{2, 1}) ⋮ H^{W, 1} F_{h} (X^{W, 1}) H^{1, 2} F_{h} (X^{1, 2}) H^{2, 2} F_{h} (X^{2, 2}) ⋮ H^{W, 2} F_{h} (X^{W, 2}) \dots \dots ⋱ \dots H^{1, M} F_{h} (X^{1, M}) H^{2, M} F_{h} (X^{2, M}) ⋮ H^{W, M} F_{h} (X^{W, M}),

S_{h} = H^{1, 1} F_{h} (X^{1, 1}) H^{2, 1} F_{h} (X^{2, 1}) ⋮ H^{W, 1} F_{h} (X^{W, 1}) H^{1, 2} F_{h} (X^{1, 2}) H^{2, 2} F_{h} (X^{2, 2}) ⋮ H^{W, 2} F_{h} (X^{W, 2}) \dots \dots ⋱ \dots H^{1, M} F_{h} (X^{1, M}) H^{2, M} F_{h} (X^{2, M}) ⋮ H^{W, M} F_{h} (X^{W, M}),

S_{l} = H^{1, M + 1} F_{l} (X^{1, 1}) H^{2, M + 1} F_{l} (X^{2, 1}) ⋮ H^{W, M + 1} F_{l} (X^{W, 1}) H^{1, M + 2} F_{l} (X^{1, 2}) H^{2, M + 2} F_{l} (X^{2, 2}) ⋮ H^{W, M + 2} F_{l} (X^{W, 2}) \dots \dots ⋱ \dots H^{1, 2 M} F_{l} (X^{1, M}) H^{2, 2 M} F_{l} (X^{2, M}) ⋮ H^{W, 2 M} F_{l} (X^{W, M}) .

S_{l} = H^{1, M + 1} F_{l} (X^{1, 1}) H^{2, M + 1} F_{l} (X^{2, 1}) ⋮ H^{W, M + 1} F_{l} (X^{W, 1}) H^{1, M + 2} F_{l} (X^{1, 2}) H^{2, M + 2} F_{l} (X^{2, 2}) ⋮ H^{W, M + 2} F_{l} (X^{W, 2}) \dots \dots ⋱ \dots H^{1, 2 M} F_{l} (X^{1, M}) H^{2, 2 M} F_{l} (X^{2, M}) ⋮ H^{W, 2 M} F_{l} (X^{W, M}) .

Q = S_{h} ./ S_{l} = [\frac{F _{h} ( X ^{1} )}{F _{l} ( X ^{1} )} \frac{F _{h} ( X ^{2} )}{F _{l} ( X ^{2} )} \dots \frac{F _{h} ( X ^{M} )}{F _{l} ( X ^{M} )}],

Q = S_{h} ./ S_{l} = [\frac{F _{h} ( X ^{1} )}{F _{l} ( X ^{1} )} \frac{F _{h} ( X ^{2} )}{F _{l} ( X ^{2} )} \dots \frac{F _{h} ( X ^{M} )}{F _{l} ( X ^{M} )}],

Q = 10 lo g_{10} (∣ Q ∣^{2}) .

Q = 10 lo g_{10} (∣ Q ∣^{2}) .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsWireless Signal Modulation Classification · Full-Duplex Wireless Communications · Radar Systems and Signal Processing

MethodsBalanced Selection

Full text

On the Use of Power Amplifier Nonlinearity Quotient to Improve Radio Frequency Fingerprint Identification in Time-Varying Channels

Lu Yang1, Seyit Camtepe2, Yansong Gao2, Vicky Liu3, and Dhammika Jayalath1

1Faculty of Engineering, Queensland University of Technology, Brisbane, Australia

2Data61, CSIRO, Sydney, Australia

3Faculty of Science, Queensland University of Technology, Brisbane, Australia

Corresponding Author: Lu Yang (Email: [email protected])

Abstract

Radio frequency fingerprint identification (RFFI) is a lightweight device authentication technique particularly desirable for power-constrained devices, e.g., the Internet of things (IoT) devices. Similar to biometric fingerprinting, RFFI exploits the intrinsic and unique hardware impairments resulting from manufacturing, such as power amplifier (PA) nonlinearity, to develop methods for device detection and classification. Due to the nature of wireless transmission, received signals are volatile when communication environments change. The resulting radio frequency fingerprints (RFFs) are distorted, leading to low device detection and classification accuracy. We propose a PA nonlinearity quotient and transfer learning classifier to design the environment-robust RFFI method. Firstly, we formalized and demonstrated that the PA nonlinearity quotient is independent of environmental changes. Secondly, we implemented transfer learning on a base classifier generated by data collected in an anechoic chamber, further improving device authentication and reducing disk and memory storage requirements. Extensive experiments, including indoor and outdoor settings, were carried out using LoRa devices. It is corroborated that the proposed PA nonlinearity quotient and transfer learning classifier significantly improved device detection and device classification accuracy. For example, the classification accuracy was improved by 33.3% and 34.5% under indoor and outdoor settings, respectively, compared to conventional deep learning and spectrogram-based classifiers.

Index Terms:

Internet of things, device authentication, radio frequency fingerprinting identification, power amplifier nonlinearity, transfer learning.

I Introduction

The rapid growth of the Internet of things (IoT) device population has sparked extensive demands on IoT security in recent years. Many security-critical IoT applications need more stringent security support [1]. Device authentication is one of the most important categories, which includes rogue device detection and the classification of registered devices [2]. Traditionally, device authentication is achieved by public-key cryptography (PKC). However, the implemented public key algorithms are not optimal for IoT devices because they are computationally costly. Further, PKC generally requires a certification authority when sharing keys. The authority may not always be available, considering the large volume and wide-area deployment of IoT devices [3].

A lightweight and reliable authentication technique is thus required for IoT security. Radio frequency fingerprint identification (RFFI) is a non-cryptographic authentication technique that attracted much research interest [4, 5, 6, 7]. It exploits the intrinsic features brought by various hardware impairments resulting from imperfect manufacturing processes. The features manifested as slight distortions on transmitted signals. Like the biometric characteristics used for authentication, the subtle features are unique for different devices and hard to duplicate. Therefore, receivers can extract the features from received signals, followed by the verification with the pre-shared feature information for device authentication. The process does not involve computationally costly algorithms; hence, it consumes less energy and is suitable for power-constrained IoT devices.

An RFFI classifier is a machine learning model trained using radio frequency fingerprints (RFFs) for multi-class classification. Specifically, deep learning is leveraged as it minimizes the process of locating transient signal segments [8, 9, 10, 11, 12]. It automatically extracts RFFs from received signals, making it the technique requiring minimal manual selection to train RFFI classifiers. For the network architecture, convolutional neural network (CNN) is mostly implemented for image recognition tasks, which makes it especially suitable for device fingerprinting [13, 14, 15, 16, 17, 18]. Among the feature selection, in-phase and quadrature (IQ) samples [19], FFT results [20, 21], and spectrogram [22] are widely studied. In [22], the spectrogram CNN model was shown to achieve the highest classification accuracy. Therefore, we adopt deep learning and spectrogram-based classifiers to benchmark proposed classifiers.

Due to the nature of wireless communications, RFFs are susceptible to environmental changes. Large-scale fading, multipath fading, and the Doppler effect affect wireless channels and modify received signal characteristics [23, 24, 25]. Traditional RFFs, e.g., spectrogram, extracted from the received signals are distorted and cannot be used for authentication [26, 27, 28]. We propose using a power amplifier (PA) nonlinearity quotient to mitigate the wireless channel effects introduced by environmental changes. The PA nonlinearity quotient is generated by taking division on the frequency domain of two consecutive signals transmitted with different power. The division mitigates the wireless channel effects, and RFFI classifiers are trained to exploit the resulting RFFs.

Implementing environment-robust RFFs is limited when communication environments have many fast-moving objects because multipath fading and the Doppler effect mostly dominate wireless channels. Particularly, fast fading can happen when transmitters have low data rates, i.e., IoT devices. In [29, 28, 23], data augmentation is implemented to alleviate the impact of fast fading by training classifiers under channels with simulated multipath fading and the Doppler effect. However, the simulations had no pre-knowledge of the real deployment environments and significantly increased the required disk and memory storage for training classifiers.

Transfer learning can be implemented to combine RFFs resulting from different wireless channels [30, 31, 32]. Hence, distortions caused by multipath fading and the Doppler effect are acknowledged in device authentication. The required storage for transfer learning is less than data augmentation. Therefore, we implement transfer learning to alleviate the impact of fast fading. Specifically, a base classifier is trained with the original RFFs of the devices under test (DUTs); then, the classifier is retrained with the RFFs collected in real deployment environments.

This paper aims to design and validate an environment-robust RFFI system for IoT device authentication. The approach trains a classifier using the PA nonlinearity quotient. Transfer learning is adopted to alleviate the impact of fast fading and reduce training costs. Extensive experiments, including indoor and outdoor settings, were carried out using LoRa devices. The results show that the proposed PA nonlinearity quotient and transfer learning classifier significantly outperformed conventional deep learning and spectrogram-based classifiers. Our contributions are summarized as follows.

•

We formalized the PA nonlinearity quotient and demonstrated that it is independent of environmental changes. The improvements in rogue device detection and device classification are backed by experimental validation.

•

We developed data collection of real deployment, including indoor and outdoor environments. Further, we implemented transfer learning using the data to alleviate the impact of fast fading. The approach reduced the disk and memory storage requirements for training. The resulting classifiers have pre-knowledge of the real deployment environments compared to the data augmentation approach.

•

We designed an RFFI system that involves the PA nonlinearity quotient and transfer learning. Samples resulting from natural multipath fading and the Doppler effect were implemented to validate the system.

II Power Amplifier Nonlinearity Quotient

The PA is an indispensable component in any wireless device, with the implementation to amplify low-power signals to high-power ones. It is inherently nonlinear [33]. For low-power and narrowband systems, i.e., IoT devices, the PA is regarded as memoryless, meaning the nonlinear output depends only on the input at a particular time. The nonlinearity can be characterized by an amplitude/amplitude (AM/AM) function and an amplitude/phase (AM/PM) function. Several models have been proposed to formulate the functions [33].

Implementing PA nonlinearity for RFFI is widely studied in the literature [34, 35, 36, 37, 38, 39]. However, the implementation is often limited for static or semi-static channels. The RFFI performance drops significantly when communication environments change. We propose the PA nonlinearity quotient to design an environment-robust RFFI.

The signal of a narrowband system that reaches a receiver is given as

[TABLE]

where $x(t)$ is baseband signal, $h(\tau,t)$ is channel impulse response, $f[\cdot]$ denotes the nonlinear effect of hardware impairment at transmission power, and $n(t)$ is additive white Gaussian noise (AWGN). “ $\ast$ ” denotes convolution operation.

When generating the PA nonlinearity quotient, two consecutive signals emitted with high and low transmission power correspondingly are received and developed an element-wise division on the frequency domain. The signal representation on the frequency domain is obtained through the short-time Fourier transform (STFT). The result of the STFT on the received signal is a matrix expressed as

[TABLE]

where $p=\{h,l\}$ denotes high-power and low-power, respectively. The elements in the matrix are given as

[TABLE]

where $s_{p}\left[n\right]$ is the discrete signal received by the receiver with a sampling interval, $g\left[n\right]$ is the window function with length $W$ , and $R$ is hop size. The experiments implement LoRa, hence $M$ is given by LoRa configurations as

[TABLE]

where $K$ is number of LoRa symbols, $SF$ is LoRa spreading factor, $B$ is bandwidth, and $f_{S}$ is sampling frequency. The configurations are discussed in Section III-A. $W$ is 1024 and $R$ is 512. $M$ is calculated to be 319.

The STFT result of the high-power signal is expressed as (5.1), where $X$ denotes the ideal spectrum of the transmitted signal, $H$ denotes the channel frequency response, and $F(\cdot)$ denotes the nonlinear hardware effect at the transmission power in the frequency domain. Only the preamble of the received signal is used to generate the PA nonlinearity quotient. The ideal spectrum of the low-power preamble is the same as the high-power one, i.e., $X^{w,m}=X^{w,M+m}$ . Hence, the STFT result of the consecutive low-power signal is given as (5.2).

By removing the significantly distorted preambles caused by fast-moving objects nearby and implementing transfer learning, we assume intense multipath fading and the Doppler effect are mitigated. Slow fading mostly dominates the wireless channels. Therefore, the channel frequency response does not change significantly during one packet duration, i.e., $H^{w,m}\approx H^{w,M+m}$ . The result of the element-wise division of received signals on the frequency domain ( $\boldsymbol{Q}$ ) is given as

[TABLE]

where “ $./$ ” denotes the element-wise division operation and $\boldsymbol{X^{m}}=[X^{1,m}\quad X^{2,m}\quad\cdots\quad X^{W,m}]^{T}$ . No channel frequency response ( $H$ ) is present in $\boldsymbol{Q}$ . The proposed environment-robust RFFI can be developed exploiting the PA nonlinearity quotient, which is $\boldsymbol{Q}$ in dB scale, expressed as

[TABLE]

III Experiments

III-A Experimental Settings

The experiments implemented 25 Arduino Nano-controlled LoRa SX1276 modules with the same circuit design and specifications as DUTs. 20 DUTs were randomly selected as legitimate devices (DUT: “A” to “T”), and 5 DUTs were selected as rogue devices (DUT: “Attacker 1” to “Attacker 5”). The device configurations are given in Table I. The LoRaWAN protocol supports 125 kHz, 250 kHz, and 500 kHz bandwidths, while LoRa supports bandwidths ranging from 7.8 kHz to 500 kHz. The proposed RSSI system does not focus on specific protocols. Therefore, a bandwidth of 62.5 kHz was used to reduce packet loss and maintain high throughputs. A universal software radio peripheral (USRP) platform with a 1 MS/s sampling frequency ( $f_{S}$ ) was used to collect RF samples. Fig. 1 shows the devices used in the experiments.

The data collection was developed in three environments.

•

Anechoic chamber: the collection of channel effect-free RFFs for training the base classifier required by transfer learning was carried out in the anechoic chamber on the top floor of the QUT GP campus S-block building. DUTs were placed 3 meters away from the USRP platform. The anechoic chamber was designed to absorb multipath signals. Therefore, RF samples collected in the environment can generate the PA nonlinearity quotient without the impact of multipath fading and the Doppler effect.

•

Indoor: DUTs were placed in an office room for the indoor setting. The USRP platform was placed in the adjacent room, and DUT signals traveled through a wall. People were freely walking in the office during the data collection. The environment was considered to have moderate multipath fading and a slight Doppler effect.

•

Outdoor: in the outdoor setting, DUTs were placed 104.5 meters away from the USRP platform, as shown in Fig. 2. Buildings blocked the line of sight, and people freely walked in the environment. The outdoor environment was considered to have more significant multipath fading and the Doppler effect than the indoor environment.

The DUTs transmit packets with alternating high-power and low-power modes, and the USRP platform passively receives the packets in the data collection. More than 2800 packets were collected for each DUT within one hour. Hence, more than 8400 packets were collected for each DUT in all three experimental environments.

III-B Data Preprocessing

The data preprocessing includes synchronization, preamble extraction, normalization, and the PA nonlinearity quotient generation. The packets collected indoors and outdoors are required to go through the distorted preamble removal process before generating the PA nonlinearity quotient.

Synchronization: transmission power does not impact the data rate. Hence, the time-on-air for the DUT packets stays unchanged for the high-power and low-power transmission. The starting points of the packets are marked and used for synchronization to avoid inaccurate preamble extraction. 2. 2.

Preamble extraction: preambles are payload-independent and have no software-defined features such as MAC addresses. Therefore, the intrinsic hardware features in the preamble symbols are the desirable source for RFFI. The preamble length is a flexible configuration for LoRa, with a minimum value of ten symbols. To study the worst-case scenario, we set and extracted ten symbols for one preamble per packet in the experiments. 3. 3.

Normalization: the process normalizes the received signal magnitude to remove the device-specific DC offset by dividing the root mean square. The PA nonlinearity feature is unaffected. 4. 4.

Distortion removal and PA nonlinearity quotient generation: we introduce Algorithm 1 to remove the severely distorted preambles caused by fast-moving objects nearby. The correlation between the high-power and low-power spectrogram should stay the same since PA nonlinearity is only affected by the input power [40]. The distorted preambles can be found by comparing the correlation of the channel-affected spectrogram to the correlation of the anechoic chamber spectrogram. The distortion is considered severe and can be removed if the difference is over a tolerance ( $\theta=0.2$ implemented in experiments). After the distortion removal, an element-wise division on the frequency domain is developed to generate the PA nonlinearity quotient. Fig. 3 shows the collected preamble spectrogram and the PA nonlinearity quotient generated by a DUT.

III-C Analytical Metrics

Device authentication exploiting RFFI involves two essential parts: device classification and rogue device detection. The classification accuracy and receiver operating characteristic (ROC) curve are implemented to evaluate the device classification and rogue device detection performance, respectively.

III-C1 Classification Accuracy

The classification accuracy is defined as the number of correctly classified RFFs divided by the total number of tested RFFs. The results are obtained from the confusion matrix after developing classification tests.

III-C2 ROC Curve

The rogue device detection was studied as binary classification in the experiments. The output values of the softmax function are compared to a threshold. The RFFs associated with the output values smaller than the threshold will be considered unauthorized. Since the threshold is configurable, it is hard to use a detection rate to analyze classifiers’ performance. We adopted the ROC curve in the binary classifier study to overcome this. For each class of a classifier, ROC analysis applies threshold values in [0,1] to calculate the true-positive rate (TPR) and the false-positive rate (FPR) for the outputs generated by each threshold. The area under the ROC curve (AUC) is the integral of a ROC curve with respect to FPR. The value of AUC is in the range of 0 to 1. A larger AUC indicates better classifier performance. In our experiments, a larger AUC indicates that the classifier is more capable of detecting rogue devices. A micro-averaging method is applied to generate the averaged AUC and ROC curves to analyze the rogue device detection for all classes.

IV Classifier Architecture

The architecture of the PA nonlinearity quotient and transfer learning classifier is summarized in Table II. It consists of three convolution layers with 8, 16, and 32 $3\times 3$ filters, respectively. A batch normalization layer and the rectified linear unit (ReLU) activation follow each convolution layer. After the activation, a $2\times 2$ max pooling layer with stride 2 is implemented. The output of the last ReLU activation is fed to a fully connected layer. An output layer with softmax function is implemented last to produce vectors of probabilities of outputs. The PA nonlinearity quotient is resized to $256\times 256$ with 8-bit depth to go to the input layer. Adam optimizer is implemented to reduce the losses. The mini-batch size is 32. The initial training rate is 0.005 and remains unchanged.

Transfer learning retrains a pre-trained classifier on new datasets. In the experiments, the convolution layers of the pre-trained classifier recognize generic RFF patterns. We replaced the fully connected and output layers with new layers. For fine-tuning the transferred classifier, the training rate was configured to 0.0001, and the learning rate factor of the new layers was configured to 20.

V Results and Discussion

V-A Device Classification

The base classifier was trained firstly using complete legitimate device (DUT: “A” to “T”) datasets in the anechoic chamber. Smaller training sets, including 50, 100, 150, and 200 packets, were randomly selected for each legitimate device from the indoor and outdoor datasets to implement the transfer learning. The conventional deep learning and spectrogram-based classifiers were trained as the comparison. The same test sets, including more than 1000 packets per DUT, were implemented to validate the proposed PA nonlinearity quotient and transfer learning classifier and the deep learning and spectrogram-based classifier. No training set packets were used in the test sets.

Fig. 5 shows the device classification results of indoor experiments. The proposed PA nonlinearity quotient and transfer learning classifier outperformed the conventional deep learning and spectrogram-based classifier with an improvement of $33.3\%$ average classification accuracy. More training packets lead to higher classification accuracy. The highest accuracy is $99.4\%$ , with 200 packets retraining the base classifier. The PA nonlinearity quotient improved the average classification accuracy by $19.4\%$ compared to the spectrogram-based classifier.

Fig. 5 shows the classification results of outdoor experiments. The proposed PA nonlinearity quotient and transfer learning classifier outperformed the conventional deep learning and spectrogram-based classifier with an improvement of $34.5\%$ average classification accuracy. The PA nonlinearity quotient improved the average classification accuracy by $10.9\%$ compared to the spectrogram-based classifier.

Table III compares device classification performance among the proposed classifier and recent notable works in literature. The PA nonlinearity quotient and transfer learning classifier achieved high device classification accuracy while requiring fewer training samples and reducing the disk and memory storage requirements.

V-B Rogue Device Detection

The training sets to retrain the base classifier included 100 randomly selected packets per DUT for studying the rogue device detection for the proposed classifier. Deep learning and spectrogram-based classifiers were trained for comparison. The test sets included more than 1000 packets per DUT and more than 1000 packets for each rogue device (DUT: ”Attacker 1” to ”Attacker 5”). No training set packets were used in the test sets.

Fig. 7 shows the ROC curves for the indoor experiments. The proposed PA nonlinearity quotient and transfer learning classifier outperformed the deep learning and spectrogram-based classifier, with an AUC value of 0.992 compared to 0.939. Fig. 7 shows the outdoor experiment results. Similar to the indoor experiments, the proposed classifier improved the AUC significantly. The PA nonlinearity quotient was more robust to environmental changes than the spectrogram, with larger AUC values in the indoor and outdoor experiments.

VI Conclusion

In this paper, we investigated the technique to make RFFI resilient to environmental changes. We proposed the PA nonlinearity quotient and transfer learning classifier that mitigates channel effects to enhance the RFFI implementation for device classification and rogue device detection. Extensive experiments, including indoor and outdoor settings, were developed to evaluate the proposed classifier. The experiment results demonstrated that the proposed classifier significantly improved classification accuracy and rogue device detection for RFFI. The PA nonlinearity quotient outperformed the spectrogram to enhance RFFI in indoor and outdoor settings.

Bibliography42

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] V. Hassija, V. Chamola, V. Saxena, D. Jain, P. Goyal, and B. Sikdar, “A survey on Io T security: Application areas, security threats, and solution architectures,” IEEE Access , vol. 7, pp. 82 721–82 743, Jun. 2019.
2[2] Y. Yang, L. Wu, G. Yin, L. Li, and H. Zhao, “A survey on security and privacy issues in Internet-of-Things,” IEEE Internet Things J. , vol. 4, no. 5, pp. 1250–1258, Oct. 2017.
3[3] Q. Xu, R. Zheng, W. Saad, and Z. Han, “Device fingerprinting in wireless networks: Challenges and opportunities,” IEEE Commun. Surveys Tuts. , vol. 18, no. 1, pp. 94–104, Sep. 2015.
4[4] S. Riyaz, K. Sankhe, S. Ioannidis, and K. Chowdhury, “Deep learning convolutional neural networks for radio identification,” IEEE Commun. Mag. , vol. 56, no. 9, pp. 146–152, Sep. 2018.
5[5] J. Zhang, S. Rajendran, Z. Sun, R. Woods, and L. Hanzo, “Physical layer security for the Internet of things: Authentication and key generation,” IEEE Wireless Commun. , vol. 26, no. 5, pp. 92–98, May 2019.
6[6] K. Sankhe, M. Belgiovine, F. Zhou, L. Angioloni, F. Restuccia, S. D’Oro, T. Melodia, S. Ioannidis, and K. Chowdhury, “No radio left behind: Radio fingerprinting through deep learning of physical-layer hardware impairments,” IEEE Trans. Cognit. Commun. Netw. , vol. 6, no. 1, pp. 165–178, Mar. 2020.
7[7] R. Xie, W. Xu, Y. Chen, J. Yu, A. Hu, D. W. K. Ng, and A. L. Swindlehurst, “A generalizable model-and-data driven approach for open-set RFF authentication,” IEEE Trans. Inf. Forensics Security , vol. 16, pp. 4435–4450, Aug. 2021.
8[8] K. Merchant, S. Revay, G. Stantchev, and B. Nousain, “Deep learning for RF device fingerprinting in cognitive communication networks,” IEEE J. Sel. Topics Signal Process. , vol. 12, no. 1, pp. 160–167, Feb. 2018.